DECODING AS A FUNCTION OF THE NUMBER OF
BITS PER CHUNK¹


BENNET B. MURDOCK, Jr. 


University of Toronto 


Miller's (1956) coding principle stresses the importance of maximizing
the number of bits per chunk. As a test, 2 experiments on decoding
were conducted; the source message consisted of the 1st 13 letters of
the alphabet, and it was encoded with a minimum-redundancy Huffman
(1952) code. Exp. I varied the size of the code alphabet (binary,
ternary, quaternary, or septenary) while Exp. II varied the S-R
compatibility (Normal, Reversed, or Random mapping of septenary
code words and alphabetic letters); in both cases S had to decode the
message back into its original form. Mean number of letters decoded
per trial increased with size of code alphabet and showed some S-R
compatibility effects. Miller's coding principle was clearly supported,
and evidence was adduced to suggest that it need not contravene Hick's
(1952) Law.


By now there is considerable evi- 
dence that short-term memory is lim- 
ited more by the number of items than 
by their informational value. This 
principle, the distinction between “bits” 
and “chunks,” and supporting evidence 
was first given by Miller (1956). Sub- 
sequently Tulving and Patkau (1962) 
demonstrated that, in free-recall verbal 
learning, as order of approximation to 
English increased, the number of 
“adopted chunks” stayed constant but 
the number of bits per chunk increased. 
Recall of categorized word lists as
studied by Cohen (1963a, 1963b) suggested
the same phenomenon: “categorized
word lists are handled in immediate
memory as so many chunks
of information rather than as so many
individual words [1963b, p. 234].”
Implications of the chunking principle
have been developed in some detail by
Mandler (1967), and the recent models
of short-term memory (e.g., Atkinson
& Shiffrin, 1967; Bower, 1964; Waugh
& Norman, 1965) generally assume a
fixed number of storage registers. The
limitation is on the number of registers,
not their contents.

¹ This research was performed while the
writer was at the University of Vermont
and it was supported under Contract Nonr-3219(02)
from the Office of Naval Research.
I am indebted to Dina Oldham for
testing Ss and analyzing data.

The experiments to be reported here 
were an attempt to test this principle 
under conditions more like those of a 
skilled performance task than the typical
short-term memory experiment.
The task was such that probably both
short-term and long-term components 
were involved. More particularly, the 
decoding task used here involved an 
original message consisting of a long 
string of letters (first 13 letters of the 
alphabet) which, prior to the start of 
the experiment, had been encoded with 
a binary, ternary, quaternary, or septenary
code (Base 2, 3, 4, or 7). The
task of S was to decode these base-r
digits back into the original message
(i.e., letters). Trials of 2-min. duration
were given, and the measure of
performance was the number of letters 
decoded. By varying Base r from 2 to 
7 one is manipulating the uncertainty 
per symbol from one to nearly three 
bits. If Miller’s (1956) coding prin- 
ciple is generally applicable, then the 
number of letters correctly decoded 
should be a monotonic increasing func- 
tion of the size of the code vocabulary. 
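The range of one to nearly three bits per symbol can be checked directly. The following is a minimal sketch, assuming equally likely code symbols so that the uncertainty per symbol is log2 r; the function name is illustrative only.

```python
import math

def bits_per_symbol(r):
    """Maximum uncertainty, in bits, carried by one symbol of an r-ary code.

    Assumes the r code symbols are equally likely.
    """
    return math.log2(r)

# Code vocabularies used in the experiments: Base 2, 3, 4, and 7.
for r in (2, 3, 4, 7):
    print(r, round(bits_per_symbol(r), 3))
```

For r = 7 the value is a little under three bits, which is the upper end of the range described above.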
Encoding may be considered the “accurate
representation of one thing by
another” (Pierce, 1961, p. 64). A code
thus represents a set of A-B paired as- 
sociates wherein the A members of the 
pair represent the message ensemble 
and the B members of the pair the 
code words. Encoding is associating 
a code word with a message word, 
while decoding is associating a mes- 
sage word with a code word. Three 
types of codes may be distinguished : 
those which decrease redundancy, those 
which preserve redundancy, and those 
which increase redundancy. A
redundancy-preserving code is simply a
one-to-one translation (as in coding 
binary into octal digits), and such 
translations are not generally con- 
sidered in coding theory. Redundancy- 
reducing and redundancy-increasing 
codes are codes ‘which have for aver- 


age length a shorter or longer code 
word than a translation. In coding 
theory the former are minimum-redun- 
dancy or “optimum” codes while the 
latter are error-detecting or error-cor- 
recting codes (Abramson, 1963). This 
three-fold distinction is similar to that 
proposed by Posner (1964) for sequen- 
tial tasks except that his “reduction 
model” was designed for a many-to- 
one mapping and his “creation model” 
was designed for a one-to-many map- 
ping. A classification of types of cod- 
ing suggested by Tulving (1967) has 
also been helpful to the writer, though 
it is not identical with that proposed 
here. 

To take a concrete example from 
Abramson (1963), if the weather in 
Los Angeles were sunny, cloudy, rainy, 
or smoggy with probability .25, .125, 
.125, and .50, then representing these
four states of the world by the binary 
pairs 00, 01, 10, and 11 would be a re- 
dundancy-preserving code. If instead 
they were represented as 10, 110, 1110, 
and 0, respectively, this would be a re- 
dundancy-reducing code, since the av- 
erage length of the code word was 2.0 
binary digits in the first case but only 
1.875 binary digits in the second case. 
(Whether the code is maximally effi- 
cient is determined by the Shannon, 
1948, uncertainty measure which gives 
a lower bound; the second code is not.) 
If instead a parity check code were used 
the first code might be augmented to 
001, 010, 100, and 111 (odd parity) 
which would be a redundancy-increas- 
ing code. Examples of coding from 
the short-term memory literature (e.g., 
the trigram XTR encoded by S as the 
word “eXTRapolate”) may illustrate 
redundancy-increasing codes, but these 
examples are difficult to classify be- 
cause neither the message ensemble nor 
the code vocabulary form a closed set. 
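The arithmetic of the weather example can be reproduced in a few lines. This is a sketch only: the dictionary and function names are illustrative, while the probabilities and code words are those quoted above from Abramson (1963).

```python
import math

# Probabilities and code words as given in the weather example.
probabilities = {"sunny": 0.25, "cloudy": 0.125, "rainy": 0.125, "smoggy": 0.50}
fixed_code = {"sunny": "00", "cloudy": "01", "rainy": "10", "smoggy": "11"}
variable_code = {"sunny": "10", "cloudy": "110", "rainy": "1110", "smoggy": "0"}

def average_length(code, probs):
    """Expected number of binary digits per message symbol."""
    return sum(probs[state] * len(code[state]) for state in probs)

def entropy_bits(probs):
    """Shannon uncertainty of the message ensemble, in bits."""
    return -sum(p * math.log2(p) for p in probs.values())

print(average_length(fixed_code, probabilities))     # redundancy-preserving code
print(average_length(variable_code, probabilities))  # redundancy-reducing code
print(entropy_bits(probabilities))                   # Shannon lower bound
```

The second code averages 1.875 binary digits against a lower bound of 1.75 bits, which is why it is shorter than the fixed-length code without being maximally efficient.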
EXPERIMENT I


Method 


The original message (first 13 letters of 
the alphabet) had a rank-frequency distri- 
bution such that probability of occurrence 
was inversely proportional to rank frequency 
(slope of -1 on log-log graph paper). The
code consisted of the digits 0, 1, ..., r - 1
where r = 2, 3, 4, 7 (r is the size of the
code vocabulary), and each code was a re- 
dundancy-reducing code as described above. 
More particularly, a Huffman (1952) mini- 
mum-redundancy code was used which is an 
instantaneous, unique, nonsingular, block code 
(see Abramson, 1963, p. 51). It is instan- 
taneous in that each word in the sequence 
may be decoded without reference to suc- 
ceeding code symbols. It is nonsingular in 
that all code words are distinct and uniquely 
decodable since the nth extension of the code 
is nonsingular. It is a block code in that 
for every symbol of the source alphabet there 
is a fixed sequence of symbols in the code 
alphabet; these sequences of code digits are 
the code words.

More generally, codes of the Shannon
(1948) and Fano (1949) type required
grouping to minimize redundancy and are
perhaps not too useful to psychologists because
their storage demands overtax the system.
However, a coding system developed
by Huffman (1952) made it possible to encode
each symbol immediately (i.e., without
blocking) provided one used a variable-length
code word, and it can be demonstrated
(Abramson, 1963; Fano, 1961) that, for any
given message, the average length of code
words will not exceed by more than one the
Shannon lower bound (i.e., -Σ p_i log_r p_i). In
the present case, a source vocabulary of 13
and code vocabularies of 2, 3, 4, and 7 items
were used so that no dummy symbols would
be needed in constructing the codes. That is,
if the ratio (q - r)/(r - 1) is integral (where
q is the size of the message vocabulary)
then the code will come out evenly (Abramson,
1963, p. 84). The source vocabulary,
rank-frequency distribution, and the four
codes are shown in Table 1.

Materials—Prior to the start of the ex- 
periment the source message was determined. 
A string of 500 letters was generated by use 
of a random-number table using the frequen- 
cies shown in Table 1. Then this message 
was encoded four times, once with each of 
the four code alphabets (binary, ternary, 
quarternary, septenary). The encoded mes- 
sages (merely strings of r-ary digits) were 
typed out (rows of approximately 36 digits/ 
row, double spacing) and mimeographed. 
There were no spaces between code words.
It was a paper-and-pencil task; S was instructed
to decode the message and write
the correct letter above each code word. 

Procedure.—In the instructions Ss were 
told that it was an experiment on decoding 
similar to receiving Morse code. They were 
given the relevant code to study for several 
minutes before beginning, and it was always 
available for inspection and further study. 
Each trial was 2 min. in length, and the 
intertrial interval was 1 min. There were 12
trials/session and 4 sessions/S, for a total 
of 48 trials/S. They were instructed to per- 
form as rapidly and as accurately as possi- 
ble. The E timed the trials with a stop 
watch. 


TABLE 1
HUFFMAN CODES FOR EXP. I

Letter   Probability   Binary
A        .314          00
B        .157          010
C        .105          100
D        .079          110
E        .063          0110
F        .053          1010
[Rows G-M and the ternary, quaternary, and septenary columns are not recoverable from this copy.]

Subjects—There were 20 Ss from the
subject pool at the University of Vermont. 
All Ss were tested individually, and ran- 
domly assigned to groups (binary, ternary, 
quaternary, or septenary code). 


Results 


The main results are shown in Fig. 
1, where mean number of letters de- 
coded per trial is plotted as a function 
of trial blocks. As can be seen, the 
predicted monotonic ordering was ob- 
tained. Over the entire four sessions 
the mean numbers of letters correctly 
decoded were 3,170, 4,534, 4,765, and 
5,234 for binary, ternary, quaternary, 
and septenary, respectively. An analy- 
sis of variance showed a significant 
difference, F (3, 16) = 11.43, p < .001, 
and the value of ω² was .61, thus indicating
a strong effect (Hays, 1963,
p. 383). 
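As a check on the size of the effect, an estimate of ω² can be recovered from the F ratio alone. The sketch below assumes the usual one-way estimate, (SS between - (k - 1) MS within)/(SS total + MS within), rewritten in terms of F; whether this is exactly the computation used here is an assumption, and the function name is illustrative.

```python
def omega_squared(f_ratio, df_between, n_total):
    """Estimated omega squared for a one-way analysis of variance.

    Algebraically equal to (SSB - df_between * MSW) / (SST + MSW)
    once both sums of squares are expressed through F and MSW.
    """
    effect = df_between * (f_ratio - 1.0)
    return effect / (effect + n_total)

# Exp. I: F(3, 16) = 11.43 with 20 Ss in all.
print(round(omega_squared(11.43, 3, 20), 2))
```

With the values reported for Exp. I this gives .61, in agreement with the figure quoted above.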

The above analyses dealt only with 
correct responses. In analyzing errors, 
omissions (one or more digits skipped 
over in the decoding) were found to be 
infrequent. Except for two Ss in the
binary code on the first session, the
overall frequency of omission was less
than 1 in 1,000. The error rate (total
number of mistakes excluding omissions
divided by total number of letters
correctly decoded) was .6%, .4%, .5%,
and .6% for binary, ternary, quaternary,
and septenary, respectively.
Thus, accuracy appears to have been
high and approximately constant over
groups.

Fig. 1. Mean number of letters decoded as
a function of practice (Exp. I). [Ordinate: mean no. of letters decoded per trial; abscissa: trials in blocks of two, Blocks 1-6 repeated four times.]

In connection with errors, the prefix 
property of Huffman (1952) codes 
should be mentioned. By “prefix 
property” is meant that no code word 
forms the first part, or prefix, of a 
longer code word. One consequence 
of this prefix property is that an error 
results in only a temporary aberration. 
That is, if a digit (or digits) is omitted, 
used twice, or otherwise wrongly 
grouped decoding will be incorrect— 
but only temporarily. There will be a 
short burst of “nonsense” but shortly 
the decoding will be back in phase with 
the original message. (For a clear il- 
lustration of this point see Pierce, 
1961.) The error rates cited in the 
preceding paragraph mean then that 
there was approximately one stretch of 
nonsense for every 200 letters correctly 
decoded. 
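The temporary nature of such errors can be illustrated with a small decoder. This is a sketch, not a model of how Ss worked: it uses the Normal septenary code of Exp. II (digits 1-6 for A-F, 71-77 for G-M), the message and the dropped digit are arbitrary, and the function names are hypothetical.

```python
import string

LETTERS = string.ascii_uppercase[:13]  # A-M

# Normal septenary code of Exp. II: A-F -> 1-6, G-M -> 71-77.
NORMAL_CODE = {}
for index, letter in enumerate(LETTERS):
    if index < 6:
        NORMAL_CODE[letter] = str(index + 1)
    else:
        NORMAL_CODE[letter] = "7" + str(index - 5)

def encode(message, code):
    """Concatenate code words with no spaces between them."""
    return "".join(code[letter] for letter in message)

def decode(digits, code):
    """Instantaneous decoding: emit a letter as soon as a code word is complete."""
    inverse = {word: letter for letter, word in code.items()}
    letters, buffer = [], ""
    for digit in digits:
        buffer += digit
        if buffer in inverse:
            letters.append(inverse[buffer])
            buffer = ""
    return "".join(letters)

message = "AGBHCA"
digits = encode(message, NORMAL_CODE)
damaged = digits[:2] + digits[3:]  # omit the third digit of the string

print(decode(digits, NORMAL_CODE))   # intact string decodes to the message
print(decode(damaged, NORMAL_CODE))  # short burst of nonsense, then in phase
```

Omitting one digit here costs one letter and corrupts another, after which the decoded letters again match the end of the original message.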


EXPERIMENT II 


In constructing Huffman (1952) 
codes, the constraints sometimes pre- 
vent a mapping which corresponds to 
normal alphabetic ordering. Thus, A 
B C might have to be coded as 1 3 2 
rather than 1 2 3, for instance. The 
reason is that the code words are 
formed by sequential construction of 
subsets of size r, and the order in which 
these subsets are formed is determined 
by their probability values. (For a 
description of how to construct Huff- 
man codes see Pierce, 1961.) An ex- 
ample of such an inversion is shown in 
the quaternary code of Table 1, where
C-E were coded 30-32 while F-I were
coded 20-23. 


Method 


Experiment II was performed both to in- 
vestigate the general importance of inver- 
sions in decoding and also to obtain some 
base line against which to assess the particu- 
lar inversion occurring in the quaternary 
code of Exp. I. Here however only a sep- 
tenary code was used because it provides 
greater latitude for experimental manipula- 
tion than do codes to lower-numbered bases. 
Except for the points noted below, the gen- 
eral method of Exp. II was identical with 
that of Exp. I. 

Materials—There were three different 
codes, Normal, Reversed, and Random. The
codes are shown in Table 2. Given that the 
letters were alphabetically ordered in terms 
of probability value (i.e., A most common
and M least common) it is necessary (for 
minimum-redundancy coding) that Letters 
A-F be assigned single septenary digits while 
Letters G-M be assigned a two-digit code. 
Therefore, “reversed” and “random” of ne- 
cessity refer to mappings within these two 
subsets rather than over the entire message 
vocabulary. 
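That the three mappings differ only in compatibility, and not in redundancy, can be verified from Table 2. The sketch below takes the code words and probabilities as read from that table (the Reversed entry for G is taken to be 77); the helper names are illustrative.

```python
# Probabilities for Letters A-M and the three septenary codes, as in Table 2.
PROBABILITIES = [.314, .157, .105, .079, .063, .053, .045,
                 .039, .035, .031, .029, .026, .024]
CODES = {
    "Normal":   ["1", "2", "3", "4", "5", "6",
                 "71", "72", "73", "74", "75", "76", "77"],
    "Reversed": ["6", "5", "4", "3", "2", "1",
                 "77", "76", "75", "74", "73", "72", "71"],
    "Random":   ["3", "5", "1", "4", "6", "2",
                 "75", "71", "77", "73", "76", "74", "72"],
}

def is_prefix_free(words):
    """True if no code word forms the first part of another code word."""
    return not any(a != b and b.startswith(a) for a in words for b in words)

def average_length(words, probs):
    """Expected number of septenary digits per letter."""
    return sum(p * len(w) for w, p in zip(words, probs))

for name, words in CODES.items():
    print(name, is_prefix_free(words), round(average_length(words, PROBABILITIES), 3))
```

All three codes keep the prefix property and the same average length, because permuting code words within the one-digit and two-digit subsets leaves every word length unchanged.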

It may be noted that the digits used were
1-7 while those of the septenary code of Exp.
I were 0-6. An intralist intrusion matrix
for the errors of the septenary code of Exp.
I showed that 75% of the intrusions were
misplacements by one of the single letters;
i.e., 1 decoded as A, 2 as B, 3 as C, 4 as D,
and 5 as E. Obviously 1-6 is a more compatible
code for A-F than is 0-5. Therefore,
to optimize conditions for the Normal condition
the more compatible digits were used
in Exp. II.

TABLE 2
HUFFMAN CODES FOR EXP. II

Letter | Probability | Normal | Reversed | Random
A      | .314        | 1      | 6        | 3
B      | .157        | 2      | 5        | 5
C      | .105        | 3      | 4        | 1
D      | .079        | 4      | 3        | 4
E      | .063        | 5      | 2        | 6
F      | .053        | 6      | 1        | 2
G      | .045        | 71     | 77       | 75
H      | .039        | 72     | 76       | 71
I      | .035        | 73     | 75       | 77
J      | .031        | 74     | 74       | 73
K      | .029        | 75     | 73       | 76
L      | .026        | 76     | 72       | 74
M      | .024        | 77     | 71       | 72

Fig. 2. Mean number of letters decoded as
a function of practice (Exp. II). [Ordinate: mean no. of letters decoded per trial; abscissa: Trials 1-12, shown twice.]

Procedure.—The main difference between
Exp. I and Exp. II was that, in the latter,
there were only two sessions (of 12 trials
each) per S. Otherwise the same procedure
was followed; S simply wrote down as many
letters as he could in each 2-min. trial.

Subjects—There were 24 Ss, students 
from the same pool; they were tested indi- 
vidually and randomly assigned to groups 
(normal, reversed, or random mapping). 


Results 


The main results are shown in Fig. 
2, where mean number of letters de- 
coded per trial is plotted as a function 
of trials. While the Normal code was 
consistently superior, the Reversed 
code was superior to the Random code 
only on the first day. In general, group 
differences decreased with practice. 
On the first day group differences were 
significant, F (2, 21) = 3.86, p < .05, 
but on the second day they were not, 
F (2, 21) =2.31, p>.05. On the 
first trial of the first session the com- 
bined Reversed and Random groups 
decoded 72% as many letters as the
Normal group, but on the last trial of
the second session the combined groups
decoded 91% as many letters as the
Normal group.

Error rates were .5%, .6%, and
.7% for the Normal, Reversed, and
Random groups, respectively. Thus, 
as in Exp. I, accuracy was essentially 
high and constant over groups. In- 
trusion matrices showed that the pro- 
portion of total errors in the cells adja- 
cent to the main diagonal were .94, .86, 
and .42 for the Normal, Reversed, and 
Random groups, respectively. Since 
chance is .15 the distributions are 
clearly nonrandom, and the obvious in- 
terpretation is that the intrusions (par- 
ticularly for the Normal and Reversed 
groups) represented errors of generali- 
zation with a very pronounced adja- 
cency effect (e.g., “3” decoded as B or 
D; “73” decoded as H or J).
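One way to arrive at a chance value near .15 is to suppose that an erroneous response is equally likely to fall in any off-diagonal cell of the 13 x 13 intrusion matrix. The source does not state how the chance figure was obtained, so the sketch below is an assumption, and the function name is illustrative.

```python
def chance_adjacent_proportion(n_letters):
    """Proportion of off-diagonal cells lying next to the main diagonal.

    Assumes errors are spread evenly over all off-diagonal cells of an
    n_letters x n_letters intrusion matrix.
    """
    off_diagonal = n_letters * (n_letters - 1)
    adjacent = 2 * (n_letters - 1)  # one cell above and one below the diagonal
    return adjacent / off_diagonal

print(round(chance_adjacent_proportion(13), 2))
```

For 13 letters this gives 24 of 156 cells, or about .15.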


Discussion 


The fact that there was an S-R com- 
patibility effect in Exp. II means that 
the curve for the quaternary code in
Exp. I is perhaps a bit too low (see 
Fig. 1). That is, there was a reversal in 
this code affecting Letters C-E and F-I 
which may have depressed performance 
somewhat. However, the magnitude of 
this effect must certainly be rather small. 
Also, Exp. II strongly suggests that the
effects of incompatibility are transitory. 

Given the presumed trade off of speed 
and accuracy in tasks of human perform- 
ance (Fitts & Posner, 1967), it ap- 
pears that in the decoding task used here 
Ss clearly sacrifice speed for accuracy. 
While the instructions (“work as rapidly 
and as accurately as you can”) were de- 
liberately ambiguous, the interpretation 
of these instructions by Ss was not. As 
noted above, the error rate was consistently
around .5%; the differential
characteristics of the various codes were
therefore reflected in the number of let- 
ters decoded per trial, not in the ac- 
curacy of decoding. 


The results of Exp. I clearly support 
the Miller (1956) coding principle;
namely, the desirability of maximizing 
the number of bits per chunk. Perform- 
ance as measured by number of letters 
decoded is directly related to r, the size 
of the coding vocabulary. However, 
which aspect or aspects of the task bene- 
fit from chunking is certainly not clear 
from the present study. 

To obtain a more precise estimate of
processing rate, it was assumed that the
output component of the task (writing
down the alphabetic characters) was independent
of the size of the coding
vocabulary and, as a guess, required
500 msec/letter for execution. Then, an
analysis of the fourth session of Exp. I
showed that, taking account of total number
of letters decoded and the average
length of the code words (3.215, 2.031,
1.639, and 1.229 for r = 2, 3, 4, 7, respectively),
the length of time to process
each r-ary digit did in fact increase with
r and the rate of gain was estimated at
approximately 16 bits/sec. This value is
not unreasonable (see Fig. 32, p. 105,
Fitts & Posner, 1967) but, more important,
it suggests that the principle of
maximizing the number of bits per chunk
does not contravene the basic relationship
between uncertainty and choice reaction
time. Greater uncertainty associated
with enriched chunks may in fact increase
processing time as Hick's (1952)
law says it should, but this deleterious
effect is more than overcome by the
shorter length of the resulting code
words.
A SERIAL POSITION EFFECT IN NUMBER-RECOGNITION


H. B. G. THOMAS¹


Illinois State Psychiatric Institute 


A recognition-memory task has been used to investigate the handling
of the numbers 0-99. Of these 100 integers, 20 critical primes and 20
critical composite numbers, matched approximately for size and distributed
across target-list positions on a Latin-square design, have
been compared. The number of recognition-failures n_i on a number
X_i is found to be determined largely by the serial position of X_i in the
sequence 0, 1, ..., 99 and shows a typical serial position effect. A
mathematical model, described elsewhere, fits this serial position curve
well. However, for composite numbers, but not primes, recognition-failures
on individual items show a significant amount of additional
variability not accounted for by the serial position effect. The implications
of these findings are discussed.


Although numbers and digits are 
commonly used in experimental psy- 
chology as stimuli, responses, learning 
materials, and the like, number-size has 
been somewhat neglected as a factor 
which may affect performance in vari- 
ous types of task. Wheeler (1939, 
1941), among others, has shown that 
for children the relative difficulty of 
mastering arithmetical facts such as 2 
x 3 = 6, or 7 + 8 = 15, depends on the 
sizes of the three numbers involved; 
in fact, the difficulty correlates highly 
with the logarithm of their sum 
(Thomas, 1963b). An analogous func- 
tion seems to determine calculation 
time in adults, under suitable conditions 
(Thomas, 1963a). Namikas and Brog- 
den (1960) used the numbers 10, 20, 
30, 40 in experiments on the trial and 
error learning of redundant lists and 
their data show a monotonic increase of 
learning-difficulty with number-size.
Experiments by Dale and Baddeley
(1966) will be discussed below.

¹ The author is grateful to D. E. Broadbent
for permitting this experiment to be
conducted in the M. R. C. Applied Psychology
Research Unit, Chaucer Rd., Cambridge,
and to A. D. Baddeley for his constructive
criticism and for administering the
task. The experiment was prepared while
the author held the post of Psychiatric
Registrar at Fulbourn Hospital, Cambridge.

The investigation reported here was 
suggested by certain incidental findings 
of a pilot experiment on the recogni- 
tion-learning of numbers. In that 
study the probability of failing to recognize
a “target” number X_i was found
to be significantly correlated both with
the logarithm of its size (log X_i) and
with its association value (m_i) as estimated
by Battig and Spera (1962).
When primes and composites were con- 
sidered separately, these correlations 
were significant for those target-items 
which were prime numbers but not for 
those which were composites. As only 
one list was used in the pilot study it 
seemed desirable to reexamine the ef- 
fect of number-size and the apparent 
difference between prime and composite 
numbers, under conditions designed to 
distribute these numerical properties 
uniformly across list positions and us- 
ing a greater number of lists. 

As will appear, the results of the 
pilot study are only partially confirmed 
under these conditions. Both primes 
and composites show a clear effect of 
number-size, but the effect is now of a 
different kind; recognition-failures are
not linearly related to log X_i but instead
show a serial position effect, with
reference to the position of X_i on the
numerical scale 0-99. As before, the
composites show greater variability 
than the primes. 


METHOD 


Procedure.—The Ss were 18 Cambridge
housewives, tested as a group in a single 
session lasting about 1 hr. Each received 
about $3.00 for expenses. Altogether 10 
“target lists” were presented, each contain- 
ing 10 different integers from the range 0-99 
so that each of these 100 integers occurred 
just once in the set of target lists. The Ss 
heard a given target list twice and were then 
tested by the recognition method: A test 
series of 100 integers, also from the range 
0-99, was presented and S was required to 
enter a check or a cross on his response 
sheet on hearing each number, according to 
whether he believed it to have been in the 
current target list or not. 

The Ss were told in advance the range in 
which both target and test numbers would 
lie but were given no indication as to 
whether, or how often, target numbers would 
occur in the test series. In fact, every tar- 
get number occurred just once in the en- 
suing test and might also occur as a distrac- 
tor in other test series. The Ss were told 
that the numbers they would hear in the test 
series would form a haphazard, random 
series in which any number from 0 to 99 had 
an equal chance of occurring next, so that 
repetitions might occur occasionally. They 
were instructed to guess if they were un- 
certain, but not to omit any responses. Each 
S should therefore have yielded 10 series 
of exactly 100 responses. Any S whose re- 
sponse sheet contained fewer or more than 
100 responses to any of the test series was 
discarded from the analysis; this excluded 
6 Ss. The data of the remaining 12 Ss were
accepted as valid. 

Apart from the preliminary instructions 
the entire presentation was prerecorded. 
Target and list numbers came at intervals 
of 2 sec. and each was preceded by a brief 
timing signal. Each new target list was an- 
nounced with the words, “Here is the next 
list; listen carefully,” and presented after 
a brief pause. The second presentation was 
preceded by the words, “Here is the list 
again; listen carefully,” and followed by the 
words, “Now here is the test. Are you
ready?” It was never found necessary to
stop the tape at this point and the test series 
therefore followed at once, beginning with 
the first timing signal. The end of each 
test was indicated by the word “Stop!” in 
place of a number. 

Target list—In order to compare the han- 
dling of prime and composite items it was 
decided to choose 20 “critical” numbers of 
each kind, matching primes and composites 
approximately for size. The 20 primes were 
selected first, so as to sample the range 0- 
99 as representatively as possible, since 
primes are more densely distributed at the 
lower end of the range. The 20 composites 
were then chosen so that each lay not more 
than four integers above or below the corresponding
prime. Two critical numbers of
each kind were inserted into each row of a
10 × 10 matrix according to a Latin-square
design, rows denoting lists and columns de- 
noting list positions. Over the whole set 
of lists, two critical primes must therefore 
occur in any given position. By a ranking 
method it was arranged so that in each posi- 
tion a larger prime should be paired with a 
smaller one and vice versa; a similar pro- 
cedure was used for the critical composites. 
Consequently, the sum of the two primes 
varied only from 70 to 99 and that of the 
two composites from 73 to 102, from one list 
position to another. 

The remaining 60 places in the matrix 
were filled with the integers not selected as 
critical items, initially by writing down a 
random permutation (Moses & Oakford, 
1963) with the critical integers omitted.
These noncritical items were then shuffled 
about, making the fewest possible inter- 
changes, until the sum of all numbers in 
each row and also in each column of the 
matrix fell within the limits 495 ± 50. This
procedure was primarily designed to counter- 
balance the factor of number-size across the 
various lists and list positions, for both criti- 
cal and noncritical numbers; but it also 
achieved a fairly uniform distribution of association
values, the mean m_i lying between
the limits 1.81 ± .20 for all list positions.
Table 1 shows the resulting set of target 
lists in their final order of presentation. 

Test series—Each contained 100 integers 
from the range 0-99, drawn initially at ran- 
dom but edited, with the fewest possible 
changes, so as to contain from 8 to 12 in- 
tegers in each decade of the range; from 17 
to 23 digits of each kind except zero and 
from 8 to 12 zeros. Also in constructing 
each series, integers which had occurred relatively
often in previous series were replaced
by integers which had occurred rarely or not
at all, so that when all 10 series had been
completed the frequencies of the 100 integers
0, 1, ..., 99 conformed closely to a rectangular
distribution: χ²(99) = 1.28, p >
.995. The order of presentation of test series
in the experiment bore no relation to the order
in which the series had been constructed.
Each target-item occurred just once in the
corresponding test series, its place in the
series being determined randomly. These
placings were edited so that over all 10
series, target-items were distributed uniformly
through the series, never occurring
however in the first or last three places or
with fewer than two distractors between
adjacent target-items.

TABLE 1
TARGET LISTS EMPLOYED

List Position

List
1 2 3 4 5 6 7 8 9 10
I 62 12 73 41 54 84 50
II 96 34 53 10 32 22 46
III 31 99 40 28 15 23 60
IV 6 43 75 29 69 48 64
V 38 63 17 27 44 90 3
VI 47 25 8 94 13 57 93
VII 88 12 1 79 98 11 21
VIII 33 2 82 85 51 52 17
IX 55 97 70 39 91 4 89
X 65 24 37 45 67 61 36
Note.—Italics denote critical items.
[Only seven of the ten entries per list survive in this copy; their list positions cannot be recovered.]

RESULTS


Table 2 presents, for each critical
item, its numerical size (X_i), the number
of Ss out of 12 who failed to recognize
this item (n_i), and the association
value of the item (m_i). It is obvious
on graphical inspection that there is no
monotonic relationship between n_i and
X_i, whether primes and composites are
examined separately or together. The
pilot study finding of a significant correlation
between n_i and log X_i (for
primes only) is not, therefore, confirmed
under the present conditions.


TABLE 2 


NUMBER-SIZE (X_i), NUMBER OF RECOGNITION-FAILURES (n_i), AND
ASSOCIATION-VALUE (m_i) FOR EACH CRITICAL ITEM


Primes 


Composites 


Target 
List 
Positions es i ? 
mi ni ni mi 

1 71 4 0.83 4 | 2.57 6 1.55 | 18 2 | 2.67 
2 19 3 2.01 0 | 0.88 2 2.29 | 81 8 | 1.61 
3 7 2 2.62 6 | 0.79 7 1.14 | 14 4 | 2.06 
4 47 6 0.95 6 | 0.72 6 1.91 6 1 | 2.23 
5 97 5 1.61 5 | 3.11 4 2.43 | 63 12 | 1.34 
6 53 7 0.79 1 | 1.85 6 2.43 8 3. ee 
7 29 9 1.18 8 | 1.00 9 1.69 | 45 7 | 2.21 
8 67 7 1,12 4 | 3.06 7 1.02 | 32 2 | 1.695 
9 11 4 2.08 EETA 8 2.71 | 84 2 | 1.09 

10 3 1 2.55 4 | 1.38 2 3.03 | 60 8 | 2.14 
However the recognition-failures are
not randomly distributed among the 40
critical items, χ²(39) = 55.19, p < .05,
and it is proper to look for the sources
of this variance.
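A statistic of this kind, with one fewer degrees of freedom than there are items, can be computed as below. This is a sketch of the plain goodness-of-fit form against equal expected frequencies; the source does not say which form of the test was used, and the counts shown are hypothetical.

```python
def chi_square_equal_expected(counts):
    """Chi-square statistic and degrees of freedom against equal expectation.

    Every category is assumed to share the same expected count,
    namely the mean of the observed counts.
    """
    expected = sum(counts) / len(counts)
    statistic = sum((c - expected) ** 2 / expected for c in counts)
    return statistic, len(counts) - 1

# Hypothetical recognition-failure counts for four items.
statistic, df = chi_square_equal_expected([4, 6, 5, 5])
print(statistic, df)
```

With 40 critical items the same function would return 39 degrees of freedom, matching the test reported above.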

The main source of variability appears
to be the position of the item X_i
on the linear numerical scale, 0, 1, ...,
99. For both the primes and the composites,
considered separately, the graph
of n_i against X_i is an upward-convex
curve, masked by scatter. The two
curves are similar and when prime and
composite items are matched approximately
for size (by rank ordering them
according to size) a comparison of n_i
values supports the null hypothesis of
no difference between the curves:
χ²(15) = 6.98, p > .95.

The form of the relationship is
clearly revealed if critical prime and
composite items are combined and
grouped by size into the class intervals
0-19, 20-39, 40-59, ..., 80-99 and if
mean n_i is then plotted against mean
X_i for each class. The result is a typical
serial position curve (Fig. 1) with
a maximum a little above X_i = 60.

Now it has been shown that typical
serial position curves found in rote
learning are in general closely fitted by
a mathematical model described elsewhere
(Thomas, 1968). According to
that model, a function U'(p_i, h) measures
the amount of information handled
by a particular psychophysical
process when it operates on the item
or list position lying a proportion p_i
of the way down the list. In the case
of the present experiment, where it was
made clear to Ss in advance that they
would encounter only numbers in the
range 0-99, it is reasonable to assume
that a given number X_i may be regarded
as lying a proportion p_i of the
way from 0 to 99 on a linear psychophysical
dimension which represents
this numerical range, where p_i = X_i/99.
The model can be fitted to the curve of
Fig. 1 on this assumption.

Fig. 1. Mean number (n_i) of Ss failing
to recognize critical items in each interval,
as function of mean size (X_i) of critical
items in that interval.

The function U'(p_i,h) is fitted to a given serial position curve by
estimating the value (p̂) of p_i at which the maximum occurs; a parameter h
is then calculated from p̂ using the relation
$h = (1 - \hat{p})/(2\hat{p} - 1)^{1/2}$, which is specified by the model.
Given h, a value of U'(p_i,h) can be computed for each value of p_i
according to the definitions:

$$U'(p_i, h) = -p_i' \log p_i' - (1 - p_i') \log (1 - p_i')$$

where

$$(1 - p_i') = a_2/a; \quad a = \arctan[1/h]; \quad a_2 = \arctan[(1 - p_i)/h].$$
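
The two definitions above can be checked numerically. The following is only an illustrative sketch, not the author's own computation: the function names are hypothetical, and the logarithm base is not stated in the text, so base 10 is assumed here.

```python
import math

def h_from_peak(p_hat):
    """Parameter h from the peak position p_hat (requires p_hat > 0.5)."""
    return (1.0 - p_hat) / math.sqrt(2.0 * p_hat - 1.0)

def u_prime(p, h):
    """U'(p, h) as defined in the text; log base 10 is an assumption."""
    a = math.atan(1.0 / h)
    a2 = math.atan((1.0 - p) / h)
    q = a2 / a                 # this is (1 - p')
    p_prime = 1.0 - q
    # Terms with x == 0 are skipped, since x*log(x) tends to 0 as x tends to 0.
    return -sum(x * math.log10(x) for x in (p_prime, q) if x > 0.0)

# Peak taken at X = 63 on the 0-99 scale, as in the text.
h = h_from_peak(63 / 99)
print(round(h, 3))
```

With the peak at p̂ = 63/99 this gives h close to the working value .696 reported in the text, and U' vanishes at both ends of the range (p = 0 and p = 1) while reaching its maximum at p̂.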


In the present case, the maximum of the curve was taken to lie at X_i = 63.
This value is consistent with graphical inspection and with the fact that,
out of all the 100 target integers, 63 was the only one which failed to be
recognized by all 12 Ss. (Conversely, the only two integers which were
recognized by all 12 Ss were 0 and 99.) This estimate yields the working
value h = .696. Using this h-value, U'(p_i,h) was calculated for each size
class of critical integers, from the value of p_i corresponding to the mean
X_i of the class. Figure 2 shows that an impeccable fit is the result; mean
n_i is almost exactly proportional to U'(p_i,h).

In view of this result a comparison was made, for each individual critical
integer X_i, of the observed n_i with the value of n_i which would be
expected if recognition-failures were entirely attributable to the linear
relation shown in Fig. 2. Taking each critical X_i in turn, first p_i then
U'(p_i,h) was calculated (using the same h-value as before) and finally the
expected n_i was computed from the equation n_i = 1.50 + 19.64 U'(p_i,h)
which describes Fig. 2. The differences between observed and expected n_i
were not significant for the 20 critical primes, χ²(17) = 21.46, p > .10,
were barely significant for all 40 critical items combined, χ²(37) = 52.66,
p = .05 for χ² = 52.16, but were highly significant for the 20 composite
items alone, χ²(17) = 31.20, p < .025.
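
The chain of steps just described (X_i to p_i to U' to expected n_i) can be sketched as follows. This is a hedged illustration rather than the original computation: the function names are hypothetical, the working value h = .696 and the coefficients 1.50 and 19.64 are taken from the text, and base-10 logarithms are an assumption because the text does not state the base.

```python
import math

H = 0.696  # working value of h reported in the text

def u_prime(x, h=H, x_max=99):
    """U'(p, h) for integer x on the 0..x_max scale (log base 10 assumed)."""
    p = x / x_max
    a = math.atan(1.0 / h)
    a2 = math.atan((1.0 - p) / h)
    q = a2 / a                 # (1 - p')
    p_prime = 1.0 - q
    return -sum(v * math.log10(v) for v in (p_prime, q) if v > 0.0)

def expected_failures(x):
    """Expected n for item x from the linear relation n = 1.50 + 19.64 U'."""
    return 1.50 + 19.64 * u_prime(x)

for x in (0, 20, 40, 63, 80, 99):
    print(x, round(expected_failures(x), 2))
```

Under these assumptions the expected number of failures is lowest at the two ends of the range and highest near X_i = 63. The observed n_i for individual items are not listed in the text, so the chi-square comparison itself is not reproduced here.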

Fig. 2. Mean n_i as function of U'(p_i,h) calculated from mean X_i for
critical items in each interval 0-19, 20-39, . . . 80-99.

For the prime numbers, therefore, the recognition-failures observed under
the present conditions may be attributed very largely to the effect of the
serial position of the item in the numerical range 0-99; but for the
composite numbers some other source of variability was clearly affecting the
recognition-difficulty.

Serial position of the critical item in the target list does not appear to
affect recognition-difficulty; the distribution of recognition-failures
across target list positions does not depart significantly from randomness.
For the primes χ²(9) = 9.88, p > .10, and for the composites χ²(9) = 7.96,
p > .10.

Taking critical primes and composites together, there is a moderate
correlation between n_i and m_i: r = -0.462, p < .01. This is mainly due to
the primes, since for these r = -0.626, p < .01, whereas for the composites
alone the correlation is not significant: r = -0.364, p > .10.


Discussion 


Dale and Baddeley (1966) used two- 
digit numbers 10-99 in experiments on 
free recall and recognition memory and 
were led by some of their results to sug- 
gest that such numbers might be coded in 
terms of their positions on a scale corre- 
sponding to this numerical range. An 
experiment to test this hypothesis failed 
to confirm it; numbers differing by one 
unit from target numbers were not
significantly more effective as distractors,
under recognition conditions, than more 
distant numbers. However, as these 
“distant” distractors lay only 2-5 units 
away from their respective target num- 
bers it is perhaps not surprising that the 
observed effect, though in the predicted 
direction, was very small. It was also 
shown that intrusion errors in free recall 
tended to have the same left-hand digit 
as omission errors. This resemblance 
was not found for the right-hand digit 
however. If intrusion errors are substi- 
tutes for omissions, this interesting result 
suggests that since the error is mainly 
concentrated in the “less significant” 
right-hand digit, the numbers were per- 
haps coded on a two-level decimal tree, 
rather than on a simple linear array as 
Dale and Baddeley conjectured and as
the present results seem to indicate. 

There may, therefore, be a discrepancy 
between the two sets of results, as there 
is between the number-size effect re- 
ported here and that suggested by the 
pilot experiment. The number of Ss in 
the present experiment is small but sta- 
tistically significant results are obtained 
and the close fit of Fig. 2 cannot easily 
be dismissed as fortuitous; it may there- 
fore be necessary to assume that under 
different conditions Ss are capable of 
adopting two or more different cognitive 
sets towards number-symbols. It is, of 
course, possible to manipulate “set” de- 
liberately, though not necessarily in a 
predictable way (Wing, 1964); inad- 
vertent manipulation in favor of this or 
that “number-set” is therefore a possibil- 
ity. It is customary to distinguish be- 
tween the cardinal and the ordinal con- 
cepts of Number and the results of the 
present experiment accord well with what 
might be expected if Ss were to adopt the 
ordinal number set. When a number- 
symbol, X, is used as a cardinal number, 
i.e. arithmetically, it stands for a class 
containing X members; when it is used 
as an ordinal number it denotes the Xth 
member of an ordered series. Since an 
ordered series is essentially a list, the 
adoption of this set might well cause a 
serial position effect to be shown in 
performance. 

In this experiment, where stringent precautions were taken to ensure an
equiprobable representation of the entire numerical range 0-99 in both
target lists and test series, it has been shown that under “yes-no”
recognition conditions the likelihood of failing to recognize a number X_i
varies with the serial position of X_i in this range and conforms to a
typical serial position curve. Although the serial position curves obeyed by
primes and composites do not differ significantly, the primes fit the
function U'(p_i,h) without significant error, whereas for the composites the
deviations from fit do reach significance. Since composite numbers have
prime factors (which might be expected to provide additional opportunities
for the individual S to form his own idiosyncratic associations to the
number) whereas primes have no factors, the greater variability of
performance with composites may be due to this cause. Examination of Battig
and Spera’s (1962) data supports this view. When m_i is plotted against X_i
the 28 prime numbers in the range 0-100 conform, with relatively little
scatter, to a U-shaped curve; the 72 composites show much more dispersion
and the points for these nearly all lie above the curve for the primes.
Conversely, if the range 0-100 is divided into decades and the mean m_i is
found for the composites in each decade, 21 of the 28 individual primes have
m_i values below the mean m_i for composites in their decade, a result
significant at p < .008 on a binomial test.
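
The binomial result can be checked directly. The sketch below assumes a one-tailed test with a chance probability of .5 that a prime falls below its decade mean; the text does not say which tail convention was used, and the function name is hypothetical.

```python
from math import comb

def binomial_upper_tail(k, n, p=0.5):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, j) * p**j * (1 - p)**(n - j) for j in range(k, n + 1))

# 21 of the 28 primes fell below the mean for composites in their decade.
p_value = binomial_upper_tail(21, 28)
print(p_value < 0.008)
```

Under that assumption the upper-tail probability is about .006, which is consistent with the reported p < .008.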

The simplest interpretation of the facts 
is that composite numbers are, indeed, 
richer and more variable in number of 
associations than primes and that this 
variability tends to mask number-size ef- 
fects which are shown more clearly by 
primes. The m_i values obtained by Battig and Spera (1962) are themselves
the result of a task in which Ss were required to rate each number on a
scale of associative richness. Whatever may be the psychophysical processes
involved in assigning ratings to a set of items, the U-shaped relation
between m_i and X_i suggests that the serial position of X_i on the scale
0-100 was exerting an effect on these processes, but that effect was masked,
for the composites, by their greater variability. The U-shaped relation is,
however, shown by composites as well as by primes when mean m_i is plotted
against mean X_i for each decade.

The serial position effect reported here 
was shown with respect to the serial posi- 
tion of each target-item in a purely con- 
ceptual ordering, that of the first 100 
integers. There may be a parallel be- 
tween this result and those of De Soto 
and Bosley (1962) and Pollio (1966) 
who also obtained serial position effects 
with reference to conceptual dimensions 
but used a paired-associate learning tech- 
nique. In the present case, since a recog- 
nition method was used, no numbers were 
uttered by the Ss and the temporal order 
of presentation of both target and test
items was randomized with respect to 
number-size. Consequently the genera- 
tion of the effect cannot have been di- 
rectly linked either to the input of stimuli 
or to the output of responses but must 
have occurred during the execution of 
some cognitive operation which inter- 
vened between the perception of a test 
item and the utterance of a “yes-no” re- 
sponse and which presumably has to do 
with a search of the dimension along 
which the effect is shown. 

MEDIATION AND MEDIATION TIME IN
PAIRED-ASSOCIATE LEARNING¹


GORDON WOOD AND MARTIN BOLT


Michigan State University


3 experiments were conducted to study mediation and the relationship
between temporal factors and mediation. Exp. I and II gave support
to the view that providing relevant mediators during the study trial can
facilitate paired-associate learning. Exp. III investigated the effects
of mediation, study time (1, 2, 4 sec.), and test time (1, 2, 4 sec.).
All 3 variables influenced performance, but interactions between either
of the 2 temporal variables and mediation were not significant. It
appears that mediation processes are independent of time factors.


Investigators have used various 
“mediation” paradigms and manip- 
ulated the length of the anticipation 
interval for the final list to assess 
whether temporal factors influence the 
amount of positive transfer. Although 
some studies (e.g., Runquist & Mar- 
shall, 1963; Schulz & Lovelace, 1964) 
support the view that temporal factors 
can influence the amount of positive 
transfer, other studies (e.g., Richard- 
son, 1967; Schwenn & Underwood, 
1965; Spear, Mikulka, & Podd, 1966) 
do not. The use of a transfer para- 
digm may not be the most satisfactory 
means to assess how temporal factors 
influence mediation in that other inter- 
pretations (e.g., parasitic reinforce- 
ment) of the positive transfer may be 
more appropriate. Also, the amount of 
positive transfer obtained with some 
mediation paradigms (e.g., A-B, A-B’) 
is typically small. 

1 Thanks are due David C. Raskin for his
helpful comments and criticisms, and Ronald
E. Buchter and Diane E. Worden for their
assistance in collecting the data. This
investigation was supported in part by a grant
from Michigan State University.

Another approach to the study of
mediation would be to provide one
group of Ss with mediators for the
learning of a single list of paired-associate
(PA) pairs during the study
trials and not provide mediators for a
control group of Ss. If the two groups 
differ in the learning of the PA pairs, 
as measured by test-trial performance 
in which neither group is provided with 
mediators, it follows that this facilita- 
tion is due to the presence of the medi- 
ators. The use of a single list has at 
least one important advantage over the 
use of a transfer paradigm in that in- 
terpretations based on processes other 
than mediation (e.g., parasitic rein- 
forcement) are likely to be inappro- 
priate. Yet, there has been no clear 
demonstration that providing Ss with 
a mediator will facilitate their learning 
of a single list. Dallett (1964) 
attempted to facilitate the PA learning 
of a single list by priming mediating 
responses to the stimuli. The group 
provided with relevant mediators dur- 
ing the first presentation trial had bet- 
ter absolute performance than the ap- 
propriate control, but this difference 
was not significant. The present study 
employed a modification of Dallett’s 
procedure in an attempt to demonstrate 
that PA performance can be markedly 
facilitated by providing Ss with a rele- 
vant mediator. If providing mediators 
results in facilitated performance, then 
it should be feasible to use this pro- 
cedure to study how temporal factors 
influence mediation. 

EXPERIMENT I
Method 


Design.—A total of 24 Ss from undergraduate
psychology courses at Michigan
State University was randomly divided into 
two equal groups of 12 each. The mediation 
(M) group was provided with a mediator 
for each PA pair on the study trials; the 
control (C) group was not provided with a 
mediator. Neither group was provided with 
a mediator for the test trials. 

List.—The stimuli for the PA list were
single letters. Each mediator started with 
the same letter as the corresponding stimu- 
lus; each response was a strong associate 
of a mediator. For example, the pairs T—chair,
K—queen, and M—lamb had the mediators
table, king, and mutton, respectively.
The words were selected from the Minnesota 
norms (Russell & Jenkins, 1954) ; there was 
a total of 16 pairs. 

Procedure.—A study-test procedure was 
used. In addition to the standard PA in- 
structions given to each group the M group 
was told that they would be given a mediator 
to link each of the letter-word pairs. The 
mediator was presented in parentheses be- 
tween the stimulus and response on each of 
the study trials. Both groups were given 
an example (Q—loud) similar to the pairs
of the list. The M group was also given a 
mediator (quiet) and told that the mediator 
should aid the linking of the pair. Each 
S received four alternating study and test 
trials. The materials were presented on a 
Stowe memory drum at a 2-sec. rate with 
an intertrial interval of 4 sec. Only one 
order of the 16 pairs was used. 


Results and Discussion 


The mean number of correct re- 
sponses for the M group was 5.92, 9.50, 
12.08, 13.17 for Test Trials 1-4, re- 
spectively. The mean correct for the 
C group was 3.25, 6.25, 8.42, and 10.17 
for Test Trials 1-4, respectively. An 
analysis of variance on these scores 
indicated that providing mediators significantly
facilitated performance, F (1, 22) = 9.45,
p < .01. Moreover, the superiority of the
M group was maintained across the four
trials as indicated by the nonsignificant
interaction between trials and conditions
(F = .46).
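
As a quick check on the claim that the M group's advantage was maintained across trials, the per-trial difference between the reported group means can be computed. This is only a descriptive sketch using the means given above; the variable names are hypothetical, and the analysis of variance itself cannot be reproduced because individual scores are not reported.

```python
# Mean number correct on Test Trials 1-4, as reported in the text.
m_group = [5.92, 9.50, 12.08, 13.17]   # mediators provided
c_group = [3.25, 6.25, 8.42, 10.17]    # control

# Advantage of the M group on each test trial.
advantage = [round(m - c, 2) for m, c in zip(m_group, c_group)]
print(advantage)
```

The differences stay within roughly 2.7 to 3.7 items on every trial, which is in line with the nonsignificant Trials × Conditions interaction.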

The performance of the M group
appears to have been facilitated because
the mediators provided efficient links 
between the letter-word pairs. The 
extent of the efficiency would appear 
to depend on the strength of the asso- 
ciation between the stimulus and the 
mediator and the association between 
the mediator and the response. It is 
possible, however, that the facilitation 
obtained may not be dependent on 
a preestablished connection between 
the mediator and the correct response. 
One could argue that facilitation was 
obtained because it is easier to asso- 
ciate a word with a word than a word 
with a letter. That is, if facilitation 
could be obtained when a “mediator” 
is provided which is not related to the 
correct response, then the facilitation 
obtained in Exp. I probably resulted 
from S using a word (i.e., the mediator)
rather than the single-letter stimulus
as the functional stimulus. Experiment I
may also be criticized because
instructing the M group that
the mediators should aid their learning 
might have produced different levels 
of motivation for the two groups. 

To establish that the superiority of 
the M group of Exp. I was due to 
mediational processes, a motivational 
or functional stimulus interpretation 
of the results must be ruled out.
Experiment II was designed to determine
if the facilitation of the M group of 
Exp. I was dependent on a preestab- 
lished association between the mediator 
and the response. If this association 
is necessary in order to obtain facilita- 
tion, then it follows that the results 
of Exp. I cannot be accounted for 
solely on the basis of a motivational or 
functional stimulus interpretation. 


EXPERIMENT II
Method 


Design and procedure.—A total of 72 Michigan State University students from
undergraduate psychology classes was assigned to four groups of 18 each.
Each group was given six study and six test trials on one of two lists. For
each list, one group was provided with a mediator during the study trial and
one group was not given a mediator. Neither group was given a mediator for
the test trials. For any one list, Ss were randomly assigned to an M or C
group in order of their appearance at the laboratory. The instructions were
essentially identical to those used in Exp. I.

Lists.—For both lists, the 16 single-letter stimuli were the same as those
used in Exp. I, and each mediator started with the same letter as the
corresponding stimulus term. For List 1, the PA pairs were identical to
those used in Exp. I, but the mediators for the M group were selected so
that there was no obvious relationship between the mediators and the correct
responses. In other respects (e.g., frequency of occurrence) an attempt was
made to select mediators which were comparable to those employed in Exp. I.
For example, the mediators for the stimulus-response combinations of
T—chair, K—queen, and M—lamb were town, kiss, and mile, respectively. For
List 2 the stimuli and the mediators presented to the M group were identical
to those used in Exp. I, but the responses were changed so that there was
unlikely to be a preestablished association between the mediators and the
responses. For example, the mediators for the PA pairs T—doctor, K—sour, and
M—afraid were table, king, and mutton, respectively. Three random orders of
each list were employed in order to minimize serial-learning effects.


Results and Discussion 


Of particular interest is whether providing Ss with a mediator that does not
have a preestablished connection with the correct response facilitates
performance. For List 1 (e.g., K—queen), Ss provided with mediators (e.g.,
kiss) had a mean recall of 2.56, 5.33, 7.44, 8.56, 10.44, and 11.39 for
Trials 1-6, respectively. The Ss not provided with mediators had a mean
recall of 6.33, 9.11, 10.33, 11.83, 12.33, and 13.50 for Trials 1-6,
respectively. An analysis of variance on these scores indicated that the
superiority of the C group to the M group was significant, F (1, 34) =
12.22, p < .001. The interaction between trials and conditions was
significant, F (5, 170) = 2.77, p < .05, when all six trials were
considered. When possible ceiling effects were minimized by considering only
the first three trials, the interaction between trials and conditions was
not significant, F (2, 68) = 1.33, p > .05. The mean number correct on List
2 for the M group was 1.33, 3.94, 6.11, 7.83, 9.72, and 10.78 for Trials
1-6, respectively. The mean number correct on List 2 for the C group was
2.11, 6.44, 8.11, 8.94, 10.00, and 10.67 for Trials 1-6, respectively.
Although the C group was slightly superior to the M group, the difference
between the two groups was not significant, F (1, 34) = 1.04, p > .05; nor
was the interaction between conditions and trials, F (5, 170) = 1.78,
p > .05.

The results suggest that a functional 
stimulus or motivational interpretation 
of Exp. I is inappropriate. It is likely 
that the superior performance of the 
M group of Exp. I depended on a pre- 
established connection between the 
mediators and the responses, since 
mediators not having this connection 
(Exp. II) did not facilitate performance.
It is also unlikely that giving instructions
to the M group of Exp. I
accounts for the superior performance
of this group, since similar instructions 
to the M group of Exp. II did not 
result in facilitated performance. 
Thus, the results of Exp. I and II of- 
fer convincing evidence that the pres- 
ence of a mediator can facilitate per- 
formance, and this facilitation depends 
on a preestablished association between 
the mediator and the response. 


EXPERIMENT III


Since it appears quite likely, under 
the conditions of Exp. I, that Ss pro- 
vided with relevant mediators used 
these mediators to link the PA pairs, it 
seemed appropriate to consider how 
temporal factors influenced the performance
of an M group relative to a
C group. If mediation takes additional 
time, it follows that the superiority of 
Ss provided with mediators would be 
greater with a longer anticipation inter- 
val. Yet, the anticipation interval may 
not be the only temporal interval in- 
fluencing mediational processes. Schulz 
and Lovelace (1964) suggest that it 
might be useful to postulate that the 
discovery and utilization of mediators 
are separate factors. It is possible that 
the amount of time for discovering 
mediators (study time) may facilitate 
learning only if Ss are given sufficient 
time to use these mediators. A pro- 
cedure similar to the one employed in 
Exp. I was used in Exp. III in another 
investigation of the relationship between
temporal and mediational factors.
The design was a 2 × 3 × 3
factorial with mediation (providing or 
not providing S with a mediator), rate 
of presentation on the study trial (1, 2, 
or 4 sec.), and rate of presentation on 
the test trial (1, 2, or 4 sec.) as the 
three variables. 


Method 


A total of 270 students from undergradu- 
ate psychology courses at Michigan State 
University was randomly assigned to the 
18 conditions by preparing 15 random blocks 
of the 18 conditions and assigning Ss to a 
condition in the order of their appearance in 
the laboratory. The list presented was the 
same as in Exp. I. The instructions
were slightly modified in that Ss were told 
(appropriately) that their test-trial rate of 
presentation would be a little faster, slower, 
or the same rate as their study-trial rate of
presentation. There were three random 
orders of the 16 PA pairs to minimize serial- 
learning effects. The Ss were given six 
alternating study and test trials or were 
taken to a criterion of one perfect test trial, 
whichever came first. Following the last 
trial, Ss were presented with a list of the 
16 PA pairs and were instructed to write 
down any mediator or mnemonic that they 
used in learning the pairs. 
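
The blocked assignment described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' actual procedure: the function name, the condition labels, and the use of a seeded pseudorandom generator are all hypothetical. Each of the 15 blocks contains every one of the 18 conditions once in random order, and Ss take the next slot in order of arrival.

```python
import random
from itertools import product

def blocked_schedule(n_blocks=15, seed=None):
    """Return a list of conditions, one per S, in order of arrival."""
    rng = random.Random(seed)
    # 2 (mediation) x 3 (study time, sec.) x 3 (test time, sec.) = 18 conditions
    conditions = list(product(("M", "C"), (1, 2, 4), (1, 2, 4)))
    schedule = []
    for _ in range(n_blocks):
        block = conditions[:]      # every condition appears once per block
        rng.shuffle(block)         # random order within the block
        schedule.extend(block)
    return schedule

schedule = blocked_schedule(seed=0)
print(len(schedule))
```

Blocking in this way guarantees 15 Ss per condition (270 in all) while keeping the conditions balanced over the period of data collection.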


Results 


The mean number of correct re- 
sponses for the six test trials for the 
18 conditions of Exp. III is presented 
in Fig. 1. Subjects reaching criterion 
prior to the sixth test trial were given 
a score of 16 correct on each of the remaining
test trials. As can be seen in
Fig. 1, the groups provided with medi- 
ators were consistently superior to the 
groups not given mediators, clearly 
supporting the results of Exp. I. To 
determine whether the variables pro- 
duced significant effects, a trend analy- 
sis was performed on the number cor- 
rect for the first three test trials. The 
first three trials were selected in an 
effort to minimize ceiling effects. 

Fig. 1. Mean number correct over six trials as a function of mediation,
study time (1, 2, 4 sec.), and test time (1, 2, 4 sec.).

Study time, F (2, 252) = 54.39, test time, F (2, 252) = 68.70, and
mediation, F (1, 252) = 78.82, were all significant, p < .001, sources of
variance. The failure to find a significant interaction between either study
time and mediation, F (2, 252) = 1.33, p > .05, or test time and mediation
(F < 1) is consistent with other recent findings (e.g., Richardson, 1967;
Richardson & Brown, 1966; Spear et al., 1966). The Study Time × Test Time ×
Mediation interaction, F (4, 252) = 4.10, p < .01, seems to be accounted for
by the fact that the difference between the M and C groups with a 1-sec.
study time was greatest with a 4-sec. test time. The difference between the
M and C groups at the 2-sec. study time was greatest at the 1-sec. test
time; the difference between the M and C groups at the 4-sec. study time was
greatest at the 2-sec. test time. Thus, for the present materials, the
difference between the M and C groups appears to depend on particular
combinations of study time and test time. Although Richardson and Brown
failed to obtain a significant interaction of these three variables, a
comparison with the present study may not be meaningful due to many
differences in materials and procedures.

Fig. 2. Mean number of mediators reported as a function of mediation, study
time (1, 2, 4 sec.), and test time (1, 2, 4 sec.).

The mean number of mediators that Ss in each of the 18 conditions reported
following the last trial is presented in Fig. 2. Essentially all Ss provided
with mediators during the study trial reported using a mediator to learn
each of the 16 PA pairs. In almost all cases the mediator provided to S was
the mediator S reported using to connect the pair. If the discovery of
mediators by control Ss not provided with mediators takes place during the
study trial, and the test trial interval is spent in attempting to use the
mediators, then it follows that more mediators should be reported for the
control groups given a longer study time. No increase in the number of
reported mediators should result from an increase in the length of the test
interval. As Fig. 2 indicates, control Ss given more study time reported
significantly more mediators, F (2, 126) = 13.03, p < .001. The test-time
variable and the interaction of Test Time × Study Time were not significant.
A mean Spearman rho correlation of .47 was obtained between the number of
mediators reported by Ss in all of the control groups and the number of
correct responses each S made. The correlations ranged from .12 for the
control group having a 4-sec. study and 2-sec. test time to a correlation of
.88 for the control group having a 4-sec. study time and a 1-sec. test time.
There was no apparent relationship between the degree of the correlation and
the amount of study and/or test time.


Discussion 


The results of the three experiments 
strongly indicate that providing Ss with 
appropriate mediators can markedly facilitate
their learning of letter-word paired
associates. Moreover, it appears that a
functional stimulus or motivational inter- 
pretation of this finding is inappropriate, 
since Ss provided with comparable medi- 
ators not having an obvious connection 
with the responses generally performed 
worse than control Ss not given mediators.
Since there was a clear superiority
for Ss provided with effective mediators, 
this procedure seemed a particularly fea- 
sible one for considering how temporal 
factors influence mediational processes. 
Therefore, the present failure to obtain 
an interaction between mediation and 
either study or test time strongly indicates 
that mediation processes are not influ- 
enced by temporal factors. 

Katz (1963) demonstrated that a medi- 
ated response can take additional time, 
but his materials and procedure differed 
markedly from those used in the present 
study. He made time comparisons for 
responses which were mediated by 0, 1, 
or 2 mediators. However, the strength 
of each connection, whether from a stim- 
ulus to a mediator or from a stimulus to 
the correct response, was approximately 
the same. In the present study, the con- 
nections between the stimuli and medi- 
ators and mediators and responses were 
considerably stronger, prior to the first 
study trial, than the direct stimulus-re- 
sponse connections. Since there is some 
evidence (e.g., Peterson, 1965) that re- 
sponse latencies decrease as the strength 
of the stimulus-response connection in- 
creases, two responses involving strong 
associations (i.e., a mediated response)
may not require any more time than one 
response involving a weak association 
(i.e., a direct response).

STIMULUS CONTROL OF APPROACH BEHAVIOR¹


JOHN W. DONAHOE, VINCENT G. SCHULTE, AND ALAN E. MOULTON


University of Kentucky 


Using 128 rat Ss in a series of experiments employing bar-pressing
tasks, the following sources of stimulus control of the shape of 
approach gradients were identified: (a) external stimuli controlled 
instrumental chains in which response speed was uncorrelated with 
within-chain delay of reinforcement (Exp. I), (b) internal, response- 
produced stimuli controlled chains in which response speed was cor- 
related with within-chain delay (Exp. II and III), and (c) the differ- 
ences in the origins of stimulus control noted in a and b were not a 
function of associated differences in trial duration (Exp. IV). The 
emotional consequences of nonreinforced responding within the inter- 
trial interval were shown to be the origin of differences in the heights 
of approach gradients (Exp. I and II) as such differences were 
eliminated when intertrial responding was prevented (Exp. III) or
extended training was given (Exp. IV).


The present experiments investi- 
gated the effects of external and inter- 
nal (response-produced) stimulation 
on performance within approach chains 
which differed in the correlation of re- 
sponse speed with within-chain delay 
of reinforcement. Fixed-ratio (FR) 
schedules of reinforcement and run- 
way behavior (Keller & Schoenfeld, 
1950, pp. 93-94) are prototypic of 
approach chains in which response 
speed is correlated with within-chain 
delay in that delay of reinforcement is 
a function of the rate of responding. 
Fixed-interval (FI) schedules of rein- 
forcement are prototypic of approach 
chains in which response speed is un- 
correlated with within-chain delay in 
that delay of reinforcement is inde- 
pendent of the rate of responding 
(Morse, 1966, p. 62). Thus, response-produced
stimuli are necessarily correlated
with the duration of within-chain
delay in FR approach chains,
but are uncorrelated with within-chain 
delay in FI chains. 
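
The contrast between the two schedule types can be put in a small sketch. This is an idealized illustration only: the function names are hypothetical, responding is assumed to occur at a steady rate, and the FI case ignores the short wait for the first response after the interval has elapsed.

```python
def fr_delay(ratio, rate):
    """Seconds to reinforcement on a fixed-ratio schedule.

    ratio: responses required; rate: responses per second (steady, > 0).
    """
    return ratio / rate

def fi_delay(interval, rate):
    """Seconds to reinforcement on a fixed-interval schedule (idealized).

    The delay is set by the interval alone, whatever the response rate.
    """
    return float(interval)

for rate in (0.5, 1.0, 2.0):
    print(rate, fr_delay(20, rate), fi_delay(60, rate))
```

Doubling the response rate halves the delay under FR but leaves it unchanged under FI, which is the sense in which response speed is correlated with within-chain delay in the first case and uncorrelated in the second.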

Table 1 summarizes the salient fea- 
tures of the present experiments. The 
rows and columns of the table define 
the relationship of external and inter- 
nal stimulation to within-chain delay 
of reinforcement while the cells contain 
appropriate experimental procedures. 
Consistent with the principle of stim- 
ulus generalization, it was assumed that 
response strength is directly a function 
of the similarity of prevailing stimulus 
conditions to those external and inter- 
nal stimulus conditions present during 
reinforcement. With the relationship 
of internal stimulation to within-chain 
delay held constant, external stimulus 
control of performance within an ap- 
proach chain is identified with differ- 
ences in the shape of the approach 
gradient as a function of the correlation 
of external stimulation with delay in 
reinforcement. 


EXPERIMENT I 


The first study investigated the effects of correlated and uncorrelated
external stimulation on performance within an FI approach chain in which
response rate was not correlated with delay of reinforcement. These are the
two conditions described in the second major column of Table 1.


TABLE 1
EXPERIMENTAL PROCEDURES FOR THE STUDY OF THE EXTERNAL AND INTERNAL
(RESPONSE-PRODUCED) STIMULUS CONTROL OF APPROACH BEHAVIOR

                        Relationship of Internal Stimulation (Response-Produced)
                        to Delay of Reinforcement

Relationship of         Correlated                      Uncorrelated
External Stimulation    Response    Stimulus            Response    Stimulus
to Delay of             Schedule    Schedule            Schedule    Schedule
Reinforcement

Correlated              Fixed       Monotonic change    Fixed       Monotonic change
                        ratio       with responding     interval    with time

Uncorrelated            Fixed       Uncorrelated        Fixed       Uncorrelated
                        ratio       with responding     interval    with time

Note.—The relationship of external and internal stimulation to within-chain
delay of reinforcement is represented as either correlated or uncorrelated
for convenience only. The various relationships may, of course, assume any
value between zero (no correlation) and unity (perfect correlation)
inclusive through the use of variable response and stimulus schedules.


Method 


Subjects—Sixteen male albino rats, 
Sprague-Dawley strain, 90 days old at the 
beginning of the experiment, served as Ss. 

Apparatus—Two test chambers of identi- 
cal construction were used. Each chamber 
was enclosed in a sound-attenuating hull with 
a blower to provide ventilation and a mask- 
ing noise. A response bar was mounted in 
one chamber wall 14 in. above the grid floor 
with a foodcup located 1 in. to the right 
of the bar. A pilot lamp with a 4-in. white 
jewel was mounted 24 in. above the bar. 
This lamp served as the source of external 
stimulation and could be remotely varied in 
intensity from 3 to 20 ftc. in 10 equal loga- 
rithmic increments. The experimental events 
were controlled by electromechanical and 
solid-state switching and timing circuits lo- 
cated in an adjacent room. Responding was 
monitored by counters and cumulative re- 
corders. 

Procedure—All Ss were reduced to 85% 
of free-feeding weight over a 10-day period 
and were given 2 days of feeder training and 
2 days of continuous reinforcement (sixty 
45-mg. pellets per day) under the same ex- 
ternal stimulus condition (20 ftc.) which was 
to prevail during reinforcement in subse-
quent FI training. Following bar-press
training, Ss were randomly subdivided into 
two groups of eight animals. Both groups 


were reinforced using an FI 50-sec. approach 
chain with a constant 90-sec. intertrial period 
in which the test chamber was dark and re- 
sponding was not reinforced. For the group 
with the correlated external stimulus (Group 
C), the light over the bar was illuminated 
at the onset of each trial at an intensity of 
3 ftc. Each 5 sec. thereafter, the intensity 
increased in 10 equal log units until a maxi- 
mum intensity of 20 ftc. was reached after 
45 sec. The light remained at this intensity 
until 5 sec. after the reinforced response 
had been made to ensure that the food pellet
was consumed in the presence of the 20-ftc. 
stimulus intensity. In the group with the 
uncorrelated external stimulus (Group U), 
the light remained at 20 ftc. throughout the 
trial and for 5 sec. after the reinforced
response. 
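
The Group C stimulus schedule described above can be sketched in a few lines. This is only an illustration under stated assumptions: "10 equal log units" is read here as 10 intensity levels separated by nine equal logarithmic steps (which matches onset at 3 ftc. and the maximum of 20 ftc. at 45 sec.), and the function names are hypothetical rather than taken from the original apparatus.

```python
def log_spaced_levels(low=3.0, high=20.0, n_levels=10):
    """Return n_levels intensities (ftc.) equally spaced on a log scale."""
    # Assumption: 10 levels joined by 9 equal log steps.
    ratio = (high / low) ** (1.0 / (n_levels - 1))
    return [low * ratio ** i for i in range(n_levels)]


def intensity_at(elapsed_sec, levels, step_sec=5.0):
    """Intensity in effect at a given time since trial onset (Group C)."""
    # The level advances every step_sec and then holds at the last level.
    index = min(int(elapsed_sec // step_sec), len(levels) - 1)
    return levels[index]


levels = log_spaced_levels()
onset_intensity = intensity_at(0.0, levels)      # first level, 3 ftc.
terminal_intensity = intensity_at(45.0, levels)  # last level, about 20 ftc.
```

For Group U the corresponding schedule would simply return the terminal level at every time point.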

Both Groups C and U were given 20 FI 
trials per day for 15 days with four Ss 
of each group being run in each test cham- 
ber. Responses were recorded in successive 
10-sec. fractions of each trial to secure five 
estimates of response strength within the FI 
approach chain. 


Results and Discussion 


The number of responses in succes- 
sive fractions of the FI trial was trans- 
formed into speed scores by dividing 
the number of responses in each of 
the five fractions by the amount of 
time (10 sec.) available per fraction. 
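
As a minimal sketch of this transformation, the counts in each 10-sec. fraction can be converted to responses per minute, the unit used on the ordinate of Fig. 1. The function name and the example counts are hypothetical and are not data from the experiment.

```python
def fi_speed_scores(response_counts, fraction_sec=10.0):
    """Convert responses counted in each FI fraction to responses per minute."""
    return [count * 60.0 / fraction_sec for count in response_counts]


# Hypothetical counts for the five 10-sec. fractions of one FI 50-sec. trial.
example_counts = [0, 1, 2, 4, 6]
fi_example_speeds = fi_speed_scores(example_counts)
```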

Gradient shape.—Figure 1 depicts
the mean response speed in successive
fractions of the FI chain averaged over
the last 3 days of training. Perform-
ance may be considered stable as, at
this stage of the training, the main
effect of days and all interactions con-
taining the days term were nonsignif-
icant by analysis of variance methods.
As is clear from an inspection of Fig. 1,
the slope of the Group C approach
gradient was steeper than that of
Group U. This impression was con-
firmed by a significant interaction be-
tween stimulus conditions and FI frac-
tions, F (4, 56) = 7.22, p < .001.
The greater steepness of the gradient
obtained in Group C indicates that,
with approach chains in which response
speed is uncorrelated with within-chain
delay, changes in response strength co-
vary with the similarity of prevailing
external stimulus conditions to the
stimulus conditions present during re-
inforcement.

Fig. 1. Fixed-interval approach gradients
as a function of the correlation of external
stimulation with within-chain delay of re-
inforcement. (Ordinate: speed, responses/min.;
abscissa: fixed-interval fractions.)
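
The degrees of freedom attached to the Groups × Fractions interaction can be checked against the design. The sketch below assumes a conventional mixed (split-plot) analysis with one between-groups factor and one repeated factor; the text says only that analysis of variance methods were used, so this layout is an assumption, and the function name is hypothetical.

```python
def mixed_design_df(n_subjects, n_groups, n_repeated_levels):
    """df for a Groups x Repeated-factor interaction and its error term."""
    df_interaction = (n_groups - 1) * (n_repeated_levels - 1)
    df_error = (n_subjects - n_groups) * (n_repeated_levels - 1)
    return df_interaction, df_error


# Exp. I: 16 Ss, 2 groups, 5 FI fractions.
exp1_df = mixed_design_df(n_subjects=16, n_groups=2, n_repeated_levels=5)
```

With 16 Ss, two groups, and five fractions the function returns 4 and 56, the values reported for the interaction.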

Gradient height.—The approach
gradients in Fig. 1 differed in height as
well as in slope, F (1, 14) = 5.82, p <
.05. The greater average speed within
the approach chain of Group C may
be due to either or both of the
following two factors: (a) Inhibitory 
effects from nonreinforced responding 


within the chain may have generalized 
maximally within the FI trial in Group 
U because of the constancy of external 
stimulation but generalized less in 
Group C because of the changing ex- 
ternal stimulus. Thus, because of the 
possibility of greater net generalization 
of inhibition in Group U, responding 
was depressed below that of Group C. 
(b) The emotional consequences of 
nonreinforced, intertrial responding 
may have had differential effects on the 
heights of the FI approach chains with 
correlated and uncorrelated external 
stimuli. Experiment III examined the
empirical question of whether the locus 
of differences in gradient height was 
nonreinforced intratrial or intertrial 
responding. 


EXPERIMENT II ²


In Exp. II, approach behavior was 
studied under the two conditions de- 
scribed in the first major column of 
Table 1. Rate of responding was cor- 
related with delay in reinforcement 
through the use of an FR chain and 
external stimulation was either cor- 
related or uncorrelated with delay in 
reinforcement. 


Method 


Subjects and apparatus.—Sixteen Ss of the
same description as in Exp. I were used in this
study. The same test chambers were also 
employed. 

Procedure.—Following the food deprivation,
feeder training, and bar-press training pro-
cedures used in Exp. I, Ss were divided
into two groups of eight animals. Both
groups were placed on an FR-20 chain. For
Group C (correlated external stimulus), the
light over the bar increased in intensity
from 3 to 20 ftc., with a change every two
responses. Following reinforcement, the
intensity remained at 20 ftc. for 5 sec. In
Group U (uncorrelated external stimulus),
the intensity remained constant at 20 ftc.
during the trial and for 5 sec. after
reinforcement. Both Groups C and U were
given 30 FR trials per day for 10 days.
Trials were separated by a variable
intertrial period with a mean of 90 sec. and
a range of 60-120 sec. during which the test
chamber was dark and responding was not
reinforced. The dependent variable was the
elapsed time (to the nearest .1 sec.) in four
successive, equal fractions of each FR ap-
proach trial.


2 This experiment is based on a thesis
conducted by Alan E. Moulton in partial
fulfillment of requirements for the MS de-
gree at the University of Kentucky.


Results and Discussion 


To secure speed scores, the number 
of responses in each fourth of the FR 
trial (five responses) was divided by 
the time required to complete that 
fraction of the approach chain. 
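
A minimal sketch of this computation follows. The conversion to responses per minute is assumed from the units used in the figures, and the function name and example times are hypothetical rather than recorded data.

```python
def fr_speed_scores(elapsed_sec, responses_per_fraction=5):
    """Speed (responses/min.) in each fraction of an FR trial.

    elapsed_sec holds the time taken to complete each fraction; every
    fraction of an FR-20 trial split into fourths contains 5 responses.
    """
    if any(t <= 0 for t in elapsed_sec):
        raise ValueError("elapsed times must be positive")
    return [responses_per_fraction * 60.0 / t for t in elapsed_sec]


# Hypothetical elapsed times (sec.) for the four fractions of one FR-20 trial.
example_times = [6.0, 3.0, 2.5, 2.0]
fr_example_speeds = fr_speed_scores(example_times)
```

Because the number of responses per fraction is fixed, speed here varies only with the elapsed time, whereas in the FI chain of Exp. I the time per fraction was fixed and the response count varied.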

Gradient shape.—Figure 2 contains
the average approach gradients ob-
tained from Groups C and U during
the last 3 days of training. Unlike
Exp. I, the shapes of the approach
gradients did not differ as a function
of the external stimulus conditions.
This was confirmed by a nonsignificant
interaction of Groups × FR Fractions,
F (3, 42) = 1.64, p > .20. Thus,
under conditions in which internal,
response-produced stimulation was cor-
related with delay in reinforcement,
environmental stimulation exerted no
measurable control over performance
within an approach chain. Since the
approach gradients did not have a
slope of zero, as shown by a significant
difference among FR fractions, F (3,
42) = 25.34, p < .001, the assumption
is made here that performance was
under the control of internal cues.
This assumption is consistent with the
stimulus generalization literature which
indicates greater external stimulus
control with variable-interval schedules
than with variable-ratio schedules of
reinforcement (Thomas & Switalski,
1966). A more direct test of the
assumption is made in Exp. III.

Fig. 2. Fixed-ratio approach gradients as
a function of the correlation of external
stimulation with within-chain delay of
reinforcement.

Gradient height.—As in Exp. I, the 
approach gradient for Group C was 
higher than the gradient for Group U 
(see Fig. 2), F (1, 14) = 20.18, p < 
.001. Again, either differential gen- 
eralization of intratrial or intertrial ex- 
tinction effects may be the locus of the 
disparity of gradient heights. 


EXPERIMENT III ³


Experiment III represents a contin- 
uation of concern with the stimulus 
variables controlling performance 
within an approach chain in which 
internal, response-produced stimuli are 
correlated with delay in reinforcement. 
Unlike Exp. II, however, the present 
study orthogonally manipulated both 
the external and internal stimulus con- 
ditions within the approach chain. 

As in Exp. II, an FR approach chain
was employed and the external stim-
ulus was either correlated or uncorre-
lated with responding. Two correlated
external stimulus conditions were used:
a stimulus whose intensity increased
throughout the approach chain (Group
I) and a stimulus whose intensity de-
creased (Group D). Group D pro-
vided a control for the effects of stim-
ulus intensity independent of the cor-
relation of intensity with delay of re-
inforcement. Two uncorrelated stim-
ulus conditions were also used. For
one group, the stimulus intensity re-
mained constant as in previous experi-
ments (Group C). For the second
group, the stimulus intensity varied
randomly within each approach chain
(Group R). Group R provided a con-
trol for the effects of response-conting-
ent external stimulus change on ap-
proach behavior and also permitted an
evaluation of the hypothesis that differ-
ences in response speed previously
found between correlated and uncor-
related external stimulus conditions re-
sulted from greater net intratrial gen-
eralization of extinction effects under
constant stimulus conditions. If the
hypothesis of differential intratrial gen-
eralization of inhibition is valid, re-
sponse speed in Group C should be less
than in Group R.


3 This experiment is based on a thesis
conducted by Vincent G. Schulte in partial
fulfillment of requirements for the MS de-
gree at the University of Kentucky.

The internal, correlated stimulus 
conditions prevailing throughout the 
approach chain were manipulated by 
using two FR chains of different length 
(FR-20 and FR-40) within each of 
the four conditions of external stimula- 
tion. Increasing the length of the 
chain was designed to increase the dif- 
ference between the response-produced 
stimuli at the beginning and end of 
the trial. To the extent that FR ap- 
proach behavior is under the control 
of internal stimuli, the shape of the 
approach gradients should differ as a 
function of the length of chain con- 
sistent with the principle of stimulus 
generalization. To optimize internal 
control of behavior within the FR 
chain, the bar was retracted from the 
test chamber between trials. Thus, 
unlike Exp. I and II in which respond- 


ing was permitted during the intertrial 
period, each response reduced within- 
chain delay. Prevention of intertrial 
responding also permitted an evalua- 
tion of the hypothesis that the greater 
response speed found under correlated 
external stimulus conditions in Exp. I 
and II arose from nonreinforced inter- 
trial responses. 


Method 


Subjects and apparatus.—Sixty-four Ss 
of the same description as in Exp. I and II 
were used. Two new test chambers of iden- 
tical construction were employed in addition 
to the two test chambers used in Exp. I
and II. 

Procedure—The 64 Ss were randomly 
assigned to one of the eight treatment com- 
binations generated by the four external 
stimulus conditions (I, D, C, and R) and 
two lengths (n) of fixed-ratio schedule 
(FR-20 and FR-40). In all groups with a 
changing external stimulus within each trial, 
the change in stimulus intensity occurred 
every n/10 responses. In Group I, the in- 
tensity increased from 3 to 20 ftc. in 10 
equal logarithmic increments; in Group D, 
the intensity decreased in the same manner 
from 20 to 3 ftc. In Group R, the intensity 
changed every n/10 responses such that all 
10 stimulus intensities were present within 
each FR trial but the change was random.
Across all Ss in Group R, the average in- 
tensity within each fraction of the fixed- 
ratio schedule was equal as the intensities 
were selected according to a Latin-square 
principle. Each S in Group R was exposed 
to four of the 10 possible stimulus orders 
during each day of training. In Group C, 
the stimulus remained at the geometric mean 
of the 10 stimulus intensities, 7.8 ftc., 
throughout each trial. For all groups, 
therefore, the average intensity of the ex- 
ternal stimulus was identical. 
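
The balancing described for Group R and the constant level used for Group C can be illustrated as follows. This is a sketch under stated assumptions: the particular Latin square used is not reported, so a cyclic square built from a shuffled base order stands in for it, and all names are hypothetical. With 10 levels log-spaced from 3 to 20 ftc., the computed geometric mean is about 7.75 ftc., close to the 7.8 ftc. reported.

```python
import math
import random


def log_spaced_levels(low, high, n_levels):
    """Intensities equally spaced on a log scale from low to high."""
    ratio = (high / low) ** (1.0 / (n_levels - 1))
    return [low * ratio ** i for i in range(n_levels)]


def geometric_mean(values):
    return math.exp(sum(math.log(v) for v in values) / len(values))


def cyclic_latin_square(base_order):
    """Each row is the base order rotated by one more position."""
    n = len(base_order)
    return [[base_order[(row + pos) % n] for pos in range(n)]
            for row in range(n)]


intensities = log_spaced_levels(3.0, 20.0, 10)
rng = random.Random(0)  # fixed seed only so the sketch is reproducible
base_order = intensities[:]
rng.shuffle(base_order)
orders = cyclic_latin_square(base_order)  # 10 stimulus orders for Group R

# Every intensity occurs once at each serial position across the 10 orders,
# so the mean intensity is the same in every fraction of the schedule.
position_means = [geometric_mean([order[pos] for order in orders])
                  for pos in range(10)]
constant_level = geometric_mean(intensities)  # level held for Group C
```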

Following reduction to 80% of free-feed- 
ing weight, Ss received 1 day of feeder 
training and 4 days of continuous reinforce-
ment bar-press training (60 pellets per day).
Feeder training and bar-press training oc-
curred under the same external stimulus con-
ditions in which reinforcement was to be
delivered in subsequent FR training. (This 
was true of all but Group R. For half of 
these Ss, the first 2 days of bar-press train-
ing were given under the most intense ex-
ternal stimulus; the last 2 days of bar-press
training were given under the least intense
external stimulus. For the remaining Ss
in Group R, the stimulus order was re- 
versed.) 

All groups were given 12 days of FR 
training, each day consisting of 20 trials with 
a constant intertrial period of 60 sec. during 
which the bar was retracted. On the first 
FR training day, all Ss were given FR-20. 
Half of the Ss remained at FR-20 there- 
after. The other half of the Ss were run at 
FR-30 on the second day of training and at 
FR-40 on the third day and all days there- 
after. 

To differentiate starting speed (SS) from 
speed within the first fraction of the sched- 
ule, the latency of the first response of each 
trial was recorded. In addition, in order to 
obtain a more detailed analysis of speed 
within the approach gradient, the time was 
recorded in 10 successive fractions of each 
FR trial. 


Results and Discussion 


Performance within the approach 
chain was measured by speed scores.
Starting speed was obtained by com- 
puting the reciprocal of response 
latency and multiplying by a constant 
to convert to units of responses per 
minute. Speed measures after the 
first response of the trial were obtained 
for the 10 fractions of the fixed-ratio 
schedule as in Exp. II. The speed 
measures were evaluated by an analy- 
sis of variance having two factors with 
independent levels (FR length and ex- 
ternal stimulus correlation), one nested 
factor (stimulus conditions within the 
correlated and uncorrelated external 
stimulus groups), and two factors with 
dependent levels (training days and 
FR fractions). 
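
The two speed measures can be sketched as below. The constant of 60 assumes that latency and elapsed time are recorded in seconds, and the sketch treats every fraction as containing n/10 responses; both points, like the function names, are assumptions made for illustration.

```python
def starting_speed(latency_sec):
    """Reciprocal of first-response latency, scaled to responses per minute."""
    if latency_sec <= 0:
        raise ValueError("latency must be positive")
    return 60.0 / latency_sec


def fraction_speeds(elapsed_sec, ratio_length, n_fractions=10):
    """Speed (responses/min.) in each of n_fractions equal parts of an FR trial."""
    responses_per_fraction = ratio_length / n_fractions
    return [responses_per_fraction * 60.0 / t for t in elapsed_sec]


example_start = starting_speed(2.0)  # a 2-sec. latency
example_fr40 = fraction_speeds([4.0] * 10, ratio_length=40)
```

Expressing both measures in responses per minute puts starting speed and within-chain speed on a common scale.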

Gradient shape.—Figure 3 depicts
the effects of external stimulus corre-
lation and FR length on the approach
gradient during the last 3 days of train-
ing. The effect of the external stim-
ulus correlation on gradient shape was
nonsignificant as measured by the in-
teraction of stimulus correlation with
FR fractions, F < 1, and with FR
Fractions × Ratio Length, F < 1.
Thus, the major finding of Exp. II
was confirmed: performance within an
approach chain in which response
speed is correlated with delay of rein-
forcement is not under measurable ex-
ternal stimulus control. The possi-
bility that the initiation of the approach
chain was under external stimulus
control was also examined. Again,
there was no evidence of external stim-
ulus control as the starting speed of
the correlated and uncorrelated stim-
ulus conditions did not differ, F (1,
56) = 2.33, p > .10. As additional
evidence of the lack of external stim-
ulus control, no differences in the
shape of the approach gradients were
found between the I and D subgroups
of the correlated condition or between
the R and C subgroups of the uncor-
related condition, F < 1 in each in-
stance.

Fig. 3. Fixed-ratio approach gradients as
a function of the correlation of external
stimulation with within-chain delay of rein-
forcement and chain length.

The effects of internal stimulation 
are shown by a comparison of the 
shapes of the gradients obtained with 
the two lengths of FR chains. As 
depicted in Fig. 3, response speed in 
the earlier fractions of FR-40 was 
less than the response speed of FR-20
but rapidly increased and surpassed the
FR-20 speed in later ratio fractions. 
The differences in gradient shape as a 
function of ratio length were statis- 
tically reliable, F (9, 504) = 5.58, p 
<.001. Also, starting speed was 
greater with the FR-20 schedule than 
with the FR-40 schedule, F (1, 56) 
= 6.65, p < .001. Thus, the assump- 
tion made in Exp. II that approach 
chains in which response speed is cor- 
related with delay of reinforcement are 
under the control of internal stimuli 
associated with responding was sup- 
ported. 

Gradient height—As shown in Fig. 
3, the prevention of intertrial respond- 
ing by retraction of the bar eliminated 
the differences in gradient height pre- 
viously observed between the corre- 
lated and uncorrelated external stimu- 
lus conditions. A statistical evalua- 
tion of differences in gradient height 
as a function of the external stimulus 
correlation was nonsignificant, F < 1. 
This finding is consistent with the in- 
terpretation that the locus of earlier 
differences in gradient height was non- 
reinforced intertrial responding. No 
support was obtained for the hypothe- 
sis that differences in gradient height 
were due to differences in net gener- 
alized intratrial extinction effects as 
the C and R subgroups within the un- 
correlated stimulus condition did not 
differ in response speed, F (1, 56) = 
1.56, p > .20. 


EXPERIMENT IV 


The final experiment of the series 
was designed to evaluate an alternative 
source of differences in external stimu- 
lus control other than variations in the 
correlation of response speed with 
within-chain delay of reinforcement. 
The FI trials in Exp. I were of longer 
duration (50 sec.) than the FR trials 
in Exp. II (approximately 10 sec.). 


Thus, the type of approach chain and, 
hence, differences in the origin of stim- 
ulus control were confounded with the 
duration of the trial. The purpose of
the present experiment was to examine 
the effects of external stimulation on 
FR and FI approach behavior with 
trial duration equated. 


Method 


Subjects and apparatus—Thirty-two Ss at 
80% of free-feeding weight and of the same 
description as preceding experiments were 
used. The four test chambers described in 
Exp. III were employed. 

Procedure—The Ss were assigned at 
random to one of four groups of eight ani- 
mals—FR with a correlated external stimu- 
lus (FR-C), FR with an uncorrelated ex- 
ternal stimulus (FR-U), FI with a cor- 
related external stimulus (FI-C), and FI 
with an uncorrelated external stimulus 
(FI-U). Following 2 days of feeder train- 
ing and 2 days of continuous reinforce- 
ment with conditions strictly analogous to 
previous experiments, all Ss were run for 
24 days (20 trials per day) under the ap- 
propriate combination of approach chain and 
external stimulus correlation. The inter- 
trial period, during which all responses were 
nonreinforced, had a mean value of 75 sec. 
with a range of 60-90 sec. Training for
Groups FI-C and FI-U began immediately 
with the final schedule of FI-20 sec. Train- 
ing for Groups FR-C and FR-U gradually 
approached the final schedule of FR-40 in 
order to prevent the extinction of bar press- 
ing occasioned by the precipitous initiation of 
a long ratio chain. The two FR groups 
were given 1 day of FR-10, 1 day of FR-20, 
2 days of FR-30 and, then, when a number 
of animals began to extinguish, 3 additional 
days of FR-20 followed by 7 days of FR-30. 
For the last 10 training days (Days 15 
through 24), all FR Ss were run at the final 
schedule of FR-40. For groups with 
a correlated external stimulus, the intensity 
of the stimulus light increased in 10 equal 
logarithmic steps every four responses in 
the FR-C group and every 2 sec. in the 
FI-C group. For groups with an uncor- 
related external stimulus (FR-U and FI-U), 
the stimulus light remained at an intensity of 
7.8 ftc. Thus, the geometric mean of the
stimulus intensity was identical for all
groups. The stimulus light remained at
the terminal intensity for 5 sec. after the
last response of the trial to ensure that rein-
forcement occurred during the appropriate
stimulus intensity. During the intertrial
period, the test chamber was completely
dark.

The following dependent variables were 
recorded: (a) the latency of the first re- 
sponse of the trial, (b) the number of 
responses within 10 successive fractions of 
the FI trials, (c) the elapsed time within 
10 successive fractions of the FR trials, and 
(d) the number of responses during the 
intertrial period. 


Results and Discussion 


As in the preceding experiments, all 
figures and statistical analyses were 
based on transformations of the de- 
pendent variables to speed scores (re- 
sponses/min). 

Gradient shape.—Bar-pressing speed 
within 10 successive fractions of the 
FR chain averaged over the last 3 
training days is shown in Fig. 4. The 
mean FR trial duration was 26.4 sec. 
and was, therefore, approximately 
equal to the duration of the FI 20-sec. 
approach chain. The approach gradi- 
ents for both the FR-C and FR-U 
groups were highly similar and were 
adequately described by a single trend 
line having statistically reliable linear 
and quadratic components, F (1, 14) 
= 22.92 and 20.97, respectively, p < 
.001. The omnibus interaction of stim- 
ulus correlation with FR fractions was 
not reliable at any stage of practice, F 
<1, and starting speeds were also 
unaffected by the external stimulus 
correlation, F (1, 14) = 1.67, p > .05. 

By contrast, the shapes of the FI 
approach gradients were clearly af- 
fected by the external stimulus correla- 
tion. As shown in Fig. 5, the FI-C 
gradient was positively accelerated 
whereas the FI-U gradient was nega- 
tively accelerated. The two trend 
lines in Fig. 5 resulted from an ortho-
gonal polynomial analysis which
yielded a significant quadratic com-
ponent of the Stimulus Correlation ×
FI Fractions interaction, F (1, 14) =
6.96, p < .025. Thus, with FR and
FI trial duration equated, earlier work
was confirmed. The shapes of the FI
approach gradients were controlled by
the correlation between external stim-
ulation and within-chain delay of rein-
forcement while the shapes of the FR
approach gradients were not.

Gradient height.—In addition to the
differential effects of stimulus correla- 
tion on gradient shape as a function 
of the type of approach chain, previous 
experiments found approach speed to 
be greater with a correlated than with 
an uncorrelated external stimulus when 
intertrial responding was permitted. 
Approach speed in the present experi- 
ment did not differ during the last 3 
days of training for either the FR or 
FI chains, F < 1 and F (1, 14) =
1.64, p > .20, respectively. Previously
obtained differences in gradient height 
were attributed to the emotional conse- 
quences of nonreinforced intertrial re- 
sponding; however, Terrace (1966) 
has shown that such emotional effects 
disappear with extended discrimina- 
tion training as is suggested by Am- 
sel’s (1962, p. 314) account of the frus- 
tration hypothesis. Since 24 days of 
training were given in the present ex-
periment and only 15 (FI in Exp. I)
or 10 (FR in Exp. II) days in earlier
work, the acquisition data were exam- 
ined for the presence of greater ap- 
proach speeds early in training with a 
correlated stimulus. (Only the acqui- 
sition of FI approach behavior could 
be meaningfully evaluated because of 
the procedural changes necessary to 
achieve the final FR-40 trial length in 
the FR-C and FR-U groups.) Con- 
sistent with the work on extended dis- 
crimination training, the FI-C group 
responded at a higher speed than the 
FI-U group early in training (Days
1-6). The decline in gradient-height 
differences was reliable as shown by 
the significant interaction of Training 
Days × Stimulus Correlation, F (23,
322) = 3.16, p < .001. The presence 
of response-strength differences in the 
present study and in discrimination
learning with errors, together with the 
absence of such differences in extended 
training and in discrimination learning 


without errors (Terrace, 1963), 
strengthens the hypothesis that these 
phenomena are the result of emotional 
factors whose origin is nonreinforced 
responding. 


GENERAL DISCUSSION


The implications of the preceding ex- 
periments for the stimulus control of per- 
formance within an approach chain seem 
quite clear: (a) External stimulation 
controls responding if and only if re- 
sponse speed is uncorrelated with within- 
chain delay of reinforcement. This state- 
ment is consistent with the presence of 
differences in the slopes of the FI ap-
proach gradients in Exp. I and IV and
with the absence of such differences in
the FR approach gradients in Exp. II,
III, and IV. (b) Internal stimulation 
of a response-produced origin controls 
responding if response speed is correlated 
with within-chain delay of reinforcement. 
This statement is consistent with the ob- 
tained differences in the slopes of the FR 
approach gradients associated with FR 
chains of unequal length in Exp. III. 

The finding that external stimulus con- 
trol is dependent on the correlation of 
response speed with within-chain delay 
places boundary conditions on theoret- 
ical efforts attempting to specify the 
mechanisms through which external stim- 
ulus effects are mediated within approach 
chains. For example, one of the stimulus- 
dependent variables affecting the slope of 
the approach gradient in Spence’s (1956, 
pp. 150-151) theory of instrumental 
chaining is incentive motivation (K). 
According to the theory, as the similarity 
of prevailing stimuli to stimuli present 
during reinforcement increases, K is as- 
sumed to increase and, because of the 
multiplicative relationship between K and 
the learning variable (H), response speed 
should increase disproportionally for ani- 
mals given a correlated external stimulus 
as compared to animals given a constant 
external stimulus. The quantitative 
implications of the theory are confirmed 
to a rough approximation by the diverg-
ence of the FI approach gradients in
Exp. I. However, with FR chains in 
which bar-pressing speed and within- 
chain delay are correlated, the slopes of 
the approach gradients were invariant 
over transformations of the external stim- 
ulus. Consistent with this result, other 
experiments employing running responses 
in which within-chain delay is correlated 
with speed, have also failed to alter per- 
formance within an approach chain by 
the manipulation of runway-goal box sim- 
ilarity (DiLollo, 1964; Gonzales & Dia- 
mond, 1960; Hughes, David, & Grice, 
1960). Thus, while a number of theorists 
have alluded to the potential importance 
of response-produced stimuli (Hull, 1952, 
p. 158; Spence, 1956, p. 135), the present 
experiments indicate that a heavy ex- 
planatory burden must be borne by stimuli 
of internal origin in future theoretical 
accounts of approach chains, especially 
those chains in which response speed is 
correlated with within-chain delay. 
The last major finding was that the 
heights of approach gradients with cor- 
related external stimuli were increased 
relative to gradients with uncorrelated ex- 
ternal stimuli if intertrial responding was 
permitted (Exp. I, II, and the initial 
stages of training in Exp. IV) but not if 
intertrial responding was prevented (Exp. 
III). The origin of this selective increase 
in gradient height was, therefore, non- 
reinforced intertrial responses. Such a 
result was shown not to be a function of 
the differential generalization of inhibi- 
tion of intratrial origin and is clearly con- 
trary to expectations based on the gen- 
eralization of inhibition from intertrial 
stimuli to intratrial stimuli, Rather that 
inhibitory mechanisms, the selective in- 
crease in approach speed seems most akin 
to previous reports in the stimulus gen- 
eralization literature of a greater response 
strength to the positive stimulus after 
discrimination training than after single 
stimulus training (Hanson, 1959) and, 
thereby, suggests facilitory mechanisms 
associated with the emotional conse-
quences of nonreinforcement (Amsel,
1962; Terrace, 1966).

120 Ss at each of 6 developmental levels performed an instrumental
lever-pulling task. 6 reward groups (0%, 10%, 30%, 50%, 70%, 
100%) were formed within each age level and measures of response 
speed were taken. The typical finding of faster response speeds for 
partially as compared with continuously rewarded groups was not 
evident across all ages. Furthermore, some partial reward schedules 
other than 50% also led to more vigorous responding as compared
with 100% reward. In terms of frustration theory, it was suggested 
that desire to win and problem-solving strategies employed at different 
age levels, as well as expectancy for reward, may all play important 


roles in determining reaction to nonreward. 


For almost a decade, Amsel’s 
(1958) theory of frustrative nonreward 
has stimulated investigations of in- 
strumental performance of infrahuman 
Ss under conditions of noncontinuous 
reward. Recently, studies with human
Ss have employed the terminology of 
frustrative nonreward theory and have 
made use of performance tasks as well 
as independent and dependent variables 
analogous to those utilized in studies 
with infrahumans. A summary of this 
work is provided in a recent review 
(Ryan & Watson, 1968). 

Investigations with human Ss have 
focused upon (a) the frustration ef- 
fect—faster performance of a second 
response following nonreward as compared
with reward for a first response
(Penney, 1960; Ryan, 1965; Watson
& Ryan, 1966), and (b) resistance to
extinction. Longstreth (1960, 1966a, 
1966b, 1966c) has demonstrated that 
frustration as compared with secondary 
reinforcement theory can more success- 
fully account for extinction perform- 
ance in instrumental tasks with human 
Ss. The present paper is concerned 
with response acquisition. 

A summary of research (Spence, 
1960, Chap. 6) pertaining to discrete- 
trial acquisition in instrumental condi- 
tioning with rats indicates that starting 
and running speeds for a partially re- 
warded group are initially slower, but 
eventually faster than those of a con- 
tinuously rewarded group. In com- 
parable investigations with preschool 
and kindergarten children, the perform- 
ance task has involved pulling a lever 
in order to win candy or marbles. In 
two studies (Bruning, 1964; Ryan & 
Moffitt, 1966), half of the Ss were re- 
warded on a random 50% schedule 
while the remainder received 100% re- 
ward. In both investigations lever- 
movement speeds were significantly 
faster for partially as compared with
continuously rewarded Ss. A similar 
finding has been obtained by Pederson 
(1967) where the 50% reward group 
received valueless marbles—ones which 
could not be traded for a toy, rather 
than no marble on nonrewarded trials. 
Semler and Pederson (1966) have 
shown the same effect entirely within- 
Ss. First-grade children were admin- 
istered 20 trials to each of two stimulus 
lights. Movement speed was faster to 
the light associated with 50% reward 
than to the light that was always re- 
warded. These authors have all in- 
terpreted their findings in terms of a 
nonreward-produced increment in 
drive level. Two investigations 
(Ryan, 1966; Ryan & Voorhoeve, 
1966) have suggested further that
asymptotic response speed bears an 
inverted U-shaped relationship to re- 
ward schedule, at least for young 
children. 

The present investigation sought to 
determine whether this type of relation- 
ship would also be obtained when data 
on various reward schedules were 
gathered across a wide developmental 
scale. An additional interest was 
whether or not schedule of reward led 
to distinguishable effects between the 
two sexes. 


METHOD 


Subjects—Data were collected from 600 
Ss, 120 at each of the following chronological
age levels: Grade 2 (X̄ = 7.5 yr.),
Grade 4 (X̄ = 9.8 yr.), Grade 6 (X̄ = 11.4
yr.), college freshmen (X̄ = 19.8 yr.), and
old age (X̄ = 75 yr.). There were 60 males
and 60 females at each of the levels except 
for old age which had 56 males and 64 fe- 
males. For comparative purposes, data col- 
lected earlier (Ryan & Voorhoeve, 1966) 
from 120 kindergarten (X̄ = 5.45 yr.) Ss
will also be presented. 

Apparatus—The apparatus consisted of a
lever protruding from the face of a 12 × 20
in. box. A hand pattern was located at the
bottom right-hand corner in order to provide
a common starting point for all Ss. The
signal to pull the lever was the onset of a 
š-in. wide light completely surrounding the 
15-in. excursion channel. A goal box was 
centered at the bottom front of the apparatus 
and was covered with clear plastic so that 
the accumulated marbles could be seen but 
not handled. Prior to each response a toggle 
switch was set that controlled whether or 
not S would be rewarded. 

Each trial was initiated by a start button 
which illuminated the stimulus light and 
started a .01-sec. timer. As soon as the
lever was moved, a microswitch opened 
which terminated the first timer and started 
a second. The timers recorded starting time 
(time between stimulus onset and initial 
movement of the lever) and movement time 
(duration of the 15-in. lever pull). The 
controls were positioned about 2 ft. behind S. 

Procedure—The procedure was the same 
across all age levels except where noted. 
The Ss were brought individually to an ex- 
perimental room and shown the lever box. 
The children were informed that this was a 
game which they were to try to win. For 
adult Ss, the word task was substituted for 
game. The Ss were informed that the game 
or task could be won by accumulating many 
marbles in the goal box. A departure from 
the procedure of earlier studies was that the 
marbles were not later traded for a toy or 
some other incentive. The old age Ss, how- 
ever, were given their choice of writing 
paper, kleenex, or cigars after performing 
the task. 

Within each age level, 10 males and 10 
females were randomly assigned to one of 
six reinforcement groups (0%, 10%, 30%, 
50%, 70%, 100%). The one exception to 
this was that the 100% group for old age 
Ss consisted of 8 males and 12 females. The 
reinforcement schedule operated within each 
10 trials; the particular trial or trials on 
which reward was to be delivered were 
randomly selected within each block. 
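
The blocked schedule described above can be sketched as follows. This is only an illustration under stated assumptions: the function name `block_schedule`, the use of Python's `random` module, and the seeding are hypothetical and are not taken from the original procedure, which used a manually set toggle switch.

```python
import random

REWARD_GROUPS = (0, 10, 30, 50, 70, 100)  # percentages named in the text


def block_schedule(percent, n_blocks=4, block_size=10, rng=None):
    """Return a list of booleans (True = rewarded trial).

    Within each block of `block_size` trials, exactly
    percent/100 * block_size trials are rewarded, and their
    positions are chosen at random (hypothetical helper).
    """
    rng = rng or random.Random()
    n_rewards, remainder = divmod(percent * block_size, 100)
    if remainder:
        raise ValueError("percent must give a whole number of rewards per block")
    schedule = []
    for _ in range(n_blocks):
        rewarded = set(rng.sample(range(block_size), n_rewards))
        schedule.extend(i in rewarded for i in range(block_size))
    return schedule


# One 40-trial schedule per reward group; the seed is arbitrary.
schedules = {p: block_schedule(p, rng=random.Random(p)) for p in REWARD_GROUPS}
```

Under this reading, a 10% S receives one reward in each block of 10 trials, or four rewards over the 40 test trials.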

After two rewarded practice trials, all Ss 
were given 40 test trials. Following test 
trials, all Ss in the 0% group were given 10 
rewarded trials in order to relieve any ap- 
prehension. The intertrial interval was 
approximately 10 sec. and a “Ready” signal 
preceded each presentation of the stimulus 
light. 

All time measures were converted to recip- 
rocals (1/t in seconds) and will be referred 
to as speeds. The data were analyzed sep- 
arately for each age level because of marked 
differences in variability across ages. The
design at each level was exactly the same. 
For between-Ss, the main effects were re- 
ward schedule and sex; for within-Ss, the 
main effect was trial blocks (4 blocks of 10 
trials per block). 
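
The conversion of times to speeds and the grouping into trial blocks can be sketched as below. The times shown are hypothetical values chosen for illustration, not data from the study, and the helper names are assumptions.

```python
def to_speeds(times_sec):
    """Convert times in seconds to speeds (reciprocals, 1/t)."""
    return [1.0 / t for t in times_sec]


def block_means(speeds, block_size=10):
    """Mean speed for each consecutive block of `block_size` trials."""
    if len(speeds) % block_size:
        raise ValueError("number of trials must be a multiple of block_size")
    return [
        sum(speeds[i:i + block_size]) / block_size
        for i in range(0, len(speeds), block_size)
    ]


# Hypothetical movement times (sec.) for one S over 40 test trials.
times = [0.50] * 10 + [0.40] * 10 + [0.25] * 10 + [0.20] * 10
means = block_means(to_speeds(times))  # four block means, one per trial block
```

Note that the mean of reciprocals is not the reciprocal of the mean time, so the order of the two steps matters; the sketch follows the order given in the text (convert first, then average).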


RESULTS 


Only the movement-speed analyses 
will be reported. The findings for 
starting speed are similar but the differ- 
ences are not as large resulting in 
slightly fewer significant F ratios. 

Sex and age—The findings for the 
sex variable can be dealt with sum- 
marily. At every age level, except 
kindergarten, movement speed was sig- 
nificantly faster for males as compared 
with females (see Table 1). In ad- 
dition, for old age Ss there was a sig- 
nificant interaction between sex and 
trial blocks, F (3, 324) = 2.97, p< 
.05. This was due to the fact that the 
difference in speed between males and 
females was not as large on the first as 
on the last three trial blocks. The 
main interest in employing the sex 
variable was in order to assess possible 
interactions between reward schedule 
and sex; however, the findings were 
nonsignificant at each of the six age 
levels and consequently, the sex vari- 
able will not be discussed further. 
Table 1 also shows that mean move- 
ment speed increases in an orderly 
manner from kindergarten to the 
freshman level; old age Ss attain a
speed level intermediate to kindergarten
and Grade 2 Ss.

Reward schedule and trial blocks— 
For both old age and freshmen Ss, 
speeds increased significantly over 
training, F (3, 324) = 4.98, p < .01;
F (3, 324) = 14.48, p < .001, respec- 
tively, but reward schedule entered into 
no significant relationships. 
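
The degrees of freedom attached to the F ratios reported here can be checked with a short calculation. The sketch assumes the conventional mixed (split-plot) analysis of variance with Reward Schedule and Sex between Ss and Trial Blocks within Ss; the function name is hypothetical.

```python
def mixed_design_df(n_subjects, n_schedules, n_sexes, n_blocks):
    """Degrees of freedom for a Schedule x Sex (between) x Blocks (within) design."""
    between_error = n_subjects - n_schedules * n_sexes
    return {
        "sex": n_sexes - 1,
        "schedule": n_schedules - 1,
        "between_error": between_error,
        "blocks": n_blocks - 1,
        "schedule_x_blocks": (n_schedules - 1) * (n_blocks - 1),
        "sex_x_blocks": (n_sexes - 1) * (n_blocks - 1),
        "within_error": between_error * (n_blocks - 1),
    }


# 120 Ss per age level, 6 reward schedules, 2 sexes, 4 trial blocks.
df = mixed_design_df(n_subjects=120, n_schedules=6, n_sexes=2, n_blocks=4)
```

With these inputs the between-Ss error term has 108 df and the within-Ss error term 324 df, which agrees with the F (1, 108), F (3, 324), and F (15, 324) values cited in the text.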

Figures 1, 2, and 3 present, respec- 
tively, mean movement speeds of the 
various reward groups for Grades 2, 4, 
and 6 Ss. In each case, the interaction
Reward Schedule × Trial Blocks was
significant, F (15, 324) = 2.12, p < .05;
F (15, 324) = 1.69, p < .05; F (15, 324)
= 2.23, p < .01, respectively. The mean squares for the
within-Ss error terms were .083, .104, 
and .091, respectively. The trial-blocks 
effect was also significant for Grade 2, 
F (3, 324) =7.58, p<.001, and 
Grade 6, F (3, 324) = 12.57, p < .001. 
In no case did the main effect for re- 
ward schedule attain significance. Fig- 
ures 1, 2, and 3 each indicate increasing 
speeds for the partially rewarded 
groups relative to the 100% condition. 
It is noted that performance under 0% 
is markedly different for Grade 2 as 
compared with both Grade 4 and 
Grade 6 Ss. 

The next step involved separate com- 
parisons, within a given age level, of 
each pair of reward groups. For each 
of Grades 2, 4, and 6, 15 analyses were 


TABLE 1
MEAN MOVEMENT SPEEDS FOR MALES AND FEMALES

                                             Error Mean   Overall
                Males   Females   F (1,108)    Square       Mean
Kindergarten    1.79     1.77                   2.08        1.78
Grade 2         2.54     1.98      18.02**      2.08        2.26
Grade 4         3.03     2.62       6.26*       3.35        2.83
Grade 6         3.96     3.09      28.19**      3.22        3.53
Freshmen        4.78     3.46      51.05**      4.09        4.12
Old Age         2.59     1.79      17.62**      4.33        2.19

* p < .05.
** p < .001.

Fig. 1. Mean movement speed of six reward groups as a function of
four blocks of 10 trials for Grade 2 Ss.

Fig. 2. Mean movement speed of six reward groups as a function of
four blocks of 10 trials for Grade 4 Ss.

Fig. 3. Mean movement speed of six reward groups as a function of
four blocks of 10 trials for Grade 6 Ss.


group; (2) significant interactions be- 
tween reward schedule and trial blocks 
with a significant difference on Trial 
Block 4 in favor of the partial group 
(c); and (3) significant interaction
Reward Schedule × Trial Blocks, but
no significant difference between partial
and continuous reward at any individual
trial block (d). Findings from a
previous investigation with kindergar- 
ten Ss (Ryan & Voorhoeve, 1966) are 
also included in Table 2. 

Terminal performance——A final in- 


TABLE 2 


GROUPS ATTAINING FASTER MOVEMENT
SPEEDS THAN GROUP 100


Kindergarten a 


30% ° | 50 


— b 
Grade 2 — 10 30% | 50% | 70%> 
Grade 4 od 10%> | 30%>| 50%> joes 
Grade 6 od 10%> | 30%4| 50%» — 
Freshmen — = i ité pas 
Old Age = = ae = D 


a Data for this group were not analyzed due to an
apparent sampling error (Ryan & Voorhoeve, 1966).
b Partial significantly faster than continuous reward.
c Significant interaction; see text.
d Significant interaction; see text.


terest was to determine the nature of 
the relationship between response speed 
and reward schedule at the end of 
training. A 2 (Sex) × 6 (Schedule)
factorial analysis was performed on 
the last trial block for each of Grades 
2, 4, and 6. Significant effects for re- 
inforcement schedule were obtained for 
Grade 6, F (5, 108) = 2.40, p< .05, 
and Grade 4, F (5, 108) = 2.57; error 
terms for these comparisons were .953 
and .913, respectively. For Grade 2, 
F (5, 108) =2.24 was short of the 
.05 level, probably due to the marked 
drop in performance for the 10% and 
50% groups. A similar analysis was 
then performed on the third trial block 
for Grade 2 and the reinforcement 
schedule effect was significant, F (5, 
108) = 2.37, p < .05; the error mean 
square was .597. Figure 4 presents 
data for the last trial block for Grades 
4 and 6 and the third trial block for 
Grade 2. For each reward group, mean 

Fig. 4. Asymptotic speeds for Grades 2, 4, and 6 Ss. (Ordinate: terminal speeds as ratios of Group 100 speed; abscissa: % reinforcement.)


movement speed on the particular trial
block is plotted as a ratio of the 100%
group mean. These data suggest an
inverted U-shaped relationship between
response speed and schedule of reward.
Trend analyses revealed that the quadratic
component is significant in each
case, but accounts for decreasing
amounts of variance as age level increases:
kindergarten (96%, cf. Ryan
& Voorhoeve, 1966), Grade 2 (92%),
Grade 4 (61%), Grade 6 (43%). For
college and old age Ss, the quadratic
component was, of course, nonsignificant.
There was a suggestion that
the linear trend accounts for increasing
proportions of the variance as age level 
progresses. However, the linear com- 
ponent did not attain significance at 
any age level. 
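
The ratio plotted in Fig. 4 and the share of between-group variance carried by each trend component can be sketched as follows. This is an illustration only: the group means are hypothetical, equal group sizes are assumed, and the contrasts are built by Gram-Schmidt orthogonalization because the six reward levels are unequally spaced. The source does not state how its trend coefficients were obtained.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def orthogonal_contrasts(levels, degree=2):
    """Orthogonal polynomial contrasts (linear, quadratic, ...) for
    possibly unequally spaced levels, via Gram-Schmidt."""
    basis = [[1.0] * len(levels)]
    for d in range(1, degree + 1):
        v = [float(x) ** d for x in levels]
        for b in basis:
            coef = dot(v, b) / dot(b, b)
            v = [vi - coef * bi for vi, bi in zip(v, b)]
        basis.append(v)
    return basis[1:]  # drop the constant term


def trend_proportions(levels, means):
    """Proportion of between-group sum of squares due to the linear
    and quadratic components (equal group sizes assumed)."""
    grand = sum(means) / len(means)
    ss_between = sum((m - grand) ** 2 for m in means)
    return [
        dot(c, means) ** 2 / dot(c, c) / ss_between
        for c in orthogonal_contrasts(levels)
    ]


LEVELS = [0, 10, 30, 50, 70, 100]                     # % reinforcement
group_means = [2.10, 2.45, 2.80, 2.90, 2.70, 2.20]    # hypothetical speeds
ratios = [m / group_means[-1] for m in group_means]   # ratio to the 100% group
linear, quadratic = trend_proportions(LEVELS, ratios)
```

Dividing every mean by the 100% group mean rescales the data without changing the proportions, so the same function can be applied to raw means or to ratios.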


Discussion 


From kindergarten to Grade 6, a well- 
replicated finding is that 50% reward con- 
sistently leads to faster speeds as com- 
pared with 100% reward (Bruning, 1964; 
Pederson, 1967; Ryan, 1965, 1966; Ryan 
& Moffitt, 1966; Ryan & Voorhoeve, 1966; 
Semler & Pederson, 1966). As age level 
increases, it appears that reward schedules
lower than 50% also result in vigorous
responding. Thus, for Grade 2 and 
Grade 4 Ss, 30% reward results in sig- 
nificantly faster responding than 100% 
reward; at the Grade 4 and 6 levels, a 
similar finding is evident for the 10% 
reward schedule. Interpretations of these 
data are as follows. For younger chil- 
dren, some minimum number of rewards 
must be received in order to maintain 
reward expectancy. Unless this is the 
case, nonreward will not lead to a frus- 
tration-produced increment in response 
speed. Reward expectancy, defined in
terms of the number of rewards received, 
probably plays a less important role as 
Ss reach middle childhood. Two alterna- 
tives may be that older Ss set up their 
own expectancies based more upon past 
experience rather than the specific situ- 
ation and/or a “desire” concept may 
have to be incorporated into the theory.
For example, the Grade 4 and Grade 6
children in the 10% reward condition may 
have a low expectancy for reward because 
of the few rewards won, but they still 
“want” to win and nonreward is a rela- 
tively potent frustrating event. It is 
recalled that the 0% group at both the 
Grade 4 and Grade 6 levels showed significantly
increasing speeds over training
relative to the 100% group. These data 
cannot be interpreted in terms of frus- 
trating a reward expectancy, as the latter 
term is presently defined. 

It is not suggested that reward ex- 
pectancy plays no role whatsoever at the 
Grade 4 and Grade 6 levels. At both 
grade levels, the 10% group mean was 
significantly faster as compared with 
100% reward, whereas the 0% group
mean was not. Possibly the four rewards
received over training by the 10% group
served to develop some expectancy, thus
adding another source of frustration on
nonrewarded trials. This line of reasoning
might imply that speeds of a 10%
group should eventually be faster than the
0% group but this was not the case at any
grade level in the present study. Also, it
is not suggested that “desire” plays no 
role in affecting performance at the 
younger age levels. The emphasis is on
the relative importance of expectancy and
desire at different age levels. It remains
for future research to assess the utility
of the desire concept within the present
context.

Whatever process is introduced to ex- 
plain behavioral differences between kin- 
dergarten and middle elementary school 
children would not seem to be applicable
to the adult data. One might argue that, 
with adult Ss, the consistently nonsig- 
nificant findings with respect to reward 
schedule were due to the fact that the 
task was not interesting. However, un- 
structured, posttask interviews lead one to 
reject this interpretation. It is suggested 
that, for adult Ss, a nonreward is not 
arousal producing as is the case for young 
children. Instead, upon experiencing non- 
reward, adult Ss may decide that there is 
a problem to be solved and attempt to 
solve it so that the apparatus will pay off 
more frequently. Attempts at problem 
solving may take the form of trying out 
variations in responding, such as some- 
times moving the lever quickly and some- 
times slowly. Support for this contention 
is indicated in comparisons of variability 
over age levels. The old age and college
Ss were significantly more variable in
response speed as compared with kinder- 
garten and Grade 2 Ss. Variability at 
the Grade 4 and Grade 6 levels was inter- 
mediate to the two extremes, but did not 
differ significantly from either. In in- 
vestigations of probability learning, Weir 
(1964) has already demonstrated differ- 
ential effects of nonreward, as well as 
reward, upon the problem-solving strate- 
gies of Ss at different age levels. 
PSYCHOLOGICAL REFRACTORINESS WITH VARYING
DIFFERENCES BETWEEN TASKS


THOMAS C. WAY AND ROBERT GOTTSDANKER


University of California, Santa Barbara 


2 experiments on psychological refractoriness (PR) were conducted, 
each with 8 different adult human Ss. PR was estimated by the 
increase in mean reaction time to the 2nd of 2 choice tasks when the 
intersignal interval was reduced from 900 to 100 msec. Thus 
measured, PR was greater for a 2nd task opposite in direction to the 
1st than for one which was perpendicular. This held whether or not 
the perpendicular task required a revision of signal-response coding. 
Conclusions were that PR: (a) is a function of the differences 
between the 2 tasks, (b) is dependent on antagonism between common 
elements of the 2 tasks rather than on unrelatedness or absence of 
common elements, and (c) is capable of being analyzed into com- 
ponents representing signal antagonism and response antagonism. It 
was also found that response antagonism increased RT₁, showing that
the 1st response may be affected by the 2nd signal, contrary to the
storage hypothesis of PR.


When successive signals require dis- 
crete movements of different response 
units it is almost invariably found that 
reaction time to the second signal 
(RT₂) is longer at a short intersignal
interval (ISI) than at a long ISI. A 
pair of values of ISI which would show 
this effect clearly are 100 and 900 
msec. This elevation of RT₂ at the
shorter ISI may be said to be a direct 
reflection of a state called psychological 
refractoriness (PR). However, with 
a step-tracking task in which S re- 
sponded to both signals by moving the 
same hand, evidence has been found 
against the effect (Gottsdanker, 1966) ; 
Vince’s (1948, 1950) frequently cited 
contrary conclusions were not supported.

A possible explanation of the dis- 
crepancy between findings in the two- 
response-unit task and in the step- 
tracking task is that the former im- 
poses a greater demand on S: that of 
bringing about a perceptual-motor or- 
ganization different from one which 
had been established earlier. In the 
step-tracking task all that is required 
is a modulation of the existing organi- 
zation since the initial response is or- 
ganized from the outset to include con- 
comitant impelling and braking forces. 
This is the case for all directed motions 
which have been studied (e.g., Levine 
& Kabat, 1952; Slater-Hammel, 1948). 
The foregoing argument implies that 
PR occurs solely because of the neces- 
sity to reorganize the existing context 
of perceptual-motor processes and
states. In an earlier paper (Gottsdanker
& Way, 1966) this was called
the “organization-persistence” explana- 
tion of PR. 

In order to demonstrate that variations
in PR are due to degrees of difference
in the organization, an experiment
must compare PR for conditions
which have clearly identifiable variation 
in the difference between the first and 
second task. This was done in an 
earlier experiment by the writers 
(Gottsdanker & Way, 1966). With 
each of the two tasks a two-direction 
choice, the second signal and response 
could be either in the same or opposite 
direction from the first. The predicted 
greater amount of PR for the opposite 
direction was not found. This failure 
might be accounted for in one of two 
ways: the effect was obscured by an 
artifact, or the hypothesis was inade- 
quate. Some support is given for the 
first possibility in that there were more 
wrong-direction errors when the tasks 
were in the same direction; there was 
a tendency to alternate responses. 

Concerning the second possibility, 
there are two reasonable formulations 
regarding the nature of effective dif- 
ferences between tasks. The antago- 
nism interpretation, which has been 
discussed, holds that reorganization 
is impeded by opposition in common 
components of the two tasks. Opposi- 
tion implies communality. The unre- 
latedness interpretation holds that re- 
organization is difficult and time con- 
suming to the extent that the two tasks 
are unrelated or lack common compon- 
ents. These considerations prompted 
the present study. 


EXPERIMENT I 


An arrangement in which the dis- 
play was integral with the control was 
employed on the chance that the pre- 
viously obtained imbalance of errors 
was somehow due to the particular 
display-control arrangement. In one 
condition, the display-control axes for
the two tasks were parallel. In another
condition, relatedness between the
tasks was reduced by rotating the second
display-control axis 90°. This
second condition would produce great- 
est PR according to the unrelatedness 
interpretation and an amount inter- 
mediate between the opposite and same 
directions according to the antagonism 
interpretation. 


Method 


Subjects—Eight right-handed male in- 
troductory psychology students, ages 17-21, 
served as Ss. 

Task.—After a constant warning period of 
900 msec., S was required to move his left- 
hand lever either toward or away from him- 
self, according to which half of the upper 
surface of the lever was illuminated. Then, 
after the ISI, a second signal of the same 
nature required a choice response with the 
right hand. All time intervals are described 
in terms of signal onset. 

Apparatus—The S performed the task in 
an air-conditioned, sound-insulated room. A
board on the table in front of S was in- 
clined 30° toward him. Mounted side-by- 
side on the board were two cylindrical levers 
ł in. in diameter and 4 in. apart. Each lever 
consisted of two clear plastic pieces, semi- 
circular in cross-section and separated by a 
metal strip vs in. thick. The flat upper sur- 
faces were buffed uniformly translucent. At 
the base of each half of the lever, below the 
level of the mounting board, there was a 
neon lamp. The lighting of one of these 
lamps caused the semicircle above it to glow, 
thus providing the signal to respond. Each 
signal lasted 100 msec. (See Fig. 1 for the 
display-control arrangement.) The semi- 
circular signal emitted 18 ftc. against a 
background which reflected 1.8 apparent ftc. 
Movement of a lever, which required a force 
of 30 gm. through a distance of % in., closed 
a microswitch and constituted a response. 

The programming and recording devices 
were as described by Gottsdanker and Way 
(1966) with the addition of an automatic 
digital printer which recorded RT₁ and RT₂
in milliseconds. 

Procedure: Variables—Trials differed in 
ISI, axis of orientation of the second task, 
and in the relation of the direction of the 
correct response on the two tasks. The two 
values of ISI were 100 and 900 msec. Since 
no PR is expected at the 900-msec. ISI, a 
suitable index of PR is the difference in RT 
between these two ISIs. This eliminated
non-PR effects associated with the other two
variables. The second task either had its
axis parallel to that of the first task, or perpendicular
in a left-right orientation. When
the two tasks were in parallel orientation, the
required second response was randomly either
in the same or opposite direction from the
first. When the second task was in the
perpendicular orientation, the relation of
away on the first task and left on the second,
or toward on the first task and right on the
second was arbitrarily called relation X.
The other two combinations were called Y.
Figure 2 shows this arrangement.

Fig. 1. The S’s display-control arrangement. (Labels: warning light; start and end of block lights.)

Conduct of the experiment—The Ss par- 
ticipated individually in 10 1-hr. sessions 
over a period of 2 wk. In the first 2 ses- 
sions they practiced the various combinations 
of conditions. In each of the remaining
five-block sessions, the first five trials were
considered practice and data collected from
the remaining 20 trials were analyzed. Instructions
included the admonition, “make
each response with as little delay as possible 
after the appearance of the signal for that 
response. You are not to wait for both sig- 
nals to appear before you start to respond.” 
Values of both RTs were reported to S 
immediately after each practice trial. At 
the end of the block, S was informed of his 
shortest and longest RT for both the first 
and second task.

The variables ISI and axis of orientation 
remained constant for a session and were 
counterbalanced for each S across sessions,
different Ss having different counterbalancing 
orders. 


Results 


The findings concerning RT₂ are of
primary interest. In Table 1 the up- 
per row shows the means for the eight 
conditions. There is immediate evi- 
dence against the unrelatedness inter- 
pretation in that at the 100-msec. ISI 
the means for the perpendicular direc- 
tional relation were not the highest. 
Instead, they were intermediate be- 
tween those for opposite direction and 
same direction, as would be expected 
from the antagonism interpretation. 
This order was a PR effect as can be 
seen in the entirely different order at 
the 900-msec. ISI. Next is shown the 
index of PR obtained by subtracting 
the mean RT₂ at 900 msec. from that
at 100 msec. Since there was only an 
arbitrary difference between the X and 
Y directional relations, the values for 
the two conditions were combined. 

Fig. 2. Signal-response relationships for Exp. I. (Panels: first task, all conditions; second task, parallel or perpendicular axes.)

While the order was that predicted by
the antagonism interpretation, the
closeness of the values for the opposite
and perpendicular conditions raised the 
question of the reliability of the order. 
Each S was treated as a replication for 
confirmation to the predicted order, 
and the L statistic (Page, 1963) con- 
firmed the prediction, L (eight Ss, 
three conditions) = 107; p < .01. 
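
The L statistic cited above (Page, 1963) is obtained by ranking the conditions within each S, summing the ranks for each condition, and weighting each rank sum by its position in the predicted order. A minimal sketch follows; the PR values are hypothetical and are not the data of the experiment, and the significance test itself (tables or a normal approximation) is not reproduced here.

```python
def rank_row(values):
    """Ranks 1..k within one S, with tied values given their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def page_l(rows):
    """Page's L. Each row is one S; columns must already be arranged in the
    predicted order, from the smallest to the largest expected value."""
    k = len(rows[0])
    col_sums = [0.0] * k
    for row in rows:
        for j, r in enumerate(rank_row(row)):
            col_sums[j] += r
    return sum((j + 1) * s for j, s in enumerate(col_sums))


# Hypothetical PR indexes (msec.) for eight Ss; columns in the order
# predicted by the antagonism interpretation: same, perpendicular, opposite.
hypothetical_pr = [
    [40, 70, 85],
    [55, 80, 78],  # this S reverses the last two conditions
    [60, 72, 90],
    [35, 66, 70],
    [58, 75, 88],
    [62, 79, 81],
    [49, 77, 83],
    [51, 73, 76],
]
L = page_l(hypothetical_pr)
```

With eight Ss and three conditions, perfect agreement with the predicted order gives the maximum L of 8 × (1 + 4 + 9) = 112, so the obtained value of 107 reflects only a few departures from the predicted ranking.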
Additional findings—RT₁ was strikingly
higher for the 100-msec. ISI than
for the 900-msec. ISI. While this
poses a serious problem for testing the
hypothesis of the exact extent to which
elevation of RT₂ is affected by ISI
(Gottsdanker & Way, 1966), it does
not prevent the testing of different conditions
at the same ISI. There was
another interesting feature of the RT₁
results: at the 100-msec. ISI, mean
RT₁ was noticeably longer for the opposite
direction than for the same direction.
This effect was shown by all
eight Ss. Values for the perpendicular
conditions were very similar to those
for the same direction.
Product-moment correlations between
RT₁ and RT₂ were averaged across the
eight Ss using the Z transform
(Fisher, 1941). As has generally
been found, the correlation was higher
at the short than at the long interval.
To date no real effort has been made
to incorporate this fact into an explanation
of PR.

Almost all errors in the experiment
were wrong-direction second responses;
consequently they are the only ones
shown in Table 1. It is seen that there
were somewhat more errors at the 100-msec.
ISI for same direction than for
different direction; the imbalance of
the previous study was not abolished.
Since RT₂ for detected wrong-direction
responses was lower than the
mean reported for the other trials, an
adjustment for undetected “lucky
guesses” would increase the values
slightly. However, this would not
affect the analyses which have been
described.


EXPERIMENT II


Experiment I supported the antagonism
interpretation of PR and not
the unrelatedness interpretation. It
TABLE 1
MEAN RTs AND OTHER SUMMARIZATIONS, EXP. I

                                          Axis orientation
                               Parallel                     Perpendicular
                         Same          Opposite           X               Y
Statistic              Direction       Direction       Relation        Relation
                      100    900     100    900      100    900     100    900
                      ISI    ISI     ISI    ISI      ISI    ISI     ISI    ISI
Mean RT₂              327    273     348    267      337    261     332    258
PR Index = RT₂(100)
  − RT₂(900)              54             81                     75
Mean RT₁              316    272     330    274      314    271     312    213
Mean r, RT₁ and RT₂   .60  [illegible] .62  .17      .69    .16     .65  [illegible]
Percentage Second
  Responses in Wrong
  Direction           8.0    4.8     5.8    5.5       13    4.8     8.6    5.5

Note.—In milliseconds except correlation coefficients and error percentages. Each primary cell is based on
error-free trials. Of the potential 800 trials, the lowest frequency for any cell was 691. Across conditions, the coefficients
of variation (standard deviation/the mean) were between 17% and 20% for RT₁ and between
18% and 22% for RT₂.
may be contended that though the effect
of antagonism cannot be denied, un- 
relatedness was not fairly tested be- 
cause the perpendicular second task 
was not sufficiently unrelated to the 
first task. They both used the same
unequivocal code for direction of re- 
sponse. If the code were changed be- 
tween tasks, unrelatedness might yet 
prove to be a stronger factor in PR 
than antagonism. If this is not true 
and antagonism is the only required 
interpretation, then it might be possible 
to analyze the separate contributions of 
signal difference and response differ- 
ence. 

Experiment II bore on these two 
goals. It was accomplished through a 
second task in which the signal could 
be the same or opposite to that of the 
first, but in which the response direc- 
tion was perpendicular. The problem 
that had to be resolved was to do this 
without introducing differential signal- 
response compatibility. 


Method 


Experiment II was conducted the same as 
Exp. I except that the lines dividing the 
top surfaces of the levers were set as 
shown in Fig. 3. Thus, both parallel and 
perpendicular second responses were made to 
the same signals, and with the same degree 
of signal-response compatibility. Eight dif- 
ferent Ss were drawn from the same pool 
as that used in Exp. I.


Results 
Again, response direction could be
the same or opposite in the two tasks
only if the response axes were parallel.
However, in this experiment, signal 
directions could be same or opposite 
with either response axis setting. It is 
seen (Table 2) that the RT₂ means at
the 100-msec. ISI for perpendicular 
axis orientation were again intermedi- 
ate. Thus the expectation based on 
unrelatedness was again not upheld. 
Rather, there was a precise correspond- 
ence with the predictions which fol- 
low from the antagonism interpreta- 
tion: the highest value when both 
signal and response were opposite, 
next when signal was opposite and re- 
sponse was perpendicular, next when 
signal was the same and response per- 
pendicular, and least when both signal 
and response were the same. That 
these effects were due to PR is seen 
in the virtual absence of differences 
between conditions at the 900-msec. 
ISI. The PR index (obtained by subtracting
RT₂ at 900 msec. from that
at the 100-msec. ISI) maintained the 
same order. The L statistic shows 
the order to be reliably consistent, L 
(eight Ss, four conditions) = 228, 
p < .001.
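
The PR index and its ordering can be checked directly from the RT₂ means in Table 2. In the sketch below the means are those reported in the table; the condition labels and variable names are assumptions introduced only for illustration.

```python
# (mean RT2 at the 100-msec. ISI, mean RT2 at the 900-msec. ISI), in msec.,
# listed in the order predicted by the antagonism interpretation,
# from least to greatest PR.
MEAN_RT2 = {
    "parallel, same signal and response": (320, 247),
    "perpendicular, same signal": (331, 249),
    "perpendicular, opposite signal": (344, 246),
    "parallel, opposite signal and response": (352, 244),
}

# PR index = RT2(100) - RT2(900) for each condition.
pr_index = {cond: short - long for cond, (short, long) in MEAN_RT2.items()}

predicted_order = list(MEAN_RT2)
observed_order = sorted(pr_index, key=pr_index.get)
order_confirmed = observed_order == predicted_order
```

The resulting indexes are 73, 82, 98, and 108 msec., so the group means fall in the predicted order; the L statistic then tests whether the individual Ss conform to that order reliably.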

As for additional findings, there was
again the large difference in RT₁ between
the two ISIs. For parallel response
axes at the 100-msec. ISI, opposite
direction of response again had
a higher mean RT₁ than did same direction.
The effect was shown by all
eight Ss. The values for perpendicular
axes were almost identical and very

Fig. 3. Signal-response relationships for Exp. II.
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TABLE 2 
MEAN RTs AND OTHER SUMMARIZATIONS, Exp, II 
Axis Orientation 
Parallel Perpendicular 
Statistic Si O; i Si i 
N Signal Signal Signal Berl 
Direction Direction Direction Direction 

100 900 100 900 100 900 100 900 

ISI ISI ISI ISI ISI ISI ISI ISI 
Mean RT: 320 247 352 244 331 249 344 246 
PR Index = RT:(100) 

— RT:(900) 73 8 82 98 
Mean RT: 309 250 327 251 301 255 303 252 
Mean r, RT; and RT: 64 21 63 16 56 23 69 .05 
Percentage Second 

Responses in Wrong 

Direction 12.1 6.1 14.3 2.5 | 13.0 Sil 17.1 4.0 


Note.—In milliseconds except correlation coefficients and error percentages. Eaca primary cell is based on 
error-free trials, Of the potential 800 trials, the lowest frequency for any cell was 628. Across conditions, the coeffi- 
cients ot variation were between 17% and 24% for RT: and between 18% and 25% for RTs. 


slightly lower than for same direction 
with parallel axes. Errors were some- 
what more frequent for the 100-msec. 
interval than in Exp. I but the imbal- 
ance was slightly reversed, there being 
more in the opposite direction. 


Discussion 


In neither experiment could PR be as- 
cribed to unrelatedness between tasks. 
Instead it appears that PR can be ac- 
counted for by antagonism between com- 
mon elements of the two tasks. In 
accordance with the antagonism inter- 
pretation, Exp. I demonstrated that PR 
was greatest when the two tasks were 
antagonistic, intermediate when they were 
unrelated, and minimal when they were in 
agreement. 

Experiment II supported this conclusion
and showed further that the effects
of signal antagonism and response antagonism
can be separated. A subtractive
procedure can be employed to evaluate
these separate contributions to PR. First,
with the response axes perpendicular, the
PR index when the signals were the same
(82 msec.) is subtracted from that when
they were opposite (98 msec.) to yield an
estimate of 16 msec. contributed by signal
antagonism, with responses unrelated.
Since the overall difference in the PR index
between the parallel-opposite and
parallel-same conditions was 35 msec., the
contribution of response antagonism was
19 msec. The
writers are quite aware that the values 
used in this derivation may not have been 
sufficiently reliable. Their order was 
tested but the values ranked were not. 
The outcome of these manipulations may 
be fortuitous but it is amenable to further 
empirical test. 
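
The subtractive procedure described above can be retraced from the mean RT₂ values in Table 2. The following is a minimal sketch of that arithmetic only; the dictionary layout and the names `rt2`, `pr_index`, `signal_part`, and `response_part` are illustrative assumptions, and nothing here tests the reliability of the values, which the writers themselves question.

```python
# Mean RT2 (msec.) from Table 2, keyed by (axis orientation, signal direction),
# with the 100-msec. and 900-msec. ISI values for each condition.
rt2 = {
    ("parallel", "same"): {100: 320, 900: 247},
    ("parallel", "opposite"): {100: 352, 900: 244},
    ("perpendicular", "same"): {100: 331, 900: 249},
    ("perpendicular", "opposite"): {100: 344, 900: 246},
}


def pr_index(cell):
    """PR index: mean RT2 at the 100-msec. ISI minus mean RT2 at 900 msec."""
    return cell[100] - cell[900]


pr = {condition: pr_index(cell) for condition, cell in rt2.items()}

# Signal antagonism: opposite minus same signals, responses perpendicular.
signal_part = pr[("perpendicular", "opposite")] - pr[("perpendicular", "same")]

# Overall difference: both opposite minus both same (parallel axes).
overall = pr[("parallel", "opposite")] - pr[("parallel", "same")]

# Response antagonism is what remains of the overall difference.
response_part = overall - signal_part

print(pr, signal_part, overall, response_part)
```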

More detailed study of the elements of 
a signal-response process may reveal dif- 
ferences in the amount of initial effect for 
each element and in its rate of decay. 
An experiment with more ISI values may 
indicate, for example, that signal antag- 
onism and response antagonism have dif- 
ferent time courses in maintaining their 
influence.

In none of the conditions of these ex- 
periments was PR entirely eliminated. It 
might be argued that though PR was 
affected by these variables their influence 
was so slight that they were not major 
determiners of the behavior. Is this the 
case? In Exp. I the index of PR when
the two tasks were the same was 54 msec.;
this was increased by 27 msec. or 50%
when the tasks were opposite. The comparable
value for Exp. II was 48%.
These are not slight effects. Moreover, 
each of the 16 Ss had a higher index for 
opposite than for same tasks. From the 
difference in result between step-tracking 
and two-response-unit tasks, it is argued 
that a large portion of the remaining PR 
here is due to the requirement to change 
hands between tasks. 

For a full account of S’s behavior, it 
is necessary to consider the effect of experimental
conditions on RT₁. Variation
of RT₁ with condition may be attributed
either to before-trial tendencies or to
within-trial effects. Further, since the
circumstances of the first task itself did
not vary across conditions in either experiment,
within-trial effects were confined
to those associated with the second
signal.

The most striking effect is the markedly
higher mean RT₁ at the 100-msec. interval.
This is largely a before-trial
tendency since ISI was constant over a
block of trials and since it has been shown
before that, under these conditions, RT₁
is elevated over its normal value when the
second signal occurs during or shortly
after the first response (Fraisse, 1957;
Gottsdanker, Broadbent, & Van Sant,
1963; Slater-Hammel, 1958).

In both experiments, at the 100-msec. 
ISI, mean RT₁ was about 15 msec. higher
when the axes were parallel and the direc- 
tions opposite than when the axes were 
parallel and the directions same. Since 
the elevation did not occur when only the 
signals were opposite, this can be con- 
sidered a within-trial response antagonism 
effect. It is reasonable that the effect be 
only on the response side. Presumably
interpretation of the first signal was ac- 
complished before the second signal could 
compete. 

Gottsdanker (1966) presented evidence 
against the assertion of the “storage hy- 
pothesis” of PR that the first response is 
impervious to the second signal. The 
present RT₁ results corroborate Gottsdanker’s
evidence.

Differences in RT₁ with axis orientation
could result either from before-trial
tendencies or from within-trial effects.
If the former is the case, perhaps the RT₁
differences should be reconsidered from
the point of view of trade-off strategy: a
longer RT₁ might help produce a shorter
RT₂. If it is assumed that the entire difference
in RT₁ associated with axis orientation
represents a before-trial tendency
and, further, that it takes the form of a
trade-off strategy, then the PR index for
each condition should be corrected by
adding the mean RT₁ for that axis orientation.
Granting these two assumptions,
the correction would better represent the
effect of a condition on RT₂. In Exp. I,
the order of PR indexes is maintained
after this correction, but the value for
the perpendicular condition is moved
closer to that for parallel-same. In
Exp. II, the order of indexes is disturbed
and the conclusion is forced that the only
within-trial condition which increases
PR is response antagonism.

However, the assumptions required for
this treatment should be examined. First,
the 15-msec. parallel-opposite elevation of
RT₁ is probably not entirely a before-trial
affair. Rather, it appears to be an interaction,
Axis Orientation (possibly before
trial) × Response Direction (clearly
within trial), with no simple effect. It
is difficult to conceive of this leading to
a strategy. Second, with what confidence
could the before-trial portion of the increase
in RT₁ be taken as a demonstration
of a trade-off strategy? In the earlier
study (Gottsdanker & Way, 1966), for
constant ISI, the mean RT₂ at 100 msec.
was not below that for 50 msec., as would
be expected on the basis of PR, despite
the fact that mean RT₁ was higher at the
longer interval. Moreover, as compared
with the varied condition in which RT₁
differences did not appear, the correlation
between RT₁ and RT₂ was higher, not
lower. This is evidence against trade-off
(a still higher correlation was found in
the present study). Such consideration
led the writers to conclude, “It is curious
that S seems to have adopted a long RT₁
as a technique at the 100- and 200-msec.
intervals, when it is quite clear that this
does not lead to a short RT₂ [p. 802].”
Third, even with the trade-off correction,
there remains the 14-msec. difference in
the PR index between same and opposite
signal direction with perpendicular response
axes. This remains attributable
only to signal antagonism. Finally,
grouping did not exist in the sense of
waiting for the second signal before making
the first response. The strong instructions
on this point were effective.
The writers see no reason to change their
conclusion that PR was affected by signal
antagonism as well as by response
antagonism.

Since only two degrees of signal dif- 
ference and three degrees of response 
difference were employed in these experi- 
ments, this is hardly a parametric evalu- 
ation of task antagonism. It is the hunch 
of these writers that Ss would quantize 
deviations from same and opposite and 
that functions of PR with opposite re- 
sponses against objective difference would 
not be continuous. That is an empirical 
question involving scaling of differences. 

It should finally be pointed out that no 
explanation was given for the failure to 
obtain higher PR in the earlier experi- 
ment (Gottsdanker & Way, 1966). The 
effect was obtained in Exp. I here even 
with an imbalance of wrong-direction errors.
Confusion effects from the particular
display-control relationship seem
absolved; the remaining possibilities are that
Ss in the earlier experiment were subjected
to more ISIs, were given varied as
well as constant intervals, and had control
trials in which no first signal was given
and no first response required. Sensitivity
of RT₁ to within-trial variables
was not found in the earlier experiment,
either. This difference also lacks
explanation.


REFERENCES 


FISHER, R. Statistical methods for research
workers. (8th ed.) Edinburgh: Oliver &
Boyd, 1941.

FRAISSE, P. La période réfractaire psychologique.
L’Année Psychologique, 1957,
57, 315-328.

GOTTSDANKER, R. The effect of superseding
signals. Quarterly Journal of Experimental
Psychology, 1966, 18, 236-249.

GOTTSDANKER, R., BROADBENT, L., & VAN
SANT, C. Reaction time to single and to
first signals. Journal of Experimental
Psychology, 1963, 66, 163-167.

GOTTSDANKER, R., & WAY, T. C. Varied
and constant intersignal intervals in psychological
refractoriness. Journal of Experimental
Psychology, 1966, 72, 792-804.

LEVINE, M. G., & KABAT, H. Cocontraction
and reciprocal innervation in voluntary
movement in man. Science, 1952, 116,
115-118.

PAGE, E. B. Ordered hypotheses for multiple
treatments: A significance test for
linear ranks. Journal of the American
Statistical Association, 1963, 58, 216-230.

SLATER-HAMMEL, A. T. Action current
study of contraction-movement relationships
in golf stroke. Research Quarterly
of the American Association for Health,
Physical Education, and Recreation, 1948,
19, 164-177.

SLATER-HAMMEL, A. T. Psychological refractory
period in simple paired responses.
Research Quarterly of the American Association
for Health, Physical Education,
and Recreation, 1958, 29, 468-481.

VINCE, M. A. The intermittency of control
movements and the psychological refractory
period. British Journal of Psychology,
1948, 38, 149-157.

VINCE, M. A. Some exceptions to the psychological
refractory period in unskilled
manual responses. APU Report No. 124,
1950, Medical Research Council, Psychological
Laboratory, Cambridge, England.


(Received August 5, 1967) 


Journal of Experimental Psychology 
1968, Vol. 78, No. 1, 46-49 


ALPHA RHYTHM AND TIME JUDGMENTS 


C. F. LEGG 
University of Cambridge, England 


Previous studies have indicated correlations between time judgments
and certain characteristics of the EEG. Here, using intervals between
.5 and 8 sec., no significant correlations were found between judgment
lengths and mean alpha rate or “alpha index.” Reliable if
weak associations between judgments and aspects of concurrent EEG
may yet be established.


The only previous investigations of 
the relation between the EEG and 
time judgments, by Werboff (1962) 
and Dureman and Edstrom (1964), 
indicated a negative correlation be- 
tween the dominant spontaneous EEG 
frequency and the length of judgments 
of short time intervals. Werboff also 
reported a negative correlation be- 
tween the frequency of the EEG con- 
current with time judgments and the 
length of those judgments. In a dif- 
ferent context, it has been suggested 
by a number of authors (see Ellingson, 
1956; White, 1963) that the cycle 
period of the alpha rhythm determines 
the duration of the “psychological mo- 
ment” (postulated to be the psycho- 
logically indivisible unit of time). To- 
gether these points suggest the hypoth- 
esis that the alpha cycle forms the unit 
of subjective time in time judgments, 
as if each cycle represented a “tick” of 
the “subjective clock.” It would follow
that the rate at which subjective
time passes is determined by the frequency
of the alpha rhythm. Two
predictions arise from this hypothesis
and the previous results: that the length
of an S’s judgment of a short interval of
time will be negatively correlated with
(a) the frequency of his spontaneous,
resting alpha rhythm, and (b) in particular,
the frequency of his alpha
rhythm during the time interval being
judged. Two further predictions arising
from Werboff’s results are (a)
that the length of such judgments will
be positively correlated with S’s “alpha
index” (i.e., the proportion of time
that the alpha rhythm dominates his
spontaneous EEG), and (b) that judgments
made with eyes closed (EC) will
be longer than those made with eyes
open (EO) since the frequency of the
concurrent EEG is presumably lower
in the EC case.


EXPERIMENT I 
Method 


Subjects—Twelve males, aged 17-25 yr., 
each completed one test session of 55 min. 

Placing of electrodes.—Pad electrodes
were placed bilaterally as follows: occipital, 
1 cm. above the inion and 2 cm. from the 
midline; parietal, above the ears and 4 cm. 
from the midline. Traces were recorded 
from left and right occipital-parietal connec- 
tions on two channels of an A.E.I. Mark III 
pen recorder with a standard deflection of 1 
cm. per 50 µV. The Ss were discarded if
either of their interelectrode resistances was 
above 5,000 ohms. 

Preliminary routine—In a half reclining 
position in dim illumination S was asked 
alternatively to open and close his eyes. 
This cycle was repeated 10 times in about 2 
min. and was usually sufficient to produce a 
clear alpha rhythm in the EC state. The Ss 
were discarded if they showed no alpha 
rhythm, on the grounds that an adverse reaction
to the experimental situation might be
masking their natural spontaneous EEG.

Spontaneous EEG.—One minute of EEG
was recorded with S resting in dim illumina- 
tion with his eyes closed (the EC condition). 

Time judgments and concurrent alpha 
rhythm.—Each S made 16 judgments each of 
.5, 1, 2, and 3 sec. by the method of pro- 
duction without counting (see Bindra & 
Waksberg, 1956) under two conditions: (a) 
EC and (b) with his eyes open and attend- 
ing to a cross 5 ft. away on a well-illumi- 
nated wall (the EO condition). Responses 
were made by depressing a Morse key which 
produced a deflection on another channel of 
the pen recorder. The order of presentation 
of the eight types of trial—four intervals 
and two conditions—was pseudorandom, the 
constraint being that four trials of each type 
should occur in each quarter of the sequence. 
Each trial was as follows: the appropriate 
lighting condition was set up; the S was in- 
structed as to whether it was an EC or EO 
trial and which interval he was to judge; 
after a short pause, S made his judgment. 
The duration of the trials was reasonably 
constant at 7-8 sec. One practice trial of 
each type of trial was given at the beginning 
of the session; knowledge of results was 
withheld.

Treatment of data.—The total length of
the two traces of spontaneous EEG for
each S was measured and, using the alpha
rhythm produced in the preliminary routine
as a guide, the traces were divided (by eye)
into those parts showing clear alpha rhythm
(A parts); some alpha rhythm (S parts,
i.e., it was difficult to determine whether it
was alpha rhythm or not); or no alpha
rhythm (N parts). The total length of each
type of pattern was determined and the alpha
index (i.e., A/(A + S + N)) calculated. The
duration of each time judgment was determined
to the nearest 1/60 of a second and
the EEG during the judgment classified as
A, S, or N. For each A judgment (i.e.,
those filled with alpha rhythm) the mean
frequency of the concurrent rhythm was
determined by counting cycles of oscillation.


Results 


EC vs. EO.—For each interval, a 
Mann-Whitney U test (see Siegel, 
1956) was used to compare each S’s 
EC and EO judgments. As no signifi- 
cant differences were found, each S’s 
judgments of each interval were then 
pooled and the overall median value
calculated. 


Mean alpha rate—For each interval, 
a Kendall’s rank-correlation coefficient, 
tau, was calculated for the relation be- 
tween Ss’ mean alpha rates (deter- 
mined from the spontaneous EEG and 
that in the A judgments) and their 
median judgments. The tau values ob- 
tained for the .5-, 1-, 2-, and 3-sec. 
intervals were +.24, +.06, −.06, and
−.18, respectively (the 5% one-tailed
significance value is .39), indicating 
only that significant results might per- 
haps be obtained with longer or shorter 
intervals. 
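
For readers unfamiliar with the statistic, the following is a minimal sketch of how a Kendall rank-correlation coefficient of this kind is obtained by counting concordant and discordant pairs. It assumes no tied values (the tau-a form), and the function name `kendall_tau` and the four data points are hypothetical, not values from this experiment.

```python
from itertools import combinations


def kendall_tau(x, y):
    """Kendall's tau for paired observations, assuming no ties (tau-a)."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
    n_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / n_pairs


# Hypothetical data for four Ss (illustration only).
alpha_rate = [9.5, 10.0, 10.5, 11.0]       # mean alpha rate, cycles per second
median_judgment = [2.4, 2.1, 2.2, 1.8]     # median judgment, seconds

# 1 concordant and 5 discordant pairs give tau = -4/6.
tau = kendall_tau(alpha_rate, median_judgment)
print(round(tau, 2))
```

A negative value of this kind is what the hypothesis predicts: Ss with faster alpha rhythms would give shorter judgments.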

Concurrent alpha rhythm frequen- 
cy.—Where an S had four or more A 
judgments of an interval, a tau value 
was calculated for the relation between 
alpha frequency and length of judg- 
ment. The values obtained and the 
sizes of the samples from which they 
were calculated are shown in Table 1. 
Of the 28 coefficients, 15 are negative; 
4 of the negative values are statistically 
significant, whereas only 1 of the posi- 
tive values is. This is similar to Wer- 
boff’s results—a distribution of coeffi- 
cients skewed towards negative values. 

If the dominant frequency of the con- 
current EEG influences the length of a 
judgment, it could be predicted that A 
judgments would be longer than N 
judgments since the frequency during 
the former would tend to be lower. 
Therefore, it is worth noting that, for 
those Ss who gave both A and N judg- 
ments for the .5-, 1-, 2-, and 3-sec. in- 
tervals, only 4 out of 10, 6 out of 11, 4 
out of 10, and 2 out of 7, respectively, 
gave longer judgments in the A state 
than in the N state. 

TABLE 1
TAU VALUES FOR THE RELATION BETWEEN LENGTH OF JUDGMENT AND CONCURRENT ALPHA
FREQUENCY AND THE SIZES OF THE SAMPLES (N) FROM WHICH THEY ARE CALCULATED

                          Interval in Seconds
           0.5            1.0            2.0            3.0
 S      Tau    N       Tau    N       Tau    N       Tau    N
 1     −0.59   7      +0.60   ?        —     —        —     —
 2     +0.13  20      −0.29  17      −0.54  14      −0.01  15
 3     −0.24  10      −0.27  10      +0.24  11      −0.09  10
 4     +0.06   9      +0.02  10        —     —      +0.05   7
 5     +0.02  14      −0.11  10      +0.67   4        —     —
 6     −0.31   8       0.00  14      −0.11  15      −0.24   7
 7     −0.41  10      +0.03  13      +0.20   5        —     —
 8       —     —        —     —        —     —        —     —
 9     +0.20   6
10     −0.22  12      −0.55  11      +0.74   5        —     —
11     +0.15   7        —     —      −0.29   7        —     —
12       —     —        —     —        —     —        —     —

Alpha index.—For each interval, a
tau value was calculated for the relation
between alpha index and median
judgment. The values obtained for
the .5-, 1-, 2-, and 3-sec. intervals were
−.03, +.12, +.10, and +.21, respectively
(the 5% one-tailed significance
value is .39). Since it was a subjective
classification that the S parts of the
EEG traces were not alpha rhythm, the
analysis was repeated using another
measure of alpha index: This was the
sum of the total length of the A parts
plus half the length of the S parts
divided by the total length of the traces,
i.e., (A + ½S)/(A + S + N). Using this
measure, the tau values obtained were
+.06, +.21, +.19, and +.30, respectively.
Both these analyses suggest
that significant results might be obtained
with longer intervals.
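
The two measures of alpha index used in this analysis differ only in how the doubtful S parts of the trace are counted. A minimal sketch of both follows; the function names and the trace lengths are hypothetical and serve only to make the two formulas concrete.

```python
def alpha_index(a, s, n):
    """First measure: A / (A + S + N), counting S parts as not alpha."""
    return a / (a + s + n)


def alpha_index_half_s(a, s, n):
    """Second measure: (A + S/2) / (A + S + N), counting half of the S parts."""
    return (a + 0.5 * s) / (a + s + n)


# Hypothetical trace lengths for one S (any unit of length on the record).
a_len, s_len, n_len = 30.0, 10.0, 20.0

first = alpha_index(a_len, s_len, n_len)           # 30 / 60
second = alpha_index_half_s(a_len, s_len, n_len)   # 35 / 60
print(first, round(second, 3))
```

The second measure can only raise an S's index, and it does so most for Ss with a large proportion of doubtful trace, which is why the rank order of Ss, and hence tau, may change.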

This experiment confirms Werboff’s 
finding of a weak relationship between 
time judgments and the dominant fre- 
quency of concurrent EEG, and sug- 
gests the prediction that significant 
relations between both alpha index and 
mean alpha rate and time judgments 
might be found using a wider range of
intervals. The aim of Exp. II was to
test this prediction.

EXPERIMENT II 
Method 


Unless otherwise stated, the conditions in 
this experiment were the same as in Exp. I. 
Subjects.—The Ss were 19 males and 3 females,
aged 18-27 yr., none of whom had
taken part in Exp. I. Each completed one
test session of 55 min. The placement of
electrodes and preliminary routine were the
same as before.
Spontaneous EEG.—Three minutes of 
EEG were recorded in the EC condition. 
Time judgments.—In pseudorandom order, 
each S made 10 judgments each of .5, 2, 4, 
and 8 sec. under the EC and EO conditions. 
(Concurrent EEG was not recorded.) 
Spontaneous EEG—Three more minutes 
of EEG were recorded in the EC condition. 
Treatment of data—For the four inter- 
vals, median scores were calculated for each 
S’s 10 judgments made under each of the 
two conditions. Each S’s four EEG traces 
were divided into A and N parts and the 
average alpha index determined. Mean 
alpha rates were calculated from the A parts. 


Results 


EC vs. EO.—The number of Ss hav- 
ing longer EC than EO median judg- 
ments for the .5-, 2-, 4-, and 8-sec. in- 
tervals were 7, 11, 10, and 9 out of 22, 
respectively (for a sign test to be sig- 
nificant at the one-tailed 5% level, 16 
out of the 22 would have had to show 
the same result). Therefore the EC 
and EO judgments were pooled and 
overall medians calculated. 
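
The criterion of 16 out of 22 quoted for the sign test can be checked from the binomial distribution with p = .5. The sketch below is illustrative only; the function names `upper_tail` and `critical_count` are not from the original report.

```python
from math import comb


def upper_tail(k, n):
    """P(X >= k) for X ~ Binomial(n, 0.5)."""
    return sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n


def critical_count(n, alpha=0.05):
    """Smallest count whose one-tailed sign-test probability is <= alpha."""
    for k in range(n + 1):
        if upper_tail(k, n) <= alpha:
            return k
    return None


needed = critical_count(22)
print(needed, round(upper_tail(needed, 22), 4), round(upper_tail(needed - 1, 22), 4))
```

With 22 Ss, the probability of 16 or more agreeing by chance is about .026, whereas for 15 or more it is about .067; the largest observed count of 11 is exactly what chance would give.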

Mean alpha rate—Tau values for 
the relation between mean alpha rate
and judgment length for the .5-, 2-, 4-,
and 8-sec. intervals were +.07, +.09,
+.06, and −.01, respectively (the 5%
one-tailed significance level is .25).

Alpha index.—Tau values for the
relation between alpha index and judgment
length for the .5-, 2-, 4-, and 8-sec.
intervals were −.05, +.12, −.19,
and −.01, respectively (the 5% one-tailed
significance level is .25).


Discussion 


No evidence was found for a relation 
between either alpha index or the mean 
frequency of spontaneous alpha rhythm 
and the length of time judgments. Since
these experiments showed no difference 
between either A and N judgments or 
EC and EO judgments, they indicate how 
weak the earlier reported relation between 
the frequency of concurrent EEG and 
time judgments must be. On the other 
hand, the fact that Werboff’s study and 
the present investigation both found small 
correlations between aspects of concurrent
EEG and judgment length may indicate
the existence of a reliable if weak
association between these two variables.
However, for the moment the evidence 
suggests that the EEG is not directly re- 
lated to time judgments. Certainly the 
simple model in which cycles of the alpha 
rhythm act as ticks for a subjective clock 
must be discarded.


EFFECT OF TEMPORAL SEPARATION OF TWO TASKS
ON PROACTIVE INHIBITION


BENTON J. UNDERWOOD AND JOEL S. FREUND


Northwestern University 


2 groups learned A-B for 32 trials, learned A-C to one perfect 
recitation 3 days later, and recalled A-C after 24 hr. 2 other groups 
learned both lists in immediate succession followed by 24-hr. recall of 
A-C. 1 group from each schedule had 6 A-B pairs retained in A-C. 
The results showed that the 3-day separation of A-B and A-C 
markedly reduced proactive inhibition (PI) ; this finding was associated 
with the theoretical concept of between-list differentiation. Given the 
temporal separation, differentiation does not completely break down 
when A-B pairs are retained in A-C. With no temporal separation 
and with repeated pairs, differentiation was assumed to be essentially 
0 since the number of interlist intrusions was about equivalent to
the number of correct responses.


Proactive inhibition (PI) in the re- 
call of A-C following the learning 
of A-B can be markedly reduced if 
A-B learning extends over several days 
(Underwood & Ekstrand, 1968). The 
evidence also indicated that about two- 
thirds of this reduction was eliminated 
if some of the A-B pairs were retained 
in the A-C list. These results sug- 
gested that differentiation between two 
interfering lists can be increased by the 
temporal dispersion of the learning of 
the interfering list, a differentiation 
which is partially destroyed when both 
lists contain the same pairs. Two 
questions will be asked of the present 
experiments, questions which are aimed 
toward a refinement of the concept of 
differentiation. 

Will a temporal separation between 
A-B and A-C reduce PI? In the 
above studies, A-B learning extended 
over 4 days with the last trials oc- 
curring just prior to A-C learning. 
Thus, A-B was given by widely spaced
practice. The question is, therefore,
whether it is the spaced practice as 
such or the temporal separation (of 
at least the initial learning of the two 
lists) associated with such practice that 
is responsible for the reduction in PI. 
To answer this question, a 3-day inter- 
val was inserted between A-B and A-C 
learning for one group, while another 
group had both lists on the same day 
with both given recall of A-C 24 hr. 
following its learning. If temporal 
separation is responsible for differen- 
tiation, PI should be less with the 3- 
day interval inserted than with no in- 
terval. 

As was noted above, retaining some 
A-B pairs in the A-C list greatly in- 
creased PI even with A-B learning 
temporally dispersed. However, the 
increase shown was not of such mag- 
nitude as to reach the level shown when 
no interval occurred between the learn- 
ing of the two lists. Two alternative 
explanations were suggested. First, 
with some old and some new pairs in 
A-C, within-list differentiation may 
have been aided, resulting in some- 
what better recall than in the case 
where all items were new. Secondly,
it is possible that differentiation resulting
from temporal dispersion was not
completely destroyed by retaining pairs 
from A-B in A-C. These two alterna- 
tive interpretations will be evaluated. 
If within-list discrimination is involved, 
recall should be enhanced when A-B 
pairs are retained in A-C even with no 
temporal separation between the two 
lists. If temporal separation of A-B 
and A-C results in a between-list dif- 
ferentiation that is not entirely de- 
stroyed by retaining some A-B pairs in 
A-C, a group with a temporal separa- 
tion between the two lists should give 
better recall than one without the sepa- 
ration when in both cases A-B items 
are retained in A-C. 

Four conditions (four groups) will 
answer both of the above questions. 
Two of the groups were given a 3-day 
interval between A-B and A-C, and 
two groups had no interval. One 
group given each schedule had half the 
pairs of A-B retained in A-C, half 
had none. 


METHOD 


Materials—The 12-pair lists were exactly 
the same as used in the previous study (Un- 
derwood & Ekstrand, 1968), and consisted of 
three-letter words as stimuli and two-syllable 
adjectives as responses. All groups learned 
the same A-C list. For two conditions, 6 
of the 12 pairs of A-B were retained in 
A-C; these were the same ones retained in 
the previous study. 

Conditions.—The A-B list was always
presented for 32 anticipation trials, while the
A-C list was carried to a criterion of one
perfect recitation. Recall was taken 24 hr.
after learning A-C with five relearning trials
being given. After relearning, a Modified-
Modified Free-Recall (MMFR) test was
given in which S was supplied the stimuli
and was asked to write the two appropriate
responses for each and to identify each by
list (first or second). This test was unpaced.
Learning and relearning occurred at
a 2:2-sec. rate, with a 4-sec. intertrial
interval.

Two of the groups had A-B on Monday
and A-C on Thursday. Two other groups
had both lists on Thursday. One group,
given each schedule, had six items from A-B
retained in A-C. The four groups may be
identified by the interval (0 or 3 days) and
by repetition (R) or nonrepetition (NR) of
A-B pairs in A-C. Thus, the four groups
became: 0-R, 0-NR, 3-R, and 3-NR. All
Ss were scheduled for a three-session experiment
(Monday, Thursday, Friday) and
all reported on Monday. Those Ss assigned
to the 0 conditions were dismissed and returned
on Thursday. A total of 120 Ss
were run, 30 in each condition. A schedule
of conditions, block randomized, associated
each S with a given condition when S appeared
at the laboratory. The Ss were not
necessarily naive to verbal learning studies.


RESULTS 


A-B learning.—Out of a possible 384 
correct responses for each S on the 32 
anticipation trials, the four groups 
varied in average percentage correct between
87.7 and 93.0. The mean total
number of perfect trials (out of 32) 
varied between 18.03 and 20.83. 

A-C learning.—The mean number
of trials to learn the second list to one
perfect recitation varied between 4.37
and 8.03 trials. As expected, having
six items from A-B repeated in A-C
facilitated learning, F (1, 116) =
10.76, p < .01. However, there was
also a significant interaction (F =
10.00). The means were 8.03, 5.30,
4.37, and 5.23 for Groups 3-NR, 0-NR,
3-R, and 0-R, respectively. The two
groups having both lists on Thursday
barely differ, in spite of the fact that
one had six repeated items. No entirely
satisfactory explanation of this
interaction has been found. The Ss
in Group 0-R produced 62 intrusions
in learning A-C, those in Group 0-NR,
only two. This may suggest differential
interference, or even differences in
set. (Groups 3-NR and 3-R had 9
and 14 intrusions, respectively.) The
difference in learning for the two NR
groups could involve warm-up differences.
However, as noted above, no
simple accounting of the interaction can
be suggested. The effects noted were
also present on the initial learning trials
but lacked statistical significance.

Recall—The basic recall data are 
based on the six nonrepeated items. 
The recall percentages for each of the 
four conditions are shown in Fig. 1. 
An analysis of variance shows that 
temporal separation (0 vs. 3) is highly 
significant, F (1, 116) = 18.55, p <
.01, that R vs. NR is not (F = 2.61),
and the interaction is, F (1, 116) =
5.23, p < .05. The NR conditions
show that recall was 65% when A-B 
was separated from A-C, 38% when it 
was not. For all 12 pairs the values 
were 63% and 38%, or a difference of 
25%. In a previous study (Under- 
wood & Ekstrand, 1968) the 0-NR 
condition had 37% recall, and a group 
having spaced practice over the interval 
between Monday and Thursday re- 
called 67%, or a difference of 30%. 
In still another study (Underwood & 
Ekstrand, 1966) these same two condi- 
tions resulted in recalls of 58% and 
31%, or a difference of 27%. Thus, 
spacing practice over the interval pro- 
duces about the same effect as allowing 
the interval to elapse without practice. 


FIG. 1. Percentage of recall for the six
nonrepeated pairs for each condition.
(Ordinate: percent recall; abscissa: time
in days between A-B and A-C.)


It may also be noted that a control 
group (no A-B) in the previous ex- 
periment had 72% recall, indicating 
little PI in that experiment when A-B 
was widely spaced, and presumably, 
therefore, little PI in the present 3-NR 
condition. It may further be deduced 
from the above data that differentiation 
is a negatively accelerated increasing 
function of the temporal separation between
two lists.

The second question asked of the 
experiment was whether the use of 
repeated items provided some within- 
list differentiation not present under 
NR conditions. The answer appears 
to be “no,” since the 0 groups differ 
only by 2.7%. On the other hand, 
with a temporal separation between the 
two lists, recall is better by 8.3% than 
without the temporal separation even 
when both cases involve repeated 
items. Although this difference is not 
statistically significant, it corresponds 
closely with the difference found in the 
previous experiment (10%) and sug- 
gests that not all differentiation be- 
tween two lists temporally separated 
is destroyed by repeating items. 

Recall of the repeated items was not 
perfect, being 90.6% and 86.1% for 
Groups 3-R and 0-R, respectively. 
This, as did the earlier experiment, 
suggests the presence of some gener- 
alized interference in this paradigm. 

Table 1 shows a complete breakdown 
of the responses given in the attempts 
to recall the six nonrepeated pairs. 
The results for Group 0-R are of par- 
ticular relevance for the concept of 
differentiation. This group produced 
73 correct responses and 66 intrusions, 
with 28 of the 30 Ss supplying at least 
one intrusion and 29 supplying at least 
one correct response. The near equal- 
ity of frequency of intrusions and cor- 
rect responses suggests that differ- 
entiation was minimal and that S re- 
TABLE 1

TOTAL RESPONSES OF ALL TYPES ON
THE SIX NONREPEATED PAIRS

Group    Correct    No response    Intrusion    Other
3-NR       117          41             17          5
0-NR        68          81             25          6
3-R         88          53             26         13
0-R         73          32             66          9


sponded on a random basis with one of 
the two responses. Indeed, if 32 “no 
responses” were divided equally be- 
tween intrusions and correct responses, 
Ss would have produced 89 correct re- 
sponses (out of a possible 180), or 
almost exactly 50%. In short, given 
repeated items and no temporal sepa- 
ration between the two lists, as long as 
S responds with some frequency 
(which success on repeated items may 
cause him to do), PI will probably 
never be greater than found here (37% 
recall) for a two-list situation. 
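The chance-responding argument for Group 0-R can be checked with a few lines of arithmetic. The sketch below uses only the counts given in the text (73 correct, 66 intrusions, 32 "no responses," 180 possible); the variable names are illustrative.

```python
# Counts reported for Group 0-R on the six nonrepeated pairs.
correct = 73
intrusions = 66
no_response = 32
possible = 180  # 30 Ss x 6 nonrepeated pairs

# Split the "no responses" evenly between correct responses and intrusions,
# as the text proposes.
expected_correct = correct + no_response // 2
expected_percent = 100 * expected_correct / possible

print(expected_correct, round(expected_percent, 1))
```

Under that even split the expected recall is very close to the 50% implied by random choice between the two responses.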

The second relearning trial showed 
the same difference among conditions 
as the first (recall) but the differences 
were much attenuated and by the third 
trial no differences were present. 

MMFR.—As is usually found with 
the MMFR test, when S produces a 
first-list response it is almost always 
paired with the appropriate stimulus 
and correctly identified as to list. 
Therefore, the MMFR for the six non- 
repeated items was scored correctly if 
the response was produced. The means 
were 4.43, 5.63, 4.83, and 5.60 for 3- 
NR, 0-NR, 3-R, and 0-R. Analysis 
shows that only the separation variable 
was significant, F (1, 116) = 24.38, p 
<.01. Recall of A-B was significantly 
less when learned on Monday than 
when learned on Thursday. Even so, 
the lowest recall (3-NR) was 74%. 
It will be noted, however, that recall
for 0-NR was 5.63 (94%). This
indicates little if any unlearning and
at least not as much as found for this
condition in two previous experiments 
(Underwood & Ekstrand, 1966, 1967). 
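The percentages quoted for MMFR follow from dividing each group mean by the six nonrepeated items. A short sketch of the conversion, assuming that six is the maximum score for every group:

```python
# Mean number of first-list responses produced on MMFR (six nonrepeated items).
mmfr_means = {"3-NR": 4.43, "0-NR": 5.63, "3-R": 4.83, "0-R": 5.60}
n_items = 6  # assumed maximum score per S

mmfr_percent = {group: 100 * mean / n_items for group, mean in mmfr_means.items()}

for group, pct in mmfr_percent.items():
    print(f"{group}: {pct:.0f}%")
```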


Discussion 


The evidence from the present study 
shows that a temporal separation between 
A-B and A-C markedly reduces PI. At 
the theoretical level this is said to result 
from between-list differentiation which 
increases as a negatively accelerated func- 
tion of the temporal-separation interval. 
The data further indicate that retaining 
items from A-B in A-C does not com- 
pletely eliminate differentiation resulting 
from a 3-day separation of the two lists. 
Previous studies (Underwood & Ek- 
strand, 1966, 1967) showed that spacing 
A-B over the temporal intervals used 
here to separate A-B and A-C gave the 
same quantitative recall results. The 
comparability of the results may be inter- 
preted to mean that it is not the spacing 
but the temporal separation which is the 
relevant variable. Nevertheless, it is pos- 
sible that spacing per se could accomplish 
the same effect as temporal separation and 
interpreting both findings as being due to 
temporal separation is merely a simplify- 
ing conclusion. 

Could the high recall resulting from 
temporal separation be due to forgetting 
of A-B (either “normal” forgetting, un- 
learning, or both)? If, for interference 
purposes, forgetting can be thought of as 
merely reducing the degree of learning of 
A-B, forgetting or unlearning should be 
of little consequence since an earlier study 
showed no difference in PI between 12 
and 32 trials on A-B (Underwood & 
Ekstrand, 1966). Further, the group 
with the temporal separation but with 
repeated items (Group 3-R) showed 
rather severe PI and the forgetting or 
unlearning of A-B for this group was 
equivalent on MMFR to that for Group 
3-NR. It does not seem likely that the 
relative lack of PI in Group 3-NR is due 
to lack of interference potential of A-B.
This is to say, therefore, that the present 
results would be expected even with a 
very high degree of learning of A-B. 

At the more general level, the present 
study supplies further evidence of the
importance of the temporal dimension in 
memory in the sense that events appear to 
be time-tagged (Yntema & Trask, 1963), 
and this tagging is at least partially inde- 
pendent of the forgetting which must, by 
definition, occur as a function of time.
Further, it becomes possible to view PI 
as being entirely due to a loss of differ- 
entiation, a loss along the temporal dimen- 
sion being of considerable importance. It 
remains to be seen just how the temporal 
separation of the learning of the inter- 
fering tasks interacts with the length of 
the retention interval, in view of the fact 
that PI increases as the length of the 
retention interval increases (Postman, 
1962). 
ERRORS IN RECOGNITION LEARNING AND RETENTION
Ss were presented a list of 40 words (CR words) with instructions
to remember the words but not the order. A recognition test for each
consisted of a high associate (A word), one formally similar (F
word), and one neutral (N word). All Ss pronounced the words on
the learning trial and one group (F) was, in addition, required to
pronounce an adjacent word which was either an A word, F word,
N word, or number. Group F and 1 other group had an immediate
recognition test. 2 other groups had recognition tests 24 hr. following
learning, 1 of these having had an immediate free-recall test. For
Group F the nature of the errors was determined in large part by the
nature of the adjacent item, but correct recognition did not differ as a
function of the type of adjacent item. However, the confidence
ratings of correctness indicated compensatory factors in the cases of
A and F words. Without forced adjacent items on the learning trial,
F words had no greater influence than N words, the A words
providing the major source of error. Free recall enhanced 24-hr.
recognition a small amount for the words recalled. 24% forgetting
was observed over 24 hr. when measured by correct responses, 18%
by confidence ratings of correct responses.


In recognition learning S is shown a 
group of N items and then on a recog- 
nition test is asked to select these items 
from among a group consisting of at 
least N +1 items. The recognition 
test can be arranged in a number of 
ways. One simple arrangement would 
be to pair each item presented for learn- 
ing with a new item and ask S to 
choose the member of each pair which 
had been presented for learning. Such 
a test becomes an exact parallel to a 
test trial in verbal-discrimination learn- 
ing and the results may be interpreted 
in the same manner. One such interpretation
assumes that S may acquire a
discrimination between two or more 
events as a result of a frequency differ- 
ential between them (Ekstrand, Wal- 
lace, & Underwood, 1966). Recog- 
nition learning may also be interpreted 

1This work was supported by Contract 
as being based on a simple frequency
discrimination, a discrimination be- 
tween the situational or experimentally- 
produced frequency of the items pre- 
sented and of items not presented. The 
present studies were planned with this 
interpretative scheme in mind but were 
thought of as exploratory in nature. 
Consider a situation in which S is 
presented a list of words for a single 
trial under instructions that he should 
remember as many words as possible 
with the order of the words being un- 
important. It may be assumed that at 
least for some Ss an implicit associa- 
tional response (IAR) will occur for 
some words. For example, if the word 
presented is “sky,” the IAR “blue” 
may occur. If it does occur, perform- 
ance on a recognition test in which S 
must choose “blue” and “sky” should 
be poorer than if “sky” and a neutral 
word were paired. In a certain sense, 
the IAR on the learning trial acts as 
an interpolated word. Since the interpolated
word also has situational frequency,
discrimination on a recognition
test which includes the interpolated
word should be poorer than if it were 
not included. Or, if the interpolated 
word and a neutral word were both 
included on the recognition test, more 
errors should involve the selection of 
the interpolated word than the neutral 
word. The present experiment ex- 
amines recognition learning with the 
latter type of recognition test. 

Although certain gross predictions 
can be made by assuming the occur- 
rence of IARs during learning, the sit- 
uation can be made more analytical by 
forcing S to produce them explicitly.
One of the conditions of the present 
experiment used this technique. The 
S was presented a list of words for a 
single learning trial, and told that he 
would be given a memory test for the 
words. Adjacent to each word was 
another item. The S was required to
pronounce both the critical word (the 
one he was to remember) and the ad- 
jacent item (which he was told he did 
not have to remember). The adja- 
cent items were either high associates 
of the critical words, formally similar 
to the critical words, neutral, or of a 
different class (numbers). The recog- 
nition test, consisting of the critical 
word, its high associate, a formally 
similar word, and a neutral word, was 
expected to provide information on the 
interfering properties of each type of 
adjacent item which S was forced to 
pronounce.

Finally, measurements were made on
the retention of recognition learning 
over 24 hr. In one case measurements 
were made after S had been given a 
free-recall test immediately after the 
presentation of the list, and in another, 
without such a test. Since the free-re- 
call test should increase the situational 
frequency of the items recalled, recogni- 
tion 24 hr. later should be enhanced for 
those items. 


METHOD 


Material.—The list to be remembered con- 
sisted of 40 words, called critical words (CR 
words). All were taken from the list used 
as stimuli to compile the Connecticut word- 
association tables (Bousfield, Cohen, Whit- 
marsh, & Kincaid, 1961). Only those words 
to which the primary response was pro- 
duced by over 50% of the Ss were used. 
The CR words are listed in the first column 
of Table 1, the high associates (A words), 
in the second column. A neutral (N word), 
not occurring to the CR word as a response, 
was chosen as a zero associate. Finally, a 
word was chosen having high formal simi- 
larity (F word) to the CR word. As may 
be seen in Table 1, this formal similarity 
usually, but not always, represented high 
acoustic similarity (clang word). The N
word and the F word had roughly the same 
frequency count (Thorndike & Lorge, 1944) 
as the A word and had about the same num- 
ber of letters. In all conditions the recogni- 
tion test consisted of S choosing among the 
four words in each of the 40 sets. 

The list of 40 words was printed in capital 
letters. In the condition in which S was 
forced to pronounce an adjacent item, the 
adjacent item was printed in lower case. 
For 10 CR words the adjacent words were A 
words, for 10 they were N words, for 10 F 
words, and for the final 10, numbers (1 
through 10). Four sublists were constructed 
so that across all Ss each CR word had ap- 
peared with all four types of adjacent items 
an equal number of times. Within each of 
these four lists, each type of adjacent item 
occurred in each successive block of four CR 
words. The order of the CR words was always
the same. A practice list of five items
was used to acquaint S with the nature of the 
task. 
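The report does not say exactly how the four sublists were assembled. One way to satisfy both stated constraints (each CR word paired with every adjacent-item type across sublists, and every type present in each successive block of four CR words) is a simple cyclic rotation. The sketch below assumes that scheme; the function name and the use of list positions in place of the actual CR words are hypothetical.

```python
# Adjacent-item types used in the forced-response condition.
TYPES = ["A", "N", "F", "number"]
N_WORDS = 40  # CR words, always presented in the same order


def make_sublists(n_words=N_WORDS, types=TYPES):
    """Return one list of adjacent-item types per sublist.

    Sublist k assigns types[(position + k) % 4] to the CR word at each
    position, so the assignment rotates by one type from sublist to sublist.
    This rotation is an assumed scheme, not the authors' documented procedure.
    """
    k = len(types)
    return [
        [types[(pos + shift) % k] for pos in range(n_words)]
        for shift in range(k)
    ]


sublists = make_sublists()

# Show the first block of four CR-word positions in each sublist.
for i, sub in enumerate(sublists, start=1):
    print(f"sublist {i}: {sub[:4]}")
```

With this rotation each sublist contains 10 CR words per adjacent-item type, which matches the counts given in the text.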

In the recognition test the order of the 
CR words was different from the order on 
the learning trial but was the same for all 
conditions. However, each successive block 
of four CR words included each of the four 
types of adjacent responses involved in the 
forced-response condition. The four words 
in each of the 40 sets were randomly ordered 
on the recognition test. The S was forced to
make a decision on each set before moving to 
the next, all sets yet to be marked being 
covered with a blank sheet. The S pro- 
ceeded at his own pace. A choice was made 
for each set by encircling the word. 

In addition to choosing a word from each 
set, S made a rating as to his confidence in 
the correctness of each decision. This was 
TABLE 1

WORDS USED ON THE RECOGNITION TEST

CR Word  A Word  N Word  F Word    | CR Word  A Word  N Word  F Word
ale      beer    knot    hail      | give     take    add     live
cat      dog     put     sat       | hammer   nail    basin   grammar
bad      good    since   bag       | hard     soft    deep    card
bed      sleep   state   said      | high     low     fat     lie
bitter   sweet   path    better    | hog      pig     gas     log
blotter  ink     roam    totter    | income   money   happy   become
bottom   top     choice  autumn    | lock     key     maid    block
boulder  rock    dinner  shoulder  | long     short   noon    lung
boy      girl    lord    toy       | lost     found   part    cost
bread    butter  cotton  head      | major    minor   rascal  manor
buy      sell    fact    sigh      | maple    tree    scare   ample
chill    cold    man     hill      | organ    music   utter   groan
cork     bottle  north   work      | poor     rich    rule    your
cow      milk    rate    how       | right    wrong   large   sight
cradle   baby    temper  ladle     | sky      blue    job     cry
lamp     light   list    lamb      | smooth   rough   hung    tooth
day      night   gate    bay       | spool    thread  gift    pool
down     up      view    town      | starved  hungry  eager   carved
false    true    waste   faults    | tango    dance   helmet  mango
father   mother  equal   bother    | web      spider  neat    wet


Note.—CR, word to be remembered; A, high associate; N, neutral; F, formally similar. 


done at the time the word was encircled and 
before moving to the next set. The rating 
was made by encircling a number from 1 to 
5, printed to the right of each set of four 
words. This scale was always in front of S 
and the following descriptions were associ- 
ated with each number: 1, not sure at all; 
2, not very sure; 3, moderately sure; 4, 
reasonably sure; 5, very sure. 

Conditions —There were four basic groups 
involved : 

Group I: This group of 48 Ss was given 
the 40 critical words for one presentation 
followed by an immediate recognition test. 
As each word appeared on the learning trial, 
S pronounced it. 

Group F: Members of this group were 
forced to pronounce not only the CR word, 
but also an adjacent word, appearing to 
the right of the CR word. This was the 
only group for which the adjacent words ap- 
peared. The recognition test was given 
immediately after the learning trial. There 
were four subgroups of 12 Ss each. The 
lists given each subgroup differed only in 
the adjacent word as noted above. Across 
the 48 Ss, each critical word had been 
paired with each type of adjacent item
(A word, N word, F word, and number) 12 
times. 

Group D: This group of 40 Ss was treated 
the same as Group I except that the recognition
test was delayed 24 hr. following the
learning trial. 

Group FL-D: This group of 40 Ss was 
treated the same as Group D except that a 
free-recall test was given immediately after 
the learning trial. The recognition test 
followed 24 hr. later. For the free-recall 
test, S was asked to write down as many 
words as possible even though S was not 
sure of the correctness (instruction empha- 
sized the production of words). The S 
did not make confidence ratings for the recalled
words.

In summary, all Ss in the four groups 
were given a single trial under instructions 
to remember the CR words (always printed 
in capitals). Each word was presented for 
3 sec. on a memory drum. For three of the 
groups (I, D, and FL-D) these were the 
only words presented with S being required 
to pronounce each. Only for Group F 
did adjacent words appear with S being re- 
quired to pronounce both the CR word and 
the adjacent word, although fully instructed 
that he would be tested only for the CR 
word. Learning instructions emphasized 
memory for words, not the order of the 
words. No S was instructed concerning the 
nature of the memory test prior to it being 
administered. 

Groups I and F were run simultaneously, 
with Ss being randomized between the two 
and with further randomization for the
sublists of Group F. Groups D and FL-D 
were also run simultaneously but at a later 
point in the school year than were Groups 
I and F. However, there is no reason to 
believe that appreciable shifts in the ability 
level of Ss occur during the school year. 
This is to say, therefore, that although pre- 
cise comparisons can only be made between 
Groups I and F, and between Groups D and 
FL-D, gross comparisons or tests of in- 
teractions can be made among all groups. 


RESULTS 


Some preliminary remarks need to 
be made about the confidence ratings. 
In general, the ratings reflect rather 
faithfully the data based on correct and 
incorrect recognitions, i.e., items cor- 
rectly recognized were given higher 
ratings than those not correctly recog- 
nized. For example, Group I gave a 
mean rating of 4.14 for items correctly 
recognized, 2.25 for those which were 
incorrect. As will be seen, forgetting 
occurred over 24 hr., and this is re- 
flected in the mean confidence ratings 
for Group D of 3.41 and 2.16 for cor- 
rect and incorrect recognitions, re- 
spectively. 

In general, Ss who recognized the 
greatest number of items correctly gave 
higher ratings to their correct re- 
sponses than did Ss who recognized 
relatively few correctly. Product- 
moment correlations for number cor- 
rect and mean ratings for correct items 
for each S show positive values of .49, 
.63, .38, and .18 for Groups I, D, FL-D,
and F, respectively. Correlations be- 
tween ratings for correct and incorrect 
items were also positive, being .29, 
.29, .50, and .25 for the four groups. 
Reliabilities of the ratings (regardless 
of correctness of recognition) tended 
to be higher than did the reliabilities of 
the correct-response measure. The 
first 20 sets of words on the recognition 
test were compared with the second 20. 
For Groups I and D the correlations of
mean ratings were .70 and .77, respectively,
and for correct responses,
.54 and .64. 
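The reliabilities reported above are split-half correlations: each S's score on the first 20 sets is correlated with the score on the second 20. The sketch below illustrates the computation under the assumption that an ordinary product-moment correlation was used with no step-up correction; the function names and the tiny data set are hypothetical and serve only to show the mechanics.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def split_half_r(scores_by_subject, first=20):
    """Correlate each S's mean on the first `first` sets with the mean on the rest."""
    first_half = [sum(s[:first]) / first for s in scores_by_subject]
    second_half = [sum(s[first:]) / (len(s) - first) for s in scores_by_subject]
    return pearson_r(first_half, second_half)


# Hypothetical ratings for three Ss on four sets (not data from the study).
demo = [[1, 1, 2, 2], [2, 2, 4, 4], [3, 3, 6, 6]]
demo_r = split_half_r(demo, first=2)
print(demo_r)
```

The same function applies to correct-response scores if each set is scored 1 (correct) or 0 (incorrect).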

Immediate recognition—On the rec- 
ognition test members of Group I gave 
a mean of 32.71 correct responses. 
Group F, forced to say adjacent re- 
sponses on the learning trial, gave a 
mean of 24.97. The difference between 
the two groups is of small interest since 
total study time for the CR words dif- 
fered (as a consequence of Ss in Group 
F being required to say the adjacent 
item). However, it is of theoretical 
interest to examine the nature of the 
errors made. These are plotted in Fig. 
1 for the two groups. It can be seen
that for Group I, the A words are re- 
sponsible for the greatest number of 
errors, with the number associated with 
N words and F words being about 
equivalent. For Group F the A words 
are again responsible for the greatest 
number of errors but the F words also 
contribute many, and appreciably more 
than the N words. Both error types 
and the interaction of Groups X Types 


[Figure 1 appears here. Vertical axis: mean number of errors; horizontal axis: type of error (A word, N word, F word); separate curves for Group F and Group I.]

Fig. 1. Types of errors in immediate
recognition when adjacent responses are
forced (Group F) and not forced (Group I).
are highly significant statistically, Fs
(2, 188) = 27.74 and 7.51, p < .01, re- 
spectively. Some notion of the pro- 
cesses responsible for the interaction 
may be obtained by examining in more 
detail the effects of forcing particular 
types of adjacent responses during the 
learning trial. 

Each list given members of Group F 
contained 10 CR words for each type 
of adjacent response. Four scores may 
be obtained for each S representing the 
mean number of correct recognition 
(out of 10 possible) for each adjacent 
response type. These means varied be- 
tween 6.04 and 6.60, F=1.50. On 
the surface, at least, this finding indi- 
cates that the relationship between the 
CR word and the adjacent word does 
not influence the recognition of the CR 
word; performance is no better when 
numbers are the adjacent responses 
than when A words are the adjacent 
responses. It will be argued shortly,
however, that this equality results from 
compensatory factors. It is necessary 
first to examine the nature of the errors 
made by Group F as a function of the 
adjacent item with which S was forced 
to respond. The data are shown in 
Fig. 2. 

The first fact apparent from Fig. 2 
is that the maximum frequency of a 
given type of error is determined by 
the type of the adjacent item. When 
the A word is the adjacent word, nearly 
80% of the errors are A words; when 
the adjacent word is the F word, about 
70% of the errors are F words. Even
N words follow the rule. Clearly, the 
forced interpolated item is a major 
source of interference in recognition. 
With an irrelevant adjacent item (num- 
bers), the A words are the largest 
source of error with the F words only 
slightly less so. 


[Figure 2 appears here. Horizontal axis: item presented.]

Fig. 2. Types of errors as a function of type of adjacent forced response.
Both Fig. 1 and 2 have indicated that
overall the A words are the major 
source of interference; if S is in error 
on a recognition attempt, the A word 
has the greatest probability of being 
chosen. The question arises again, 
therefore, as to why correct recognition 
did not differ as a function of type of 
adjacent item. As noted above, the 
means were all statistically equivalent. 
The rating data provide a clue as to the 
possible reason why type of adjacent 
item did not influence correct recogni- 
tion. For Group F the mean ratings of 
correct recognitions were 3.87, 3.68, 
3.39, and 3.10, F (3, 132) = 11.70, p 
<.01, when A words, F words, N 
words, and numbers were adjacent 
items, respectively. Thus, although the 
error data indicate that A and F words 
are, as adjacent items, more potent 
sources of interference than N words 
and numbers, the confidence ratings are 
higher for the former. It seems highly
probable that with A words and F 
words as adjacent words, the recogni- 
tion alternatives are effectively reduced 
from four to two. Assuming some 
memory for the words presented, with 
an A word as the adjacent item S 
knows that the correct word is either 
the CR word or its associate (A
word); thus, the two other words in 
the set are largely eliminated as sources 
of confusion. When the F word is the 
adjacent word, S may also remember, 
because of the acoustic similarity of 
the CR word and the adjacent word, 
that the correct response is one of two.
When, however, an N word or a num- 
ber is the adjacent item, there is no 
obvious manner by which the four 
choices can be reduced to two. The 
confidence ratings must reflect to some 
extent these differences in the number 
of potentially possible correct responses 
on the recognition test. Therefore, it 
seems likely that the greater interference
potential of the F and A words is
compensated for by a reduction in
number of reasonable alternatives on 
the recognition test. It will be pointed 
out later that this same situation ob- 
tains for A words even in the condi- 
tions where no adjacent words are 
presented. 

Delayed recognition—Two groups 
(D and FL-D) received the recogni- 
tion test 24 hr. after presentation of the 
CR words. The results for Group 
FL-D will be considered first. This 
group recalled 14.93 of the 40 items 
immediately after presentation and 24 
hr. later correctly recognized 26.65 
items. It was anticipated that the free- 
recall trial would influence recognition 
since for those items recalled an addi- 
tional frequency unit would occur. Of 
the 14.93 items given on free recall, 
90.3% were correct on the recognition 
test. Of the 25.07 items not given in 
free recall, 52.5% were correct on the 
recognition test. However, this differ- 
ence reflects in small part the fact that 
the items which are given most fre- 
quently in free recall are also most 
likely to be recognized correctly 24 hr. 
later. This relationship is shown by 
the positive product-moment correla- 
tion of .31 between the number of times 
items were correct in free recall and the 
number of times they were correct in 
24-hr. recognition for Group D (the 
group not having free recall). How- 
ever, the corresponding correlation be- 
tween number of times an item was 
correct in free recall and number of 
times correctly recognized for Group 
FL-D is .50. Using the free-recall and 
recognition scores for each S in Group 
FL-D, the correlation becomes .69. In 
the free-recall protocols 16 intrusions 
occurred which were A words and 
therefore appeared on the recognition 
test. Eight (50%) were chosen as the 
correct response. Such data indicate 
that free recall may, to some extent, 
increase the likelihood of errors as 
well as increase the likelihood of correct
responses for those recalled.
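The figures reported for Group FL-D are internally consistent: weighting the two conditional recognition rates by the number of items recalled and not recalled reproduces the overall recognition mean. A brief check, using only values stated in the text (the variable names are illustrative):

```python
# Group FL-D: mean items given / not given in free recall (out of 40).
recalled = 14.93
not_recalled = 25.07

# Proportion of each subset correctly recognized 24 hr. later.
p_correct_recalled = 0.903
p_correct_not_recalled = 0.525

# Weighted total should approximate the reported mean of 26.65 correct.
expected_recognized = (
    recalled * p_correct_recalled + not_recalled * p_correct_not_recalled
)
print(round(expected_recognized, 2))
```

The small discrepancy from 26.65 is what would be expected from rounding of the reported percentages.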

The above data indicate that item 
difficulty influences both free recall and 
24-hr. recognition in somewhat the 
same manner. Therefore, in determin- 
ing the overall influence of free recall 
on 24-hr. recognition, the total correct 
recognitions for each item for Group 
D was subtracted from the total cor- 
rect recognitions for the corresponding 
items for Group FL-D. The resulting 
distribution was then evaluated statis- 
tically and found to be significant (t = 
2.65). Thus, although the overall dif- 
ference in correct recognitions between 
the two groups is not large (24.93 vs. 
26.65) it appears to represent a reliable 
effect of having had an immediate test 
of free recall. Furthermore, the ratings
reflect differences, being 3.73 for
the correct responses for Group FL-D, 
and 3.41 for Group D, F (1, 78) = 
5.49, p < .05. The difference was pri- 
marily produced by items which were 
given correctly on both free recall and 
recognition, the mean rating for these 
items for Group FL-D being 4.59. 
Nevertheless, the difference in correct 
recognition as a consequence of having 
free recall is not large and the data as 
a whole suggest that most of the items 
given correctly in free recall would 
have been correct in 24-hr. recognition 
even without the additional frequency 
input. 

Errors in delayed recognition— 
Table 2 shows the numbers and per- 
centages of each type of error occurring 
at 24 hr. Group I (immediate recognition)
is included for comparison purposes.
The consistency for the various
groups is notable, all showing that the 
greatest numbers of errors involve the 
selection of the A word, the next most 
frequent the N word, with the F word 
only slightly less than the N word. 
It will be remembered that when S was 
forced to pronounce an adjacent word, 


TABLE 2
TYPES OF ERRORS IN RECOGNITION

                        Error Type
               A Word       N Word       F Word
Group         No.    %     No.    %     No.    %
Group I       153  42.5    107  29.7    100  27.8
Group D       (entries not legible)
Group FL-D    224  41.9    173  32.4    137  25.7
Totals        627  41.9    470  31.4    400  26.7


F words appeared to interfere appreciably
more than did N words. In
Table 2 the F words have slightly less 
likelihood of being chosen as an error 
than do the N words. These differ- 
ences suggest that acoustic coding of 
words is not likely to occur unless the 
adjacent words essentially call such a 
possibility to the attention of S. That 
this coding is not entirely specific to 
the instance of an adjacent F word is 
shown by the fact that in Fig. 2 F 
words were chosen by Group F more 
frequently as errors than N words, 
even when numbers were the adjacent 
items. However, as seen in Table 2, 
there is no evidence that acoustic simi- 
larity influences recognition errors any 
more than does a neutral word; in fact 
the evidence is in the other direction, 
and the choice of an N word may be 
viewed as representing near complete 
forgetting. 
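The entries for Group D in Table 2 did not survive reproduction. If the Totals row is assumed to be the simple sum of the three groups, the missing counts can be recovered by subtraction and checked against the Group D mean of 24.93 correct (40 Ss, 40 items each). The sketch below is an inference under that assumption, not a transcription of the original table.

```python
# Error counts from Table 2, in the order (A word, N word, F word).
group_i = [153, 107, 100]
group_fl_d = [224, 173, 137]
totals = [627, 470, 400]

# Assumption: Totals = Group I + Group D + Group FL-D.
group_d = [t - i - f for t, i, f in zip(totals, group_i, group_fl_d)]
group_d_total = sum(group_d)
group_d_percent = [round(100 * n / group_d_total, 1) for n in group_d]

# Cross-check: total errors implied by the mean of 24.93 correct for 40 Ss.
implied_errors = 40 * 40 - 40 * 24.93

print(group_d, group_d_percent, round(implied_errors))
```

The recovered total agrees with the number of errors implied by the Group D recognition mean, which lends some support to the assumption.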

It was noted earlier that A words, 
though chosen most frequently as 
errors, produced higher confidence rat- 
ings than other error types when ad- 
jacent stimuli were involved. It may 
be asked whether this same phenome- 
non occurs without adjacent stimuli, 
the question being based on the as- 
sumption that S “naturally” produces 
IARs when single words are presented
for learning. If S does, and if the as- 
sociate produced is the same as the one 
appearing on the recognition test, con- 
fidence should be higher for the A 
words chosen as errors than when
either N or F words are chosen. For 
Group FL-D, the mean ratings of 
errors were 2.41, 2.21, and 1.99, for A 
words, N words, and F words, respec- 
tively. For Group D, the correspond- 
ing values were 2.27, 1.95, and 1.96. 
The only significant source of variance 
was error type, F (2, 156) = 7.28, p < 
.01. Data show the same relationships 
occurred for Group I, the means being 
2.46, 2.01, and 1.99. Thus, it again 
appears that in a recognition test the 
occurrence of IARs in the form of 
high associates restricts the number 
of potential alternatives and this re- 
striction is reflected in the confidence 
ratings. 

Immediate vs. delayed recall—As- 
suming the equivalence of the abilities 
of groups run at different points in the
school year, determinations may be 
made of the forgetting of recognition 
learning by comparing Groups I and 
D. The mean recognition scores were 
32.63 and 24.93, respectively. As- 
suming that the mean for Group I rep- 
resents a learning measure, the forget- 
ting shown over 24 hr. is 7.70 items, or 
23.6%. The mean ratings for the cor- 
rect items also drop from 4.14 to 3.41, 
a change of 18.4%. Ratings for in- 
correct responses show only a slight 
fall (2.25 to 2.16). 
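The forgetting estimate for correct recognitions is the drop from the Group I mean to the Group D mean, expressed as a percentage of the Group I mean. A minimal sketch of that calculation, with illustrative variable names:

```python
# Mean correct recognitions (out of 40) as given in this section.
immediate_mean = 32.63  # Group I
delayed_mean = 24.93    # Group D, tested 24 hr. later

items_forgotten = immediate_mean - delayed_mean
percent_forgotten = 100 * items_forgotten / immediate_mean

print(round(items_forgotten, 2), round(percent_forgotten, 1))
```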


Discussion 


The evidence of the present study indi- 
cates that forcing S to produce a word 
(that he is not asked to learn) contiguous 
to a word that he is asked to learn, 
strongly influences the nature of the error 
on a recognition task. If the correct word 
is replaced on the recognition test, it is 
most likely to be replaced by the forced 
word, regardless of the relationship be- 
tween the correct word and the forced 
word. However, the data have also 
shown that the most powerful interfering 
response is the strong associate of the
word to be remembered. This source of
interference operates whether an adjacent 
response is forced or not, and is, therefore, 
consonant with the assumption that S may 
normally produce an IAR which functions 
in much the same manner as when the 
occurrence of the associate is forced. 
Even so, the interference effect of the 
IAR has probably been underestimated 
because of the reduction in the number of 
effective distractors which occurred on 
the recognition task. This in turn re- 
sults in an increase in the number of cor- 
rect responses expected by chance. The 
confidence ratings, always higher when 
an associate was chosen as an error than 
when other alternative wrong responses 
were chosen, suggested that S had es- 
sentially limited his choice to one of two 
responses. 

Formal or acoustical similarity pro- 
duced results corresponding to those pro- 
duced by a high associate only when an 
adjacent response was forced. When the 
interfering response was not forced, the 
formally similar recognition alternative 
produced no more errors than did the 
neutral word. This suggests that in the 
usual learning situation, words are not 
singularly remembered by their formal or 
acoustical properties. Such a finding con- 
firms the work of Eagle and Ortof (1967) 
who showed that appreciable selection of 
formally similar incorrect words (clang 
words) occurred only following severe 
distracting conditions present on the 
learning trial. Thus, it appears that 
under usual conditions, memory for the 
acoustical property of the word is domi- 
nated by other properties, one of which, 
the associative property, has been identi- 
fied in the present study. 

It was suggested in the introduction 
that discrimination on a recognition task 
may be based in part, at least, on a situ- 
ational frequency difference between items 
presented and those not presented but oc- 
curring on the recognition test. The 
greater the frequency differential the bet- 
ter the discrimination. The question may 
be raised as to how forgetting of recog- 
nition learning may occur as it did over 
24 hr. It may be assumed that situational 
frequency loses its specificity with the 
passage of time and gradually assimilates 
into background frequency (the frequency 
with which the word has occurred in all 
situations). Just how this may occur 
cannot be specified at this time, but inso- 
far as the specific place or time for the 
occurrence of any event may be forgotten, 
so too the association between a particular 
word and its frequency in a particular 
situation may be forgotten. 


SHORT-TERM MEMORY FOR INDIVIDUAL VERBAL ITEMS 
AS A FUNCTION OF METHOD OF RECALL 


HENRY LOESS AND RICHARD HARRIS 
College of Wooster 


2 experiments were conducted to evaluate different methods of recall 
in short-term memory. Each experiment contained 96 Ss divided 
into 4 subgroups. The stimuli were word triads, each containing words 
from 3 common taxonomic categories. The S saw a triad for 1.5 sec., 
followed by a series of random digits designed to minimize rehearsal. 
Retention was measured after 9 sec. In Exp. I, experimental Ss were 
provided with the appropriate category names at the beginning of a 
recall interval. Performance was not superior to control Ss who 
were told simply to recall. In Exp. II, experimental Ss received 
category cues after an initial period of free recall. Under these con- 
ditions performance was superior to control Ss who had the same total 
time to recall but no special cues. The results are discussed in terms 
of (a) opposing facilitative and interfering effects of the recall cues, 
and (b) different methods of retrieval from short-term and long-term 


memory. 


Several experiments in short-term 
memory (STM) have demonstrated 
that, following a brief initial presenta- 
tion, the retention of individual verbal 
items decreases rapidly over time. This 
has been shown to be true for many 
types of stimuli, under a variety of pre- 
sentation conditions, and with several 
methods of measuring recall (e.g., 
Murdock, 1963; Peterson & Peterson, 
1959). A majority of studies have at- 
tempted either to identify the relevant 
variables or to answer theoretical ques- 
tions; the former have generally been 
concerned with similarities among the 
variables determining STM as con- 
trasted with long-term memory 
(LTM) (e.g., Hellyer, 1962; Loess, 
1967), while the latter have attempted 
to assess the nature of the storage pro- 
cess, specifically, whether STM in- 
volves a decay of traces, interference 
from other materials, or both (e.g., 
Conrad, 1967; Waugh & Norman, 
1965). Few studies have been con- 
cerned with a second theoretical issue, 
namely, retrieval or trace utilization. 

For LTM it is frequently held that 
permanent structural traces have been 
established. Therefore the material in 
LTM is considered to be potentially 
available. Whether it is retrievable 
depends on two considerations: (a) 
the nature of interference from pre- 
viously or subsequently learned asso- 
ciations, and (b) the appropriateness 
or completeness of the retrieval cue 
(Melton, 1963). For STM the evi- 
dence is less clear concerning the per- 
manence of the structural trace. Thus, 
failure to recall a briefly stored item 
might reflect either of the above factors 
or it might mean that the item cannot 
be recalled because the trace has de- 
cayed and disappeared. Therefore, a 
systematic study of different recall 
methods in STM seems desirable. If 
items which are presumed to be in 
STM are recallable with one method 
but not with another, it means that 
considerable attention should be paid 
to the method of recall when theoretical 
statements are based on recall data. 

One experiment which investigated 
differential recall was conducted by 
Tulving and Pearlstone (1966). They 
found that Ss who were told at the 
time of recall the names of the relevant 
categories and the numbers of words 
per category (cued recall) showed 
higher recall after a single presentation 
of a list than Ss who were not given 
this information (noncued recall). In 
addition, words from a greater number 
of categories were recalled under the 
cued condition but the mean number 
of words recalled per category re- 
mained invariant. The authors con- 
cluded that there are two independent 
retrieval processes, one concerned with 
the accessibility of higher-order mem- 
ory units, the other with accessibility of 
items within higher-order units. How- 
ever, since all of the word lists were of 
supraspan length, their experiment did 
not deal exclusively with STM. It is 
likely that during the presentation of 
a list S engaged in an active attempt to 
transfer material from short-term to 
long-term store as well as to rehearse 
the material in short-term store (At- 
kinson & Shiffrin, 1967). 

The present experiments were de- 
signed to determine if different in- 
structions at the time of recall would 
lead to differential retention of subspan 
units under conditions which minimize 
the need for coding operations of the 
types discussed by Tulving and Pearl- 
stone (1966) and Atkinson and Shif- 
frin (1967). 


EXPERIMENT I 
Method 
Subjects—The Ss were 96 Wooster under- 
graduates who served in the experiment to 
fulfill a course requirement. 


Materials—The stimuli consisted of 48 
word triads used in a previous study (Loess, 
1967). The triads had been constructed 
from lists of words of high and low fre- 
quencies of occurrence for eight taxonomic 
categories (Bousfield, Cohen, & Whitmarsh, 
1958). The categories are: Animals, Men’s 
Names, Birds, Countries, Vegetables, Pro- 
fessions, Types of Cloth, and Musical In- 
struments. The word lists represent com- 
mon (high frequency) and uncommon (low 
frequency) examples as given by Ss in the 
Bousfield et al. norms. Each triad con- 
tained words from three categories. Twenty- 
four triads contained high-frequency words 
and 24 triads contained low-frequency words. 
The triads were prepared on 2 X 2 in. slides. 
For each stimulus slide there was a com- 
panion “cue” slide which contained the three 
category names. For example, one stim- 
ulus slide read, “Dog-England-Carrot.” The 
corresponding cue slide read, “Animal-Coun- 
try-Vegetable.” Additional materials con- 
sisted of slides containing random six-digit 
numbers and instructional slides saying, 
“Ready” and “Recall.” 

Design—The experimental design was a 
2x2 factorial. One factor was frequency 
of occurrence (high or low) ; the other was 
type of recall (cued or noncued). Twenty- 
four Ss were assigned randomly to each 
of the four combinations: high-cued, low- 
cued, high-noncued, and low-noncued. For 
each S, 12 stimulus slides were chosen ran- 
domly from the pool of 24 high-frequency 
slides or from the pool of 24 low-frequency 
slides. The slides were then assigned ran- 
domly to the 12 trials. The slides contain- 
ing the six-digit numbers were initially 
placed randomly in the carousel but were 
not changed between Ss. 
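
The randomization described in the design can be sketched as follows. This is an illustrative reconstruction only: the slide labels, the seed, and the function names are assumptions introduced for the example, and the original report does not state how the random assignment was actually carried out.

```python
import random

# The four cells of the 2 x 2 design (frequency of occurrence x type of recall).
CELLS = [("high", "cued"), ("low", "cued"), ("high", "noncued"), ("low", "noncued")]


def assign_subjects(n_subjects=96, seed=0):
    """Randomly assign an equal number of Ss (24 when n_subjects is 96) to each cell."""
    rng = random.Random(seed)
    per_cell = n_subjects // len(CELLS)
    labels = [cell for cell in CELLS for _ in range(per_cell)]
    rng.shuffle(labels)
    return labels  # labels[i] is the cell for the i-th S tested


def trial_slides(frequency, rng, pool_size=24, n_trials=12):
    """Draw 12 of the 24 slides in one frequency pool, in random trial order."""
    pool = [f"{frequency}-{i:02d}" for i in range(1, pool_size + 1)]  # hypothetical labels
    # random.sample returns the chosen items in random order, which covers
    # both the random selection and the random assignment to trials.
    return rng.sample(pool, n_trials)


assignments = assign_subjects()
slides = trial_slides(assignments[0][0], random.Random(1))
print(assignments[0], slides)
```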

Procedure—The Ss were run individually 
and the stimuli were presented visually with 
a Kodak Carousel projector set for a 1.5-sec. 
exposure time. The projector took an ad- 
ditional .5 sec. to change slides. Time in- 
tervals were measured with a Hunter Kloc- 
kounter and the recycling was controlled 
by two Hunter 115C timers. All Ss received 
identical general instructions regarding the 
nature of the experiment. In addition, Ss 
in the noncued groups were told: 


After several numbers have appeared, 
the word “recall” will appear. This is 
your cue to try to tell me what the words 
were at the beginning of the series. If 
possible, say them in the order in which 
they appeared. You will have several sec- 
onds to remember the words. If you re- 
member them before I say stop, they 
will be counted correct. 


The Ss in the cued groups were told: 


After several numbers have appeared, 
three new words will appear. These will 
be cues to try to tell me what the words 
were at the beginning of the series. You 
will have several seconds to remember the 
words. If you remember them at any time 
while the cue words are on the screen, be- 
fore I say stop, they will be counted 
correct. 


After E read the instructions a slide say- 
ing “Ready” appeared on the screen. There 
were no practice slides. Following the 
ready slide the first triad appeared and S 
spoke the three words aloud. Then, in 
succession, there appeared five slides con- 
taining six-digit numbers. The S read the 
digits out loud rapidly with instructions to 
repeat them if he had time. After the 
fifth number a recall slide appeared. For 
Ss in the noncued groups the slide said 
simply, “Recall.” For Ss in the cued groups 
the slide contained the three appropriate 
category names. The S was given 9 sec. to 
recall after which E said “Stop.” Two sec- 
onds later a ready signal appeared anticipat- 
ing the next trial. In this manner each S 
received 12 trials in a single condition of the 
experiment. 


Results 


Table 1 presents the proportions of 
triads and words correctly recalled over 
the 12 trials. A triad was counted cor- 
rect if all three words were recalled in 
the correct intraitem serial position. A 
word was counted correct if it was 
recalled at any time during the recall 
interval. For the recall of triads, the 
table appears to show an interaction 
between associative frequency and 
method of recall. However, results of 
a 2 X 2 analysis of variance indicated 
that neither the main effect of recall 
method nor the Frequency X Method 
interaction was significant, F (1, 92) 
= 2.61, p > .05, and F (1, 92) = 1.11, 
p > .05, respectively. On the other 
hand, the main effect of frequency was 
significant, F (1, 92) = 14.38, p < .01. 
Examination of individual response 
protocols shows that the main advan- 
tage, if any, of having the category 
names at hand at the time of recall was 
that it allowed S to rearrange the words 
in a triad to correct transposition er- 
rors. The category names did not ap- 
pear to help S fill in words of an 
incomplete triad. This is consistent 
with data for correct words, also pre- 
sented in Table 1. An analysis of 
variance showed that neither the main 
effect of recall method nor the Fre- 
quency X Method interaction was sig- 
nificant (both Fs < 1); again, the main 
effect of frequency was significant, F 
(1, 92) = 7.35, p < .01. 


TABLE 1 

PROPORTIONS OF TRIADS AND WORDS RECALLED CORRECTLY IN EXP. I AS A 
FUNCTION OF FREQUENCY OF OCCURRENCE AND METHOD OF RECALL 

                    Proportion of         Proportion of 
                    Correct Triads        Correct Words 
Cond.               Cued    Noncued       Cued    Noncued 
High Frequency      .44     .33           .71     .69 
Low Frequency       .23     .21           .63     .61 
Total               .34     .27           .67     .65 


Intrusion data are presented in Table 
2. An intrusion is any incorrect word 
verbalized at the time of recall. Intra- 
experimental intrusions are words 
which had appeared in triads of pre- 
vious trials; extraexperimental intru- 
sions are words which had not appeared 
in previous trials. Correct-category in- 
trusions are words which belong to one 
of the three categories of a current 
triad; incorrect-category intrusions are 
words which do not belong to one of 
the three categories. The proportions 
in the table were obtained by dividing 
the frequency of each type of intrusion 
by the total number of intrusions in 
the experiment. Table 2 shows that 
cueing reduced the total number of 
intrusions by minimizing incorrect- 
category intrusions. Correct-category 
intrusions were slightly increased by 
cueing but not reliably so. 


TABLE 2 

PROPORTIONS AND TOTAL NUMBER OF VARIOUS TYPES OF INTRUSION IN 
EXP. I AND II 

                             Exp. I               Exp. II 
Intrusions               Cued    Noncued      Cued    Noncued 
Intraexperimental 
  Correct Category       .24     .21          .24     AT 
  Incorrect Category     .04     .19          .07     .13 
Extraexperimental 
  Correct Category       .10     .04          .16     .04 
  Incorrect Category     .05     .13          .08     .11 
Total                    .43     .57          .51     .49 
Total Number             95      126          118     112 
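
The proportions in the Total row of Table 2 can be recovered from the Total Number row, since each proportion is a count divided by all intrusions in the experiment. A minimal sketch of that check follows; the function name is illustrative and not taken from the original report.

```python
def intrusion_shares(cued_count, noncued_count):
    """Proportion of all intrusions in one experiment made under each recall method."""
    total = cued_count + noncued_count
    return round(cued_count / total, 2), round(noncued_count / total, 2)


# Total numbers of intrusions reported in Table 2.
print("Exp. I :", intrusion_shares(95, 126))
print("Exp. II:", intrusion_shares(118, 112))
```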

The results suggest that when S is 
provided with the category names of 
words in a triad at the beginning of a 
recall period, the main effects are to re- 
duce the number of overt incorrect- 
category intrusions and to minimize 
transposition errors. Presentation of 
the category names did not give S ac- 
cess to more words than did the simple 
cue to “recall.” However, it is pos- 
sible that the experiment did not use 
appropriate retrieval cues. Some Ss 
reported considerable interference when 
the category names were flashed on the 
screen. Their comments are consistent 
with the idea of a limited short-term 
storage buffer from which some items 
are bumped when new information is 
presented (Atkinson & Shiffrin, 1967; 
Waugh & Norman, 1965). It is true 
that performance of the cued groups 
was not lower than performance of the 
noncued groups, as might be expected 
if the category names had significantly 
interfered with retention. On the other 
hand, it is possible that facilitative ef- 
fects of cueing were neutralized by 
interference caused by presentation of 


the cues. The second experiment was 
designed to clarify this ambiguity by 
minimizing the interference created by 
the retrieval cues. It was hoped that 
in this manner any positive effects of 
cueing could be more readily identified. 


EXPERIMENT II 


Method 


Subjects and design.—The Ss were 96 paid 
volunteers from the University of New 
Hampshire during the summer of 1967. All 
had served in a previous STM study that 
was not related in method or materials to 
the present experiment. As before, there 
were four subgroups of 24 Ss each: high- 
cued, high-noncued, low-cued, and low-non- 
cued. The design, materials, and procedure 
were otherwise identical to those of Exp. I. The major 
change was the use of a different method of 
cueing. This was accomplished by chang- 
ing the specific instructions to Ss in the cued 
groups. After the general instructions they 
were told: 


After several numbers have appeared, the 
word “recall” will appear. This is your 
cue to try to tell me what the words were 
at the beginning of the series. If possible, 
say them in the order in which they ap- 
peared. If you have trouble recalling 
them after a few seconds, I may give 
you some verbal cues to help you remem- 
ber. However, please make every effort 
to recall the words as soon as the word 
“recall” appears on the screen. 


On each trial E waited 6 sec. for S to re- 
spond. If S did not respond with words 
from each of the categories in the triad by 
the end of the 6 sec., E told S the category 
names of missing words. This procedure 
was intended to help S recall missing words 
rather than to correct transposition errors 
and to minimize the strain on the short- 
term buffer. It should be clear that E 
gave cues only when the categories relevant 
to a particular triad were unrepresented in 
S's response. If S reported an incorrect 
word from a relevant category, E did not 
provide a cue. After the cue was given, E 
allowed another 6 sec. for S to respond. 
This necessitated a total recall period of 12 
sec., 3 sec. longer than in Exp. I. To equa- 
lize recall time, Ss in the noncued groups 
were also given 12 sec. to recall. 
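
The rule E followed in deciding which category names to supply after the first 6 sec. can be stated as a short procedure. The sketch below is an illustration under stated assumptions: the function name, the data layout, and the word-to-category lookup are hypothetical, and in the experiment itself E made this judgment by ear rather than from a table.

```python
def categories_to_cue(triad, response, category_of):
    """Return the category names E would supply after the initial free-recall period.

    triad       -- list of (word, category) pairs shown on the stimulus slide
    response    -- words S said during the first 6 sec.
    category_of -- hypothetical lookup from any spoken word to its category
    """
    # A category counts as represented even if the word given is wrong,
    # so an incorrect word from a relevant category suppresses the cue.
    represented = {category_of[w] for w in response if w in category_of}
    return [category for _, category in triad if category not in represented]


triad = [("Dog", "Animal"), ("England", "Country"), ("Carrot", "Vegetable")]
lookup = {"Dog": "Animal", "England": "Country", "Carrot": "Vegetable",
          "France": "Country"}

# S recalls "Dog" and the intrusion "France": only the Vegetable cue is given.
print(categories_to_cue(triad, ["Dog", "France"], lookup))
```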


After several Ss were run, it was dis- 
covered that the Carousel projector being 
used for Exp. II was taking .3 sec. longer to 
change slides than the one used in Exp. I. 
This plus other factors such as the 3 sec. 
added to the recall periods in Exp. II ap- 
parently raised the general level of respond- 
ing but should not have created a systematic 
bias across subgroups. 


Results 


The results are presented in Table 3. 
For both response measures, perform- 
ance was considerably higher for the 
cued than for the noncued groups. 
Separate 2 X 2 analyses of variance 
showed that in both cases the main 
effects of recall method were significant, 
F (1, 92) = 4.48, p <.05, for triads 
and F (1, 92) = 11.31, p < .01, for 
words. In addition, the main effects 
of associative frequency were signifi- 
cant (ps < .01), but the Method X 
Frequency interactions were not (both 
Fs < 1). The intrusion data are pre- 
sented in Table 2. Cueing decreased 
intrusions from incorrect categories 
and increased intrusions from correct 
categories. However, the proportions 
of total intrusions were about the same 
for the cued and noncued conditions 
(.51 and .49). In comparison, there 
were considerably fewer intrusions in 
the cued than noncued conditions in 
Exp. I (.43 and .57), presumably be- 
cause the cues were presented at the 
beginning of the retention interval. 


TABLE 3 

PROPORTIONS OF TRIADS AND WORDS RECALLED CORRECTLY IN EXP. II AS A 
FUNCTION OF FREQUENCY OF OCCURRENCE AND METHOD OF RECALL 

                    Proportion of         Proportion of 
                    Correct Triads        Correct Words 
Cond.               Cued    Noncued       Cued    Noncued 
High Frequency      .55     .45           .87     .76 
Low Frequency       .41     .28           .77     .69 
Total               .48     .37           .82     .73 


Discussion 


These two experiments were conducted 
to see whether the nature and placement 
of retrieval cues would influence the re- 
call of subspan verbal units. By testing 
subspan units under conditions generally 
assumed to minimize control processes 
such as coding and rehearsal it was hoped 
to determine whether preexisting associa- 
tions would be useful in eliciting correct 
recall. Tulving and Pearlstone (1966) 
have previously reported that the recall of 
superspan lists of words was facilitated 
by retrieval cues when S was encouraged 
to attend to taxonomic associations dur- 
ing presentation of a list. 

The present experiments showed that 
cues did have an effect on the recall of 
subspan units but only when S was given 
a brief opportunity for free recall prior to 
presentation of the cues. Thus, the re- 
sults confirm the value of preexisting 
associations in cueing; however, the ob- 
tained interaction between cues and their 
temporal location suggests that the use- 
fulness of such cues depends in some way 
on the states of stored items at different 
times in a recall interval. It is possible 
that cues presented late in the interval al- 
lowed recall of items that early cueing 
missed, or that early cueing prevented 
recall of items which later cueing did not. 
Which of these or other possibilities ac- 
tually occurred cannot be determined 
from the present data. On the other 
hand, it is apparent that the results are 
consistent with several multiprocess views 
of human information storage that have 
recently been presented (Atkinson & Shif- 
frin, 1967; Waugh & Norman, 1965). 
For example, Atkinson and Shiffrin have 
assumed the existence of both a short- 
term store (STS) which receives selected 
sensory information but loses unrehearsed 
material in a matter of seconds, and a 
long-term store (LTS) which receives 
information from STS and in which stor- 
age is relatively permanent. 

In the context of the experiments pre- 
sented here their model suggests that at 
the beginning of any recall trial one of 
the following conditions is likely to hold: 
(a) the item is relatively strong in STS 
and direct retrieval is probable if carried 
out quickly, (b) the item has been trans- 
ferred to LTS and direct retrieval is prob- 
able, or (c) representation in both STS 
and LTS is relatively weak and direct re- 
trieval from either is improbable. In the 
latter case some sort of search is necessary 
for successful recall. 

This interpretation suggests that the 
presentation of cues at the beginning of a 
recall period would interfere with the re- 
call of items currently stored in STS and 
would delay the time of attempted re- 
trieval. The overall effect would be to 
increase the probability that the item has 
been lost from STS. On the other hand, 
the model also predicts that delayed cues 
would allow S to report the presence of an 
item in STS (or LTS) and assist a search 
of LTS if direct recall is not successful, 
maximizing retrieval of an item if it is 
potentially available. 

The present results can also be ex- 
plained by a uniprocess view of human 
memory by considering the effects of cue- 
ing upon background conditioning. Peter- 
son (1963) has defined background con- 
ditioning as the conditional relationships 
which are established between the desired 
response and unspecified events of the 
general experimental situation. When 
taxonomic cues are presented at the be- 
ginning of a recall interval, S is oriented 
away from the associative cues estab- 
lished during the learning trial. When 
presentation of the cues is delayed, S is 
allowed time to respond to the effects of 
background conditioning. Later he can 
direct his attention to the taxonomic cues 
when they are presented, if necessary. 

In conclusion, whether one prefers a 
uniprocess or a multiprocess view of hu- 
man memory, the results indicate that the 
measurement of short-term retention is a 
function of the method used to cue recall 
and of the placement of special cues within 
the recall interval. It seems appropriate 
that these relationships should be taken 
into account when theoretical statements 
about the availability of previously learned 
material are based on performance data. 


INTERACTION OF MIDCHAIN DETENTION AND REWARD 
MAGNITUDE IN INSTRUMENTAL CONDITIONING¹ 


H. WAYNE LUDVIGSON 


Texas Christian University 


60 female rats received 120 food-rewarded trials under different 
reward-detention combinations: 20-0, 20-15, 1-0, 1-3, 1-15, where the 
numbers designate number of pellets of reward (.045 gm.) and length 
of detention (sec.), respectively. The apparatus was an L-shaped 
alley with a detention chamber located at the juncture of the alley 
segments. Starting and running speeds approaching the detention 
provided no statistical evidence of an interaction between reward and 
detention although detention reduced running speeds more under the 
small reward than under the large reward condition. This interaction 
was clearly evident and significant in behavior which was incompatible 
with approach to the goal, especially that occurring at the entry of the 


detention chamber. 


This experiment explored a funda- 
mental problem in the study of goal 
interference: Does the extent of the 
disturbance produced by the interfer- 
ence depend on properties of the goal? 
More specifically, this study sought to 
determine whether the inhibition of re- 
sponse strength produced by detaining 
or delaying S toward the middle of an 
instrumental response sequence varies 
as a function of reward magnitude. 
The manner in which the inhibition de- 
velops and the nature of the resulting 
acquisition function were also of inter- 
est, as were certain theoretical impli- 
cations. 


METHOD 


Subjects—Sixty experimentally naive, fe- 
male, albino rats from the Cheek-Jones 
Company, 92-98 days old, began the experi- 
ment. 

¹ This investigation was supported by 
Public Health Service Research Grant MH 
11029-01, from the National Institute of 
Mental Health. The diligent work of 
Robert A. Gay and Stephen E. Gay in 
executing this experiment is gratefully 
acknowledged. 

Apparatus and response measures—The 
apparatus was an L-shaped double runway, 
consisting of a start box (SB), a straight 
alley (A1), a midbox (MB), a second alley 
(A2), and a goal box (GB), as described in 
Ludvigson (1967). Unlike the previous 
study the voltage across the 7-w. lamps above 
the alley was 70 v. A fan masked incidental 
noises. Guillotine start and retrace doors 
separated the five sections of the runway. 
Starting speed was calculated on the basis 
of the time lapse between the opening of the 
SB door and S’s crossing an infrared photo- 
beam located 6 in. from the start door. The 
time taken by S in traversing the next 4 ft. 
to a second photobeam provided the basis for 
the A1 running speed measure, while the A2 
running speed came from interruption of 
photobeams in A2 located 2 ft. and 8 ft. 
from the MB door. All measures were 
expressed in ft/sec. 
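
The three speed measures are each a fixed photobeam distance divided by an elapsed time. The conversion might be sketched as follows; the distances come from the description above, while the function name and the sample times are hypothetical.

```python
# Distances between timing points, in feet, as given in the apparatus description.
START_DISTANCE_FT = 0.5    # SB door to first photobeam (6 in.)
A1_RUN_DISTANCE_FT = 4.0   # first to second photobeam in A1
A2_RUN_DISTANCE_FT = 6.0   # photobeams 2 ft. and 8 ft. from the MB door


def speed_ft_per_sec(distance_ft, elapsed_sec):
    """Convert a recorded time over a known distance to a speed in ft/sec."""
    if elapsed_sec <= 0:
        raise ValueError("elapsed time must be positive")
    return distance_ft / elapsed_sec


# Hypothetical times for one trial (not data from the experiment).
print(speed_ft_per_sec(START_DISTANCE_FT, 0.25))   # starting speed
print(speed_ft_per_sec(A1_RUN_DISTANCE_FT, 2.0))   # A1 running speed
print(speed_ft_per_sec(A2_RUN_DISTANCE_FT, 3.0))   # A2 running speed
```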

Several measures were recorded of be- 
havior which competed with forward prog- 
ress in the alley. A retrace was defined as 
an instance of backward motion in the run- 
way, and a stop was any cessation of for- 
ward motion which did not eventuate in a 
retrace. The locus of occurrence of a stop 
or initiation of a retrace was also observed, 
so that the incidence of competing behavior 
(CB) was obtained for six different sections 
of the runway (CB1-CB6). The first section 
(CB1) extended 27 in. into the alley from 
the SB door, CB2 covered the next 27 in., 
and CB3 covered the next 9 in. which ex- 
tended from the second photobeam to a point 
3 in. into the MB. The sections of the 
second alley (CB4-CB6) were similar except 
their lengths were 48 in., 48 in., and 20 in., 
respectively. The frequency of occurrence 
of CB was converted to a “density” measure 
to indicate the average incidence of CB per 
foot of the alley per trial. 
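
Because the six sections differ in length, raw CB counts are not comparable across sections until they are divided by section length and number of trials. A minimal sketch of the density conversion follows, assuming the section lengths listed above; the function name and the example count are hypothetical, and any averaging over Ss is left out.

```python
# Section lengths in inches for CB1-CB6, as listed in the text.
SECTION_LENGTH_IN = {1: 27, 2: 27, 3: 9, 4: 48, 5: 48, 6: 20}


def cb_density(cb_count, section, n_trials):
    """Incidence of competing behavior per foot of alley per trial."""
    length_ft = SECTION_LENGTH_IN[section] / 12.0
    return cb_count / (length_ft * n_trials)


# Hypothetical example: 18 stops or retraces in CB3 over one block of 12 trials.
print(cb_density(18, section=3, n_trials=12))
```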

Procedure—On Day 1 of the experiment 
Ss were begun on a 24-hr. feeding cycle in 
which they received 9 gm. of dry mash, or, 
when training began, an amount which to- 
gether with the food received during train- 
ing would equal approximately 9 gm. On 
Days 14-17, 10 pellets of the type later to be 
used as reward (.045 gm.) were incorporated 
into the daily ration. Water was available 
at all times except during a trial in the 
runway. 

On Day 14 each S was removed from its 
cage and accustomed to handling for 2 min. 
To habituate Ss to the apparatus, they were 
allowed 5 min. of exploration in each alley 
on each of Days 15-17. No food was pres- 
ent in the GB on these days. 

Training began on Day 18. The Ss were 
run in squads of 10, with two Ss from each 
experimental group represented in each 
squad. The 10 Ss were rotated through 
their three daily trials for 40 days with an 
intertrial interval of about 7 min. A trial 
began with the placement of S into the SB 
through an end door. The SB door was 
opened after S had maintained an orientation 
toward it for 1 sec. The Ss which received 
no detention were allowed unobstructed 
passage to the GB. Delayed Ss were con- 
fined to the MB by the retrace door be- 
tween A1 and the MB for a prescribed inter- 
val before the door leading to A2 was 
opened. If S remained in one section of the 
runway for 2 min. it was gently guided into 
the next section and a time of 120 sec. was 
recorded for that section. The S was re- 
moved from the GB and returned to its 
cage immediately after it had eaten its 
reward in the GB. Two Es working to- 
gether conducted the experiment. 

Each of five groups received one of the 
following magnitude-detention combinations, 
where the first number designates the num- 
ber of pellets and the second the detention 
in seconds: 20-0, 20-15, 1-0, 1-15, 1-3. 


RESULTS 


Alley 1: Response strength ap- 
proaching the detention—Mean start- 
ing and running speeds for A1 are pre- 
sented in Fig. 1 and 2. An analysis of 
variance was performed on the data 
for each measure over all 10 blocks 
of trials, excluding Group 1-3, with 
Reward magnitude, Detention condi- 
tion, and Blocks of trials as the factors. 
Detention significantly reduced both 
starting and running speeds, Fs (1, 
42) = 40.84 and 70.80, p < .001, while 
the larger reward produced signif- 
icantly faster running speeds, F (1, 
42) = 25.00, p < .001, but not starting 
speeds, F (1, 42) = 2.65.² Although 
in the running measure, detention ap- 
peared to have a somewhat greater in- 
hibitory effect when combined with the 
small reward, the interaction of Reward 
and Detention was not significant, F 
(1, 42) = 1.97. In separate analyses 
Groups 1-0 and 1-3 did not differ sig- 
nificantly either in starting or running 
speeds, Fs (1, 22) < 1. 

Fig. 1. Mean starting speeds (ft/sec) in Alley 1 
for the various combinations of reward mag- 
nitude (R) and detention (D) in blocks of 
12 trials. 

Fig. 2. Mean running speeds (ft/sec) in Alley 1 
for the various combinations of reward 
magnitude (R) and detention (D) in blocks 
of 12 trials. 

² To achieve proportionality of cell fre- 
quencies, missing data for two Ss which died 
in the course of the study were replaced with 
means of the available data from the cor- 
responding treatment combinations. The de- 
grees of freedom were reduced accordingly. 

Since the functions relating the in- 
cidence of stops and retraces to trials 
were similar in form, the two measures 
were summed to provide a single index 
of competing behavior. Mean densities 
of such CB measures are presented in 
Fig. 3, indicating the frequency of CB 
in various segments of the runway and 
stages of acquisition. Because the as- 
sumptions for parametric statistics 
were untenable, the Mann-Whitney U 
test and the Wilcoxon matched-pairs 
signed-ranks test were employed in the 
following analyses. The main effect of 
reward magnitude was tested by pool- 
ing across the levels of detention with 
which magnitude was completely 
crossed. Thus Groups 20-0 and 20-15 
were combined as were 1-0 and 1-15, 
with comparisons between them yield- 
ing z = 1.88, 2.21, and 2.26 for the 
sums across all trials of CB1, CB2, and 
CB3, respectively. The latter two are 
significant at the .05 level. Analo- 
gously the delay variable (Groups 20-0 
and 1-0 vs. 20-15 and 1-15) yielded z 
= 5.25, 5.47, and 5.50; p < .001 for 
all three segments of A1. 

Interactions for the CB measure 
were evaluated by pairing Ss from 
Groups 20-0 and 20-15 as well as Ss 
from Groups 1-0 and 1-15 on the basis 
of membership in the same squad. Dif- 
ference scores from these pairs were 
again paired on the same basis and the 
Wilcoxon test applied. The results
were T (10) = 10.5, 4.5, and 3.5 for
CB1, CB2, and CB3, respectively. The
latter two are significant (p < .02) in- 
dicating that in the two alley segments 
closest to the MB detention produced 
more CB in Ss receiving the small as 
compared with the large reward. 
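The matched-pairs, signed-ranks statistic used above can be sketched as follows. This is a minimal illustration, not the authors' computation: the function name `wilcoxon_t` and the difference scores are hypothetical, and T is taken as the smaller of the two signed-rank sums after zero differences are dropped, with tied absolute values sharing their mean rank.

```python
def wilcoxon_t(differences):
    """Return (T, N): the smaller signed-rank sum and the count of nonzero differences."""
    nonzero = [d for d in differences if d != 0]
    # Indices of the nonzero differences, ordered by absolute size.
    ordered = sorted(range(len(nonzero)), key=lambda i: abs(nonzero[i]))
    ranks = [0.0] * len(nonzero)
    i = 0
    while i < len(ordered):
        # Find the run of tied absolute values starting at position i.
        j = i
        while j + 1 < len(ordered) and abs(nonzero[ordered[j + 1]]) == abs(nonzero[ordered[i]]):
            j += 1
        mean_rank = (i + j) / 2 + 1  # ranks are 1-based; ties share the mean rank
        for k in range(i, j + 1):
            ranks[ordered[k]] = mean_rank
        i = j + 1
    positive = sum(r for r, d in zip(ranks, nonzero) if d > 0)
    negative = sum(r for r, d in zip(ranks, nonzero) if d < 0)
    return min(positive, negative), len(nonzero)


# Hypothetical difference scores (invented for illustration only).
scores = [4, -1, 6, 3, -2, 5, 7, 0, 8, 9, 10]
t_value, n_pairs = wilcoxon_t(scores)
print(f"T ({n_pairs}) = {t_value}")
```

The obtained T is then compared with the tabled critical value for N pairs; a smaller T indicates a more consistent direction of difference.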

To determine the reliability of the 
apparent rise to a maximum followed by
a decline in Group 1-15, Block 3 was
separately compared with Blocks 1 and
10. The values of T (11) = 9.5, p <
.05; T (10) = 4, p < .02; and T (11)
= 0, p < .01, were obtained for the
three successive sections of A1 on the
comparison between Blocks 1 and 3. 
The values for the analogous compari- 
sons between Blocks 3 and 10 were T 
(11) = 7, p < .02; T (10) = 2, p <
.01; and T (10) = 20, which is not sig-
nificant. Similar comparisons in the
CB1 measure indicated that Groups 1-3
and 20-15 also rose significantly from
Block 1 to 4, T (9) = 0, p < .01, and
T (10) = 2, p < .01, respectively.
However, only Group 1-3 dropped sig-
nificantly from Block 4 to 10, T (9)
= 3.5, p < .05.

Fig. 3. Mean density of competing behavior in the three sections of A1
over the course of training in blocks of 12 trials.

Alley 2: Response strength following
the detention—Running speeds are
presented in Fig. 4. An analysis of 
variance, again excluding Group 1-3, 
revealed that, of the effects of interest, 
only the Reward factor was significant, 
F (1, 42) = 6.24, p< .05. The ap- 
parent interaction between Reward and 
Detention was significant only at the
.10 level, F (1, 42) = 2.92. These
conclusions did not change when only 
asymptotic speeds over Blocks 6-10 
were analyzed. In accord with the 
conclusion that detention did not re-
liably affect A2 speed, Groups 1-0 and
1-3, in a separate analysis, did not dif- 
fer significantly over Blocks 6-10, F 
(1,22) = 2.92. 
There was very little CB in A2, and
it occurred mostly on early trials and 
decreased to a negligible amount within 
the first few blocks of trials. 

Fig. 4. Mean running speeds in Alley 2
for the various combinations of reward mag-
nitude (R) and detention (D) in blocks of
12 trials.


DISCUSSION 


The speed data provide only a sug- 
gestion that reward magnitude and deten- 
tion interact in determining the strength 
of approach toward the detention cham-
ber. The CB data, however, indicate that
such an interaction, while weak in those 
segments of the response sequence re- 
flected in the speed measures, occurs 
strongly immediately prior to the point 
of detention (CB3). A speed measure
from this latter section of the runway 
would probably have revealed this inter- 
action also. Taken together, then, the 
data suggest that detention disrupts ante- 
cedent approach behavior regardless of 
the magnitude of the goal object, but 
that the disruption appears increasingly 
stronger under small as compared with 
large reward as S nears the detention 
chamber. 

Changes in the response measure as a 
function of training were of particular 
interest in the light of previous similar 
research. Ludvigson, Caul, Korn, and 
McHose (1964), and Ludvigson and Mc- 
Hose (1965) found that certain detained 
Ss displayed a marked and protracted 
decline following an initial rise in speed 
of approach to the detention; later in 
training speeds tended to recover.3 While 
the speed functions of the present study 
do not clearly exhibit the decline follow- 
ing an initial rise, there is evidence of an 
inhibitory effect which is maximal some 
trials after the beginning of training and 
which attenuates or is partially counter- 
acted later in training. This is indicated 
by (a) the change over trials of the dif- 
ference in speed between a detained group 
and the comparable nondetained group, 
and (b) the CB data which increased 
sharply through the first three or four 


3 The somewhat perplexing suggestion of 
the Ludvigson and McHose (1965) data that 
detention may not be the factor causing the 
decline in speed now appears erroneous. Re- 
cent investigations (Ludvigson & Sytsma, 
1967; McHose & Ludvigson, 1966) point to 
the operation of a heretofore unrecognized 
source of experimental contamination which 
very likely caused a spurious decline in the 
speeds of nondelayed Ss. 
blocks of trials and then diminished some-
what thereafter. In essential agreement
with these and the previous detention data 
are those of Sgro, Dyal, and Anastasio 
(1967) which clearly show an increasing- 
decreasing inhibitory effect of a delay of 
the type which is imposed following com- 
pletion of the instrumental response chain. 

While the present data have some rele- 
vance for a number of theoretical posi- 
tions only two will be discussed. Deten- 
tion or delay of reward has frequently 
been assumed to produce a reaction of 
frustration, although a delay which is 
constant on all trials (CDR) may not be 
frustrative. Support for the latter view 
comes from (a) the absence in the pres-
ent study of a facilitation of A2 response
strength analogous to the “frustration ef- 
fect” (Amsel & Roussel, 1952), (b) the 
dissimilarity between the effects of CDR 
and partial reward on acquisition per- 
formance (cf., Goodrich, 1959), and (c) 
the failure of CDR to increase resistance 
to extinction in some, though not all, 
studies (e.g., Sgro et al., 1967). Never- 
theless, since this matter remains unsettled 
it is instructive to consider the implica- 
tions of the present data for frustration 
theory. 

Assuming detention produces frustra-
tion, the magnitude of anticipatory frus-
tration, rF, should be directly related to
reward magnitude (Amsel & Surridge,
1964). The behavioral consequence of
this relationship is the resultant of essen-
tially antagonistic properties of rF, viz.,
(a) a contribution to incentive motivation
which increases with rF magnitude, and
(b) response inhibition which increases
with rF but which is also diminished to
the extent that sF has been associated with
approach to the goal (Amsel, 1967). In
order to account for the present data 
(especially CB3) frustration theory must 
assume that the first of these factors had 
the relatively greater effect, and/or the 
large reward condition was more con-
ducive to the association of sF with the
approach response. It is interesting that 
frustration theory would thus of necessity 
infer the greater disruption of perform- 
ance for the Ss assumed to experience 


the lesser degree of frustration. It is of 
further interest to note that this inference 
is directly opposite to that drawn by 
Amsel and Surridge (1964) in a study 
which demonstrated that a stimulus paired 
with nonreward during partial reward 
training is more disruptive for Ss run- 
ning to a large as compared with a small 
reward. For frustration theory the crit-
ical difference between the two studies
appears to be in the extent to which sF
has been associated with approaching the 
goal. 

Another theoretical position for which 
these data have particular relevance is 
that of cognitive dissonance (Lawrence 
& Festinger, 1962). This view holds that, 
“when the weighted sum of the dissonant 
relationships is held constant, the total 
magnitude of the resulting dissonance de- 
creases as the weighted sum of the con-
sonant relationships increases [p. 44].”
Presumably a dissonant relationship exists
between the cognitions of having re-
sponded and having been detained, while
a consonant relationship exists between
having responded and having received re-
ward. If CB3 can be taken as indicative
of the resulting dissonance, then this is
inversely related to reward magnitude as 
predicted. However, it is not completely 
clear from the theory why CB should in- 
crease over the initial trials. Further- 
more this emergence of CB might rea- 
sonably be considered a “change in be- 
havior,” and, according to the theory, 
changes in behavior should occur only if 
the dissonance aroused by the new action 
is less than that stemming from the origi- 
nal one. It is not obvious how the occur-
rence of CB, increasing as it does the time
and effort involved in reaching the goal, 
arouses less dissonance than approaching 
the delay chamber directly. It is also ap-
parent from the CB3 data that the accrual
of “extra attractiveness” to the detention 
chamber as postulated by the theory is a 
very slow process. 
SIGNAL PROCESSING TIME AS A FUNCTION OF AGING 1
Old and young Ss pressed a right- or left-hand key in response to
the onset of 1 of a pair of lights. In Exp. I, a warning light 
indicated the approaching onset of the stimulus light but also 
communicated the rule governing the ensuing response; i.e., respond 
with key on the same or opposite side as the stimulus light. The 
interval (PI) between warning and stimulus lights was short (100 
msec.) in one block of trials and long (1.5 sec.) in another. The 
difference in RT between the 2 blocks reflected the time S required to 
process the information in the warning light. Results indicated that
signal processing time increased as a function of aging. In Exp. II,
normal reactions were unaffected by PI, but reversed reactions were
slower on short than on long PI trials.


While it is an accepted fact that re- 
action time (RT) increases as a func- 
tion of aging, the locus of the slowing 
has not been well established (Birren, 
1964; Welford, 1958). Early thinking 
tended to regard response mechanisms 
as the primary locus of slowing whereas 
the current view emphasizes the role of 
central mental processes. In the typical
RT experiment, it is impossible to dis- 
sociate the time required to process the 
stimulus from the time required to exe- 
cute the response and, thus, the reason 
for an observed age decrement cannot 
be determined. The present research 
was designed to study the signal pro- 
cessing component of RT. The hy- 
potheses were that signal processing 
time increases as a function of aging 
and that it takes longer to program an 
incompatible response than a compat- 
ible response. 

In the traditional two-choice RT 
task, the rules for responding are 
established prior to a block of trials; 
e.g., S may be told to respond with
the right key to the right light and 
with the left key to the left light. Gen- 
erally, a ready signal indicates the im- 
minent onset of the stimulus. The RT 


1 The author acknowledges the assistance
of Philip Tolin in collecting the data. 
task used in the present experiment dif-
fered from the traditional one in that
the ready signal not only indicated the 
approaching onset of the stimulus but 
also established the “rule” which was 
to govern S’s response on the ensuing 
trial. Specifically, the ready signal in- 
dicated whether S was to respond nor- 
mally (i.e., depress the right key for 
the right light and the left key for the 
left light) or in a reversed manner 
(i.e., right key for left light and left 
key for right light). The Ss performed 
on two blocks of trials. On one block, 
the preparatory interval (PI) was 
“long” so that S had sufficient time 
between ready signal and stimulus to 
establish the appropriate set to respond. 
On the second block, however, the PI 
was “short” so that S was unable to 
complete processing the ready signal 
before the stimulus light appeared. 
Since making the appropriate response 
was contingent on processing the mean- 
ing of the ready signal, the reaction to 
the stimulus light was inevitably de- 
layed on the short PI trials. Thus, the 
difference between RTs under short 
and long PI conditions provided an in- 
dication of the time spent processing 
the ready signal. While others (e.g., 
Davis, 1964) have studied RT in situa-
tions requiring S to combine two bits
of information presented sequentially, 
they have not, evidently, employed the 
present procedure to investigate the 
signal processing component of RT. 


EXPERIMENT I 


Subjects—The Ss were 24 senior citizens 
between the ages of 64 and 86 (average age 
72.3) and 24 undergraduates between the 
ages of 18 and 21 (average age 19.7). Half 
of the Ss in each age group were male and 
half were female. 

Apparatus—The apparatus measured
choice RT to a visual display. The S’s 
task was to react as quickly as possible to 
the display by pressing the correct one of 
two telegraph keys. The display panel was 
positioned about 20 in. from S’s eyes and 
perpendicular to his line of sight. On the 
panel were two green stimulus lights (1-in. 
diameter) mounted 7 in. apart in a horizontal 
line. Midway between these lights were two 
red warning lights (4-in. diameter). The 
warning lights were mounted in a vertical 
line, one 2 in. above and the other 2 in. below 
the imaginary horizontal line connecting the 
two signal lights. 

On the table directly below each stimulus 
light was a telegraph key. The S rested 
both hands on the table top and operated 
the keys with his right and left index finger. 
Onset of a stimulus light activated 
a Hunter klockounter which stopped when 
S depressed a key. Depressing the 
key also turned off both the stimulus light 
and the warning light and signalled E which 
key (or keys) had been pressed. Hunter 
Decade Interval timers were used to con- 
trol the interval between the onset of the 
warning light and the stimulus light (1.5
sec. or 100 msec.) and also to control the 
8-sec. intertrial interval. After each trial, 
E operated a four-position switch to select 
the particular combination of warning light 
and stimulus light for the next trial. 

Procedure and experimental design—The 
Ss were instructed that there would be two 
kinds of test trials, some in which the cor- 
rect response would be to press the key on 
the same side as the stimulus light and 
others in which the correct response would
be just the opposite. They were told, 


Before each test trial, one of the warning 
lights will come on and shortly thereafter 
a stimulus light will come on. The warn- 
ing light does two things. First, the 


warning light tells you that a stimulus 
light is about to appear. Second, the 
warning light tells you how to respond. 
If the top warning light comes on before 
the trial, you should respond with the right 
key to the right light and the left key to 
the left light. If the bottom light comes 
on before the trial, you should respond 
with the right key to the left light and 
with the left key to the right light. You 
will be given a series of trials in which 
normal and reversed trials will be pre- 
sented in random order. It is important 
that you respond as fast as you can on 
each trial without making errors. 


All Ss performed on two blocks of trials; 
in one block the preparatory interval (PI) 
between the onset of the warning light and 
stimulus light was 100 msec. (short PI) and, 
in the other block, this interval was 1.5 sec. 
(long PI). Half of the Ss in each Age X 
Sex subgroup performed the short PI trials 
first while the other half performed the long 
PI trials first. The Ss were told the length 
of the PI prior to beginning a block of 
trials. Each block consisted of 60 test 
trials. Within a block, each of the four 
possible pairings of a warning light (top 
or bottom) and a stimulus light (left or 
right) was presented equally often in a pre- 
determined random sequence. Each block 
was preceded by 8 practice trials, which in- 
cluded two of each of the four combinations 
of warning and stimulus lights. 
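The balanced trial order described above can be sketched as follows. This is only an illustration under stated assumptions: the function name `make_block`, the light labels, and the use of a seeded shuffle are hypothetical, since the source says only that the sequence was random and predetermined.

```python
import random
from collections import Counter

WARNING_LIGHTS = ("top", "bottom")   # top = normal rule, bottom = reversed rule
STIMULUS_LIGHTS = ("left", "right")


def make_block(n_trials=60, seed=None):
    """Return a shuffled list of (warning, stimulus) pairings, each equally frequent."""
    pairings = [(w, s) for w in WARNING_LIGHTS for s in STIMULUS_LIGHTS]
    if n_trials % len(pairings) != 0:
        raise ValueError("n_trials must be a multiple of the number of pairings")
    trials = pairings * (n_trials // len(pairings))
    # A fixed seed makes the sequence reproducible, i.e., "predetermined".
    random.Random(seed).shuffle(trials)
    return trials


test_block = make_block(60, seed=1)      # 15 presentations of each pairing
practice_block = make_block(8, seed=2)   # two presentations of each pairing
print(Counter(test_block))
```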


Results—Median RTs were com- 
puted for each S for each of the four 
experimental conditions; i.e., normal
and reversed reactions with both the 
short and long PI. Incorrect responses 
occurred on about 6% of the trials and 
were omitted from the scoring. 
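The scoring rule just described (one median per S per condition, with incorrect responses dropped) might be sketched as below. The record layout, field names, and RT values are hypothetical and serve only to illustrate the rule.

```python
from collections import defaultdict
from statistics import median


def median_rt_by_condition(trials):
    """Median RT (msec.) of correct responses for each (PI, rule) condition."""
    rts = defaultdict(list)
    for trial in trials:
        if trial["correct"]:  # incorrect responses are omitted from the scoring
            rts[(trial["pi"], trial["rule"])].append(trial["rt_msec"])
    return {condition: median(values) for condition, values in rts.items()}


# Hypothetical records for one S (values invented for illustration only).
trials = [
    {"pi": "short", "rule": "normal", "rt_msec": 610, "correct": True},
    {"pi": "short", "rule": "normal", "rt_msec": 650, "correct": True},
    {"pi": "short", "rule": "normal", "rt_msec": 400, "correct": False},
    {"pi": "short", "rule": "reversed", "rt_msec": 700, "correct": True},
    {"pi": "long", "rule": "normal", "rt_msec": 380, "correct": True},
    {"pi": "long", "rule": "reversed", "rt_msec": 420, "correct": True},
    {"pi": "long", "rule": "reversed", "rt_msec": 440, "correct": True},
    {"pi": "long", "rule": "reversed", "rt_msec": 460, "correct": True},
]
medians = median_rt_by_condition(trials)
print(medians)
```

Medians are less sensitive than means to the occasional very slow response, which is presumably why they were used for individual Ss.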

The top half of Table 1 presents the 
mean RT under each treatment con- 
dition for the two age groups. An 
analysis of variance revealed that RT 
with the short PI was slower than with 
the long PI (898 vs. 552 msec.), F (1, 
44) = 98.17, p < .001. This finding 
was as planned since the short PI con- 
dition was purposely designed so that 
Ss would not have sufficient time to 
complete processing the warning light 
before the signal light was presented. 

TABLE 1

CHOICE RT (MSEC.) AS A FUNCTION OF PREPARATORY INTERVAL,
S-R CORRESPONDENCE, AND AGE

Preparatory              Young                      Old
interval         Normal  Reverse    M       Normal  Reverse    M     Overall

Exp. I
  Short            628     654     641       1104    1206    1155      898
  Long             368     415     391        652     772     712      552
  M                498     534     516        878     989     933
Exp. II
  Short            315     407     361        390     551     471      416
  Long             311     377     344        395     531     463      404
  M                313     392     353        392     541     467


The difference, then, between RTs 
under short and long PI provides a 
measure of time spent processing the 
meaning of the warning signal. 

The analysis of variance also re- 
vealed that normal reactions were faster 
than reversed (688 vs. 762 msec.), F 
(1, 44) = 16.43, p < .001; that young 
Ss responded faster than old (516 vs. 
933 msec.), F (1, 44) = 30.28, p< 
.001; and that, while men tended to 
respond faster than women (670 vs. 
779 msec.), the difference was not 
statistically significant. 

As predicted, the PI x Age inter- 
action was significant, F (1, 44) = 
7.68, p < .01. Table 1 shows that the 
difference in RT between the short and 
long PI conditions was greater for the 
older than for the younger group (443 
vs. 250 msec.). The Age X S-R Com- 
patibility interaction was also signifi- 
cant, F (1, 44) = 4.17, p< .05. It 
may be noted in Table 1 that the dif- 
ference between normal and reversed 
reactions was greater for the older 
group than for the younger group (111 
vs. 36 msec.). The analysis provided 
no evidence of the predicted S-R Com- 
patibility X PI interaction. In other 
words, the time required to prepare for 
a subsequent normal reaction did not 


differ from the time required to pre- 
pare for a reversed reaction. 
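The two interactions reported for Exp. I can be read directly from the marginal means of Table 1. The sketch below simply recomputes the differences quoted in the text; the dictionary layout and function names are illustrative assumptions, and the significance tests themselves require the individual data, which are not reproduced here.

```python
# Mean RT (msec.) for Exp. I, copied from the M column and M row of Table 1.
MEAN_RT = {
    "young": {"short": 641, "long": 391, "normal": 498, "reverse": 534},
    "old": {"short": 1155, "long": 712, "normal": 878, "reverse": 989},
}


def pi_effect(age):
    """Short-PI minus long-PI RT: the estimate of warning-signal processing time."""
    return MEAN_RT[age]["short"] - MEAN_RT[age]["long"]


def compatibility_effect(age):
    """Reversed minus normal RT: the cost of the incompatible S-R mapping."""
    return MEAN_RT[age]["reverse"] - MEAN_RT[age]["normal"]


for age in ("young", "old"):
    print(age, pi_effect(age), compatibility_effect(age))

# Interaction contrasts: how much larger each effect is for the old group.
pi_by_age = pi_effect("old") - pi_effect("young")
compatibility_by_age = compatibility_effect("old") - compatibility_effect("young")
```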


EXPERIMENT II 


Results of Exp. I demonstrated that 
the difference between RT under short 
and long PI conditions was greater for 
older than for younger Ss. In order 
to interpret this finding as evidence of 
an increase in central processing time 
with aging, it was necessary to elim- 
inate an alternative explanation; i.e., 
that older Ss may have had more dif- 
ficulty than the young in responding 
under short PI conditions. Experi- 
ment II, then, was a traditional choice 
RT study in which the warning light 
had but one function, that of signalling 
the approaching onset of the stimulus. 
Independent variables were PI, S-R 
compatibility, age, and sex. 

Subjects —Two groups of Ss were studied, 
an older group of 24 ranging in age from 
63 to 88 (average age 73.9) and a younger 
group of undergraduates ranging in age 
from 18 to 21 (average age 19.2). There 
was an equal number of males and females 
in each age group. 

Apparatus, procedure, and experimental 
design—The apparatus was the same as 
in Exp. I. The procedure, however, differed 
in that normal and reversed trials were pre-
sented in separate blocks rather than in a
random sequence within the same block as
was the case in Exp. I. Each S performed
on four blocks of trials. On two of the 
blocks, one involving a short PI (150 msec.) 
and the other a long PI (1.5 sec.), S was 
instructed to respond “with the right key 
to the right light and with the left key to
the left light.” On the other two blocks, one 
with the short PI and the other with the 
long PI, S was instructed to respond “with 
the right key to the left light and with the 
left key to the right light.” S was informed 
as to the length of the PI prior to each block 
of trials.

A block consisted of 30 test trials in which 
left and right stimulus lights were each 
presented 15 times in a predetermined ran- 
dom sequence. Four practice trials, two 
with each light, preceded the blocks. A 
4X 4 Latin square was used to balance the 
order of presentation of the blocks of trials. 
Three males and three females in each age 
group were randomly assigned to each of 
the four orders. 
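One way to balance the order of four blocks is a cyclic Latin square, sketched below. The source does not say which 4 X 4 square was used, so the cyclic construction, the function name, and the block labels are assumptions made for illustration.

```python
def cyclic_latin_square(conditions):
    """Each row is one presentation order; row r is the list rotated by r places."""
    n = len(conditions)
    return [[conditions[(row + col) % n] for col in range(n)] for row in range(n)]


# Hypothetical labels for the four blocks of Exp. II.
BLOCKS = ["normal/short PI", "normal/long PI", "reversed/short PI", "reversed/long PI"]
orders = cyclic_latin_square(BLOCKS)

for number, order in enumerate(orders, start=1):
    print(f"Order {number}: {order}")
```

Every block occupies every ordinal position exactly once across the four orders, so practice and fatigue effects are spread evenly over the conditions when equal numbers of Ss receive each order.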

The essential difference between the ex- 
periments was that, in Exp. II, S was able to 
establish a set to respond (in a normal or 
reversed fashion) prior to the onset of the 
warning light whereas, in Exp. I, he could 
not establish this set until the onset of the
warning light. In other words, in Exp. II,
the warning light simply signalled the ap-
proaching onset of the stimulus whereas,
in Exp. I, it also communicated the “rule” 
which was to govern S’s ensuing response. 


Results—The data for each S were
processed as in Exp. I. The lower
half of Table 1 summarizes the mean
RT under the various treatment con-
ditions for each age group. An analy-
sis of variance indicated that there was
no difference in RT between the short
and long PI conditions (416 vs. 404
msec.), F (1, 44) = 1.72; that normal
reactions were faster than reversed
(353 vs. 466 msec.), F (1, 44) =
166.67, p < .001; that the young re-
sponded faster than the old (353 vs.
467 msec.), F (1, 44) = 29.36, p <
.001; and that males responded faster
than females (383 vs. 437 msec.), F 
(1, 44) = 6.63, p < .05. 

Of major interest in this experiment 
was the absence of the PI X Age inter-
action, F (1, 44) = .21. The data in


Table 1 clearly indicate that the differ- 
ence between short and long PI con- 
ditions did not vary as a function of 
age group (17 vs. 8 msec.). Two first- 
order interactions were, however, sig- 
nificant. One was the Age X S-R 
Compatibility interaction. As in Exp. 
I, the difference between normal and 
reversed reactions was greater for the 
older group than for the younger (149 
vs. 79 msec.), F (1, 44) = 15.57, p <
.001. The other significant interaction
was S-R Compatibility x PI, F (1, 
44) = 6.67, p < .05. It will be noted 
in Table 1 that reversed reactions were 
faster under the long PI condition than 
under the short (observed for 36 of the 
48 Ss) whereas normal reactions were 
not affected by the PI. 

An overall analysis of the two ex-
periments combined confirmed the re-
sults of the separate analyses but, in
addition, revealed a significant S-R 
Compatibility X Experiment interac- 
tion, F (1, 92) = 4.02, p < .05, re- 
flecting the fact that the difference be- 
tween normal and reversed RT was 
greater in Exp. II than in Exp. I (113 
vs. 74 msec.). This result was due 
mainly to the relatively small difference 
between normal and reversed reactions 
under the short PI condition in Exp. I 
and is reflected in the S-R Compati- 
bility x Experiment X PI interaction, 
F (1, 92) = 3.00, p < .10. Reversed
reactions were actually faster than nor- 
mal reactions for 17 of the 48 Ss under 
the short PI condition in Exp. I. Only 
five such inversions occurred under the 
long PI condition. 


DISCUSSION 


The present research provided a method 
for separating signal processing time from 
the time required to execute a response. 
In Exp. I, the difference between the 
short and long PI conditions was 77% 
greater for the older group than for the 
young. This difference between age 
groups is in marked contrast to the 32%
difference between groups in the tradi- 
tional RT measure used in Exp. II. 
Since Exp. II provided no evidence of 
an Age X PI interaction, results of Exp. 
I are interpreted as demonstrating an in- 
crease in signal processing time with age. 
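The two percentages contrasted above follow from the means in Table 1. A brief check, with the helper name `percent_greater` introduced here purely for illustration:

```python
def percent_greater(larger, smaller):
    """Percentage by which `larger` exceeds `smaller`."""
    return 100.0 * (larger - smaller) / smaller


# Exp. I: short-PI minus long-PI difference (msec.) for old and young Ss.
processing_time_increase = percent_greater(443, 250)

# Exp. II: overall mean RT (msec.) for old and young Ss.
traditional_rt_increase = percent_greater(467, 353)

print(round(processing_time_increase), round(traditional_rt_increase))
```

The age difference in the processing-time estimate is thus more than twice as large, proportionally, as the age difference in conventional choice RT.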

By indulging in some tortuous reason- 
ing, it would still be possible to explain 
the present results without postulating a 
slowing of central processes. This would 
require assuming that, under the short PI 
condition of Exp. I, Ss could process the 
signal light while simultaneously estab- 
lishing the appropriate set to respond. 
Furthermore, one would have to assume a 
deterioration with age in the ability to 
carry on two processes concurrently. It 
is unlikely that these factors, even if they 
did exist, would explain the large dif- 
ference observed between the two age 
groups. It seems more likely that aging 
produces a general slowing in the speed 
with which a stimulus can be interpreted 
and an appropriate response set estab- 
lished. Research on set and aging sug- 
gests that older Ss are less facile in alter- 
nating from one type of arithmetic oper- 
ation to another (Botwinick, Brinley, & 
Robbin, 1958) and may require more time 
than the young to overcome inaccurate 
expectations (Botwinick & Brinley, 
1962). 

The present results fail to provide a 
clearcut answer to the question of whether 
it takes longer to program an incompatible 
response than it does to program a com- 
patible response. The predicted S-R 
Compatibility X PI interaction was not 
significant in Exp. I but was significant 
in Exp. II. Examination of the data 
from Exp. II revealed that normal re- 
actions did not vary as a function of PI 
but that reversed reactions were slower 
under the short than under the long PI 
condition. Perhaps the 100-msec. PI was 
too short for S to establish an optimum 
set to respond to the reversed S-R rela- 
tionships. 

Results of both experiments support the 
well-established finding that RT increases 
when there is a lack of spatial correspond- 
ence between the display and control 
elements comprising the task (Fitts & 


Seeger, 1953; Garvey & Mitnick, 1955). 
Two other aspects of the present results 
bear mention. One was the significant 
Age X S-R Compatibility interaction ob- 
served in both Exp. I and II. This find- 
ing adds to the growing evidence that one 
of the important accompaniments of aging 
is an increasing difficulty with tasks which 
have complicated control-display relation- 
ships (Kay, 1955; Simon, 1967; Welford, 
1958). The other interesting result was 
the finding that the difference between 
normal and reversed RT was smaller 
when both kinds of reactions were in- 
cluded in the same block of trials (Exp. 
I) than when they were in separate blocks 
(Exp. II). This apparent context effect, 
which was particularly marked under the 
short PI condition in Exp. I, deserves 
further study. 
COGNITIVE PROCESSES DURING DIFFERENTIAL TRACE
AND DELAYED CONDITIONING OF THE GSR 1
The concordance between conditional GSR differentiation and ability
to verbalize conditional stimulus relations accurately was assessed 
in 26 trace conditioned Ss. The CSs were 1.5-sec. tones of differing 
frequencies, the UCS was a shock, and the CS-UCS interval was 
8 sec. Division of the ISI and the post-UCS period into 3 intervals 
permitted the analyses of multiple GSRs. Both intertrial reports and 
postconditioning interviews were obtained. Conditional differentiation 
was limited to a group of 10 Ss who accurately verbalized the stimulus 
contingencies during postconditioning interviews. In a comparison 
group of 26 delayed conditioned Ss, more Ss (18) verbalized 
accurately. Both the trace and delayed groups showed significant 1st 
interval GSR differentiation during acquisition. The trace group did 
not differentiate in the 2nd interval during acquisition, nor in any of 
the 3 intervals during extinction, but the delayed group did. Assessing 
GSR differentiation in accurately verbalizing Ss only, trace Ss did not 
condition in the 2nd interval during acquisition nor in the 1st 2 intervals 
during extinction; delayed Ss conditioned in all relevant intervals. 


Demonstrations that cognitive proc- 
esses are involved in the classical con- 
ditioning of human Ss have depended 
chiefly on the use of instructional sets. 
Numerous studies (e.g., Cook & Har- 
ris, 1937; Grings & Lockhart, 1963; 
Mandel & Bridger, 1967) indicate that 
both the acquisition and extinction of 
conditional responses are facilitated 
when S is informed about the stimulus 
contingencies prior to acquisition or 
extinction trials. As an alternative to 
instructional manipulation, cognitive 
processes during conditioning may also
be studied by assessing Ss’ ability to
verbalize the stimulus contingencies.
Although the usefulness of postcondi-
tioning interviews for this purpose has 
been questioned (Grings, 1965), per- 
tinent characterization of stimulus rela- 
tions in Ss’ interviews has been shown 
to be related to semantic conditioning 
and generalization of the GSR (Branca, 
1957; Chatterjee & Erikson, 1960). 
Using simple auditory stimuli in a de- 
layed conditioning paradigm, Fuhrer 
and Baer (1965) found that conditional 
GSR differentiation was limited to Ss 
who accurately verbalized the differen- 
tial stimulus contingencies either dur- 
ing intertrial reports or postcondition- 
ing interviews. In a subsequent study 
(Fuhrer & Baer, 1966), a similar re- 
lationship was found between cognition 
and conditioning when only postcon- 
ditioning interviews were obtained. 

In the present investigation trace 
conditioning was assessed for concord- 
ance between cognitive processes and 
conditioning in an effort to extend the 
finding of concordance previously ob- 
served for delayed conditioning. The 
effects of trace and delayed condition-
ing 4 were compared for differences in
degree of differential conditioning and 
for the probability of Ss accurately 
verbalizing stimulus contingencies. 
Since trace conditioning of the GSR has 
been consistently found less effective 
than delayed conditioning in both 
simple (Baxter, 1966; Kimmel, 1967; 
Rodnick, 1937) and differential con- 
ditioning paradigms (Grings, Lock- 
hart, Zeiner, & Uno, 1964), fewer 
accurately verbalizing Ss were pre- 
dicted for the trace conditioned group. 
This prediction is tenable if verbaliza- 
tion of stimulus contingencies is a cor- 
relate of GSR conditioning irrespective 
of differences between conditioning 
paradigms. 


METHOD 


Female Ss between the ages of 20 and 50 
served as paid volunteers. There were 26 
Ss each in the trace and delayed conditioning 
groups. Successive Ss were assigned on an 
alternating basis to one of the two groups. 
Exosomatic skin resistance and GSRs were 
recorded monopolarly using a constant volt- 
age system supplying 4v. (Montagu & 
Coles, 1966), in conjunction with an Offner 
Type R Dynograph. Basal resistance was 
measured in ohms and the GSR in 
micromhos.

For the delayed paradigm, a pilot group 
of eight Ss psychophysically matched an 8- 
sec., 700-cps tone for loudness with an 8-sec., 
3,500-cps tone of 59 db.; as a result the 
700-cps tone was set at 70 db. These decibel 
levels were measured at the earphones 
through which the tones were delivered. 
For the trace paradigm, the 1.5-sec., 3,500- 
cps tone was matched to the 8-sec., 3,500-cps
tone and set at 65 db. Last, the 1.5-sec., 700- 
cps tone was set at 75 db. applying the same 
psychophysical procedure to match tones used 
in the delayed procedure. 

The UCS was a .25-sec., 100 pulse/sec 
shock applied to the left forefinger. For the 


4 While both the trace and delayed groups 
were run during the same period, it should 
be noted that the results from the delayed 
group have been reported in part by Fuhrer 
and Baer (1965).


delayed conditioning group, shock onset was 
coincident with offset of one of the 8-sec. 
tones. For the trace conditioned group, 
shock onset occurred 6.5 sec. after offset of 
one of the 1.5-sec. tones. The variable dura-
tion of Ss’ intertrial reports was largely the
basis of the variable intertrial intervals (for
the delayed conditioned group, M = 50 sec.,
SD = 16 sec.; for the trace conditioned
group, M = 46 sec., SD = 18 sec.).

The S was seated in a sound-attenuated, 
semidarkened room adjacent to the instrumentation
room. After electrodes were
applied, shock was adjusted to S’s tolerance
level. Instructions5 were given that
whenever the investigator said “report” over 
the intercom, S was to state aloud her ob- 
servations, thoughts, and feelings regarding 
events in the experiment. References to 
the CSs or the CS-UCS relations were care- 
fully avoided. There were 8 adaptation, 20 
acquisition, and 8 extinction trials. In each 
phase an equal number of the two tones was 
presented in unsystematic order, the same 
tone never more than twice successively. In 
each group, half of the Ss had the 700-cps 
tone paired with shock, and half the 3,500-cps 
tone. The instructions to report followed 
each tone onset by 10-15 sec. After the last
trial, all Ss were interrogated with a 
standard series of increasingly informative 
questions. 


RESULTS 


Verbalizations—Transcripts of the 
intertrial verbal reports and interviews 
for both groups were independently 
content analyzed by three raters for the 
accuracy of Ss’ conceptualizations of 
the conditional stimulus contingencies. 
The raters had no knowledge of the 
GSR conditioning which was obtained 
for each S, nor was verbal data identi- 
fied by S’s name. The Ss were classi- 


5 Copies of the instructions for intertrial 
reports, the postconditioning interview, and 
the rating schedules for both have been de- 
posited with the American Documentation 
Institute. Order Document No. 10029 from 
ADI Auxiliary Publications Project, Photo- 
duplication Project, Photoduplication Ser- 
vice, Library of Congress, Washington, D. C. 
20540. Remit in advance $1.25 for micro- 
film or $1.25 for photocopies and make 
checks payable to: Chief, Photoduplication 
Service, Library of Congress.
fied as accurate or inaccurate verbalizers
once by intertrial reports and
again by postconditioning interviews. 
Assignment of S to an accurately ver- 
balizing group required either two rat- 
ings of accuracy not considered ques- 
tionable by either rater, or three ratings 
of accuracy if one or more raters con- 
sidered their rating questionable. Rat- 
ings of interview verbalizations which 
were elicited by questions containing 
information about differential stimulus 
contingencies were not used to classify 
Ss. High interrater reliability was indicated
by the failure to achieve consensus
for only one S’s interview and
another S’s intertrial report. Both Ss 
were in the delayed conditioned group 
and were classified as inaccurate. 

On the basis of intertrial reports 
by the trace conditioned group, 8 
Ss were classified as accurate and 18 
as inaccurate; in the delayed con- 
ditioned group, 15 Ss were classi- 
fied as accurate and 11 as inaccurate, 
χ² = 2.8. Ratings of the interviews
yielded 10 accurate Ss and 16 inaccurate
Ss in the trace group and 18 accurate
Ss and 8 inaccurate Ss in the delayed
group, χ² = 3.8, p < .05, two-tailed.
There were 5 Ss in the two
groups who did not verbalize accurately 
during intertrial reports but did so 
when interviewed following condition- 
ing. No other discordance between 
reports and interviews occurred. The 
classification based on interviews was 
used in the GSR analyses which are 
subsequently reported. Substantially 
similar results were obtained by using 
intertrial reports as the basis of 
classification.
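The two chi-square values above can be recomputed from the reported frequencies. The sketch below is a check, not the authors' stated procedure: the text does not say whether a continuity correction was applied, and the function name `yates_chi_square` is hypothetical. With the Yates correction assumed, both reported values are recovered to one decimal place.

```python
def yates_chi_square(a, b, c, d):
    """Chi-square for the 2 x 2 table [[a, b], [c, d]] with the Yates
    continuity correction (an assumption; the text does not name it)."""
    n = a + b + c + d
    numerator = n * (abs(a * d - b * c) - n / 2) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# Rows: trace group, delayed group. Columns: accurate, inaccurate.
intertrial = yates_chi_square(8, 18, 15, 11)   # classification by intertrial reports
interview = yates_chi_square(10, 16, 18, 8)    # classification by interviews

print(round(intertrial, 1), round(interview, 1))
```

Without the correction the same frequencies give roughly 3.8 and 5.0, so the corrected form is the one that agrees with both reported statistics.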

GSR measures.—Similar to an analysis
suggested by Stewart, Stern,
Winokur, and Fredman (1961), the 
8-sec. interval beginning 1 sec. after 
tone onset was divided into two sub- 
intervals. A GSR originating during 


the subinterval 1-5 sec. after tone onset 
was scored as a first interval response. 
A GSR originating during the remain- 
ing interval, terminating 1 sec. after 
shock onset, was scored as a second 
interval response. A third interval re- 
sponse was scored during extinction 
trials for a GSR originating 9-13 sec. 
following tone onset. Each GSR was 
measured from its onset to its peak, 
and the magnitude expressed as the 
square root of the conductance change. 
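The scoring rules just described can be sketched as a small routine. This is an illustration only: the function names are hypothetical, the treatment of latencies falling exactly on a boundary is an assumption, and shock onset is taken as 8 sec. after tone onset in both paradigms (8-sec. tone in the delayed case, 1.5-sec. tone plus a 6.5-sec. trace interval in the other), which places the end of the second interval at 9 sec.

```python
import math


def score_interval(latency_sec, extinction=False):
    """Classify a GSR by its onset latency in seconds from tone onset.

    Boundaries are half-open (an assumption): 1-5 sec. is the first
    interval, 5-9 sec. the second, and 9-13 sec. the third interval,
    which is scored on extinction trials only.
    """
    if 1.0 <= latency_sec < 5.0:
        return "first"
    if 5.0 <= latency_sec < 9.0:
        return "second"
    if extinction and 9.0 <= latency_sec <= 13.0:
        return "third"
    return None


def gsr_magnitude(onset_micromhos, peak_micromhos):
    """Square root of the conductance change from response onset to peak."""
    return math.sqrt(max(peak_micromhos - onset_micromhos, 0.0))


print(score_interval(3.2), score_interval(7.0), score_interval(10.0, extinction=True))
print(gsr_magnitude(10.0, 10.25))
```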

First interval GSRs—The upper 
left-hand panel of Fig. 1 shows the 
mean first interval GSR magnitudes 
for the 10 trace conditioned Ss who 
accurately verbalized the contingencies 
(Group TA) and the 16 who did not 
(Group TI). The data are plotted in 
blocks of CS+ and CS— trials. For
adaptation the mean of the first four 
CS+ and the first four CS— trials 
constitute the two data points labeled
AD. For acquisition (AQ) three
blocks of three CS+ and three CS—
trials each were similarly averaged and
plotted, AQ1 containing Trials 11-16,
AQ2 containing Trials 17-22, and AQ3
containing Trials 23-28. The first
CS+ and CS— acquisition trials were 
omitted because of the excessive sensi- 
tization effects of the first shock. For 
extinction mean values for the last 
four CS+ and CS— trials were used. 
Three-factor analyses of variance 
(Groups TA vs. TI, CS+ vs. CS— 
and trial blocks) were calculated 
separately for the three phases of the 
conditioning procedure. The analyses 
revealed no relevant differences in 
GSR reactivity during adaptation. 
During acquisition, significantly dif- 
ferentiated GSR magnitudes to CS+ 
and CS— were obtained for Groups 
TA and TI combined, F (1, 24) = 
6.7, p<.025. The degree of dif- 
ferentiation differed markedly between 
the groups, F (1, 24) = 27.3, p < .001,

Fig. 1. First and second interval GSR magnitudes for accurate trace Ss (n = 10), inaccurate
trace Ss (n = 16), accurate delayed Ss (n = 18), and inaccurate delayed Ss (n = 8).


with differentiation being limited to 
Group TA. There was no evidence of 
conditional differentiation during ex- 
tinction for Groups TA and TI com- 
bined, F < 1, nor did one group show 
more differentiation than the other, 
F < 1.

The upper right-hand panel of Fig. 1 
shows mean first interval GSR magni- 
tude for 18 delayed conditioning Ss 
who accurately verbalized the conting- 
encies (Group DA) and the 8 who did 
not (Group DI). Reliable differential 
conditioning occurred during acquisi- 
tion trials for Groups DA and DI com- 
bined, F (1, 24) = 14.0, p < .01. In 
addition, the groups were markedly 
dissimilar in the conditioning which 
was achieved, F (1, 24) = 8.5, p < .01. 
As shown in Fig. 1, only Group DA 
manifested GSR differentiation. The 


two groups continued to differ during 
extinction, F (1, 24) = 5.9, p < .025, 
with Group DA appearing to differen- 
tiate sufficiently to produce a reliable 
indication of differentiation for the 
two groups combined, F (1, 24) = 8.7, 
p <.01. Although Fig. 1 suggests 
greater overall GSR reactivity to CS+ 
and CS— combined for Group DA 
during acquisition, the difference be- 
tween Groups DA and DI was not 
statistically significant, F (1, 24) =
1.73.


To compare the trace and delayed 
conditioned groups directly, irrespec- 
tive of accuracy of verbalization, sepa- 
rate analyses of variance were com- 
puted for adaptation, acquisition, and 
extinction trial blocks. These analyses 
were comprised of Group TA + TI 
vs. DA + DI, CS+ vs. CS—, and trial
blocks. The trace and delayed conditioned
groups differed significantly in
first interval GSR magnitudes to CS+ 
and CS— combined during adaptation, 
F (1, 50) = 6.9, p < .025. This find- 
ing reflected the greater overall reactiv- 
ity of the trace conditioned group. 
There was, however, no evidence that 
the reactivity of the groups differed 
reliably during acquisition, F < 1, or 
extinction, F < 1. The trace and de- 
layed groups did not differ significantly 
in GSR differentiation to CS+ and 
CS— during acquisition, F (1, 50) = 
1.1, or extinction, F (1, 50) = 1.5. 

Second interval GSRs.—The second 
interval GSR magnitudes for Groups 
TA and TI are depicted in the lower 
left-hand panel of Fig. 1. The dis- 
tribution of second interval data dic- 
tated the use of nonparametric analyses. 
The mean difference between GSR 
magnitudes to CS+ and CS— trials 
was calculated separately for each S. 
Two-tailed tests of significance were 
used throughout. Use of the Wil- 
coxon test indicated there was no sta- 
tistically significant GSR differentia- 
tion for Groups TA and TI combined 
or considered separately during com- 
bined acquisition or extinction trial 
blocks. It may be noted in Fig. 1 that 
second interval GSRs for Group TA 
appear to differentiate during the final 
set of acquisition trial blocks. Post 
hoc analyses revealed significant dif- 
ferentiation for Group TA as well as 
for the entire trace group, T (10) = 6,
p < .05; T (26) = 41, p < .01,
respectively.
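For readers unfamiliar with the statistic, the Wilcoxon T reported here is the smaller of the two signed-rank sums computed on each S's CS+ minus CS— difference. The sketch below shows that computation on made-up difference scores; the data and the function name `wilcoxon_t` are hypothetical and are not the study's values.

```python
def wilcoxon_t(differences):
    """Wilcoxon's T: the smaller of the positive and negative rank sums.

    Zero differences are dropped and tied absolute values share the
    mean of the ranks they occupy.
    """
    ordered = sorted((d for d in differences if d != 0), key=abs)
    rank_sums = {"pos": 0.0, "neg": 0.0}
    i = 0
    while i < len(ordered):
        j = i
        while j < len(ordered) and abs(ordered[j]) == abs(ordered[i]):
            j += 1
        mean_rank = (i + 1 + j) / 2  # mean of ranks i+1 .. j
        for d in ordered[i:j]:
            rank_sums["pos" if d > 0 else "neg"] += mean_rank
        i = j
    return min(rank_sums.values())


# Hypothetical difference scores (in hundredths) for 10 Ss.
example = [3, 1, -2, 5, 4, -1, 6, 2, 7, 8]
print(wilcoxon_t(example))
```

A small T means that nearly all of the rank weight lies on one side, which is why the smaller values of T are the significant ones in the text.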

During acquisition, Group DA and 
DI combined showed significant second 
interval GSR differentiation to CS+ 
and CS—, T (26) = 72, p < .01.
When the groups were considered
separately, Group DA showed significant
differentiation, T (18) = 4, p < .01,
but Group DI did not, T (8) = 17.


The combined groups continued to differentiate
significantly during extinction,
T (26) = 14, p < .01. When
the groups were considered separately,
Group DA showed significant resistance
to extinction, T (18) = 6, p < .01,
while Group DI showed no evidence
of differentiation, T (8) = 14.

The conditional differentiation of 
second interval GSR magnitudes of 
the trace and delayed conditioning 
groups was directly assessed using the 
Mann-Whitney test. For all acquisi- 
tion trial blocks, the mean difference 
score of 1.17 for the delayed group
exceeded the mean of .41 for the trace
group, z = 1.87, p < .07. The two
accurate groups did not differ signif- 
icantly for the combined acquisition 
trial blocks, U (10, 18) = 68, or for 
the set of extinction trial blocks, U (10, 
18) = 84. 
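The U statistics for the two accurate groups can be read against a simple benchmark: with group sizes of 10 and 18, U ranges from 0 to 180 and is expected to be 90 when the groups do not differ. The sketch below computes U by direct pair counting on made-up scores; the data and the function name `mann_whitney_u` are hypothetical, not taken from the study.

```python
def mann_whitney_u(x, y):
    """Smaller Mann-Whitney U for two independent samples.

    Each (x, y) pair in which x exceeds y counts 1 and each tie counts
    one half; the smaller of U_x and U_y is returned.
    """
    u_x = 0.0
    for xi in x:
        for yj in y:
            if xi > yj:
                u_x += 1.0
            elif xi == yj:
                u_x += 0.5
    u_y = len(x) * len(y) - u_x
    return min(u_x, u_y)


# Hypothetical difference scores for two small groups.
group_a = [1, 4, 6]
group_b = [2, 3, 5, 7]
print(mann_whitney_u(group_a, group_b))

# Null expectation for the group sizes reported in the text.
expected_u = 10 * 18 / 2
print(expected_u)
```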

Third interval GSRs.—Mean third 
interval GSR magnitudes for the four 
groups during the combined extinction 
trials are contained in Table 1. Com- 
bining Groups TA and TI provided no 
evidence that the trace conditioned 
group reliably differentiated during extinction,
T (26) = 111. However,
Group TA was found to differentiate
significantly, T (10) = 4, p < .02,
though Group TI did not, T (16) = 32.

TABLE 1

MEAN THIRD INTERVAL GSR MAGNITUDES
DURING EXTINCTION

Group                    CS+    CS—
Trace
  Accurate (n = 10)      .24    .07
  Inaccurate (n = 16)    .14    .16
  Combined (n = 26)      .18    .13
Delayed
  Accurate (n = 18)      .34    .19
  Inaccurate (n = 8)     .11    .15
  Combined (n = 26)      .27    .18

The delayed conditioning group,
comprised of Groups DA and DI,
showed significant differentiation,
T (26) = 71, p < .01. When the groups
were considered separately, Group DA 
demonstrated reliable conditioning, 
T (18) = 23, p < .01, but Group DI 
did not, T (8) = 17. Use of the
Mann-Whitney test indicated that the 
delayed and trace conditioned groups 
did not differ significantly in third 
interval differentiation during extinction,
z = 1.21. The two accurate
groups also did not differ, U (10, 18)
= 82.


DISCUSSION


The present results indicate clearly that 
ability to verbalize stimulus contingencies 
accurately is a correlate of differential 
trace conditioning as well as delayed conditioning.
The group of accurately verbalizing
trace conditioned Ss showed
statistically significant GSR differentiation
for first interval responses during the
combined acquisition trials, for second in- 
terval responses during the last set of 
acquisition trials, and for third interval 
responses during the combined extinction 
trials. There was no reliable evidence of 
conditioning during acquisition or extinc- 
tion trials for the group which inaccu- 
rately formulated the stimulus contingen- 
cies or omitted references to them. 

The concordance between accuracy of 
verbalization and conditional GSR differ- 
entiation which was obtained in the de- 
layed conditioning group does not appear 
to be due to the use of intertrial reports 
since a similar concordance was obtained 
for a delayed conditioning group which 
only received postconditioning interviews 
(Fuhrer & Baer, 1966). It seems im- 
probable that the concordance obtained in 
the trace group is entirely attributable to 
use of intertrial reports, although inter- 
trial reporting may differ in its effects 
in trace and delayed paradigms. 

The delayed procedures yielded more 
Ss who could accurately verbalize the 
stimulus contingencies in either intertrial 
reports or postconditioning interviews. 
The delayed conditioning procedure may 
facilitate recognition of stimulus relations 
by virtue of the coincidence of CS offset 


and UCS onset, as well as the extended 
CS duration. From a cognitive view- 
point, CS-UCS coincidence may promote 
association of stimuli. The extended CS 
may serve to sustain S’s attention, con- 
sequently reducing the probability that the 
UCS will be associated with extraneous 
stimuli.

Differences between the entire trace and 
delayed conditioned groups at the cogni- 
tive level were paralleled by differences 
in conditional GSR differentiation. Sec- 
ond interval GSR differentiation during 
combined acquisition trials was greater 
for the delayed conditioned group. In- 
direct comparison of the groups also indi- 
cated the superiority of delayed condition- 
ing. Statistically significant conditional 
GSR differentiation occurred in the trace 
group only for first interval GSRs during 
the combined acquisition trials and for 
second interval GSRs during late acquisi- 
tion trials. On the other hand, the de- 
layed conditioning group reliably differ- 
entiated in all relevant intervals for com- 
bined trials during both acquisition and 
extinction. It may be concluded that the 
differences in conditioning between the 
groups are in part a function of the num- 
ber of accurately verbalizing Ss within 
each group. 

The concordance between cognition and 
conditioning within both the trace and 
delayed groups is complemented by the 
comparison between the groups which in- 
dicates conditioning and cognition may 
well be jointly susceptible to manipulating 
the parameters of conditioning. Further 
elucidation of the role of cognition in 
conditioning requires study of the extent 
to which parameters known to affect GSR 
conditioning also affect Ss’ ability to 
formulate the stimulus contingencies. 

There also was evidence that trace con- 
ditioning is less effective than delayed 
conditioning even when only accurately 
verbalizing Ss are considered. The accu- 
rately verbalizing delayed conditioned 
group manifested reliable conditional dif- 
ferentiation in all relevant intervals dur- 
ing combined trials for both acquisition 
and extinction. In contrast, accurately 
verbalizing trace conditioned Ss only differentiated
significantly in the first interval
during acquisition, in the second interval
late during acquisition, and in the
third interval during extinction. These 
differences do not appear attributable to 
delayed Ss having recognized the stimulus 
contingencies earlier than did trace Ss. 
Inspection of intertrial reports for the 
trial number of the first accurate report 
revealed no differences between the two 
groups. Evidently differences between 
the trace and delayed paradigms do not 
rest solely on cognitive grounds. 
Differences between the accurate groups 
in the intervals in which GSR differenti- 
ation occurred are relevant to current 
interest in the analysis of GSR condition- 
ing during extended CS-UCS intervals 
(e.g., Gale & Stern, 1967; Lockhart,
1966; Prokasy & Ebel, 1967). The pres- 
ent findings suggest two possible inter- 
pretations. First, the several intervals 
may uniformly differ in their sensitivity 
to reflect conditioning regardless of the 
factors which facilitate or retard condi- 
tioning. Substantiation of this hypothesis 
will require future studies in which con- 
ditioning during the various intervals is 
assessed as a function of systematic alter- 
ations of the conditioning paradigm. Sec- 
ond, the various intervals may reflect 
unique processes which are differently 
affected by the particular characteristics 
of the conditioning paradigm. For ex- 
ample, first interval responses may be 
characterized as conditioned orienting 
activity (Gale & Stern, 1967), while sec- 
ond interval responses may be viewed as 
concomitants of a preparatory set in 
anticipation of the UCS (Lockhart, 
1966). Conceptualized in terms of preparatory
set, the present findings of
poorer second interval conditioning in the
trace group appear consistent with previous
reports that reaction time is slower
following a trace stimulus (Behar &
Adams, 1966) and that less UCR inhibition
occurs (Baxter, 1966).


A total of 96 Ss were each presented 32 high-meaningful (M) and
32 low-M items for a single trial at a 3-sec. exposure rate. Using a 
recognition measure, half of the Ss were tested for retention immedi- 
ately after presentation and half were tested after a 24-hr. interval. 
Differential forgetting, with greater forgetting of high-M than low-M 
items, was found. These results are compatible with an interference 
theory which predicts greater extraexperimental interference for the 
associations formed between the experimental context and high-M items 
than for similar low-M context-item associations. Alternate explanations
were considered.


Underwood and Postman (1960), 
in attempting to provide an interfer- 
ence explanation of the forgetting of 
a single list of words, noted two likely 
extraexperimental sources of interfer- 
ence: unit sequence interference and 
letter sequence interference. Briefly, 
the former was thought to occur when 
the order of a list of verbal units to be 
learned violated the usual (and over- 
learned) sequence of unit occurrence 
in spoken and written language while 
the latter was thought to occur when 
the unit itself was constructed in vio- 
lation of the “rules” of the language. 
Thus the approximately 20% forget- 
ting which is observed after a 24-hr. 
period when the retention of a single 
list is measured, can be explained by 
use of an interference mechanism 
rather than by a decay mechanism. 
While the Underwood and Postman 
theory has generated a great deal of 
interest and research, the evidence has 
been largely negative (Ekstrand & 
Underwood, 1965). 

There are several possible explanations
for the failure to find positive re-
sults. For example, it may be that the 


1 The present research was performed under
Grant GB-3629 from the National Science
Foundation to David T. Hakes and the
senior author.


retention measures employed in pre- 
vious studies were insensitive or that 
the criteria of learning were too high. 
Both of these possibilities were taken 
into consideration in the design of the 
present experiment: degree of learn- 
ing was kept relatively low and reten- 
tion was measured by a recognition 
test. In addition, meaningfulness (M) 
was varied to provide another test of 
the hypothesis of extraexperimental 
sources of forgetting. 

Suppose a list of items were pre- 
sented with instructions to remember, 
as in a free-recall experiment. These 
items, it can be assumed, are con- 
ditioned to the context of the experi- 
ment (cf. McGovern, 1964). It would 
be expected that during the retention 
interval, associations formed between 
the experimental context and high-M 
items would be more subject to inter- 
ference from extraexperimental asso- 
ciations than would low-M items. 
That is, with the high-M items extinc- 
tion of the association between item 
and context should occur as a func- 
tion of interference from relatively nu- 
merous extraexperimental associations 
which existed between item and other 
contexts and words prior to the start 
of the experiment. Low-M items, on 
the other hand, would be associated
with the experimental context but, by
any definition of M, would have few
other associations. In the absence of
other associations, relatively little in- 
terference could occur between extra- 
experimental associations and the con- 
textual associations of the experiment. 
For this reason, relatively little ex- 
tinction of the associations between 
low-M items and the experimental 
context should occur. 


METHOD 


Materials—The 32 highest-M and 32 low- 
est-M items of Noble’s list (1952) were se- 
lected for use in the experiment. Each of 
these items was paired with an item taken 
from the Thorndike-Lorge (TL) word count 
(1944) such that frequency of occurrence 
was matched for each Noble-TL pair. If 
any pair was judged by the authors not to be 
approximately equal in M value, the TL item 
was dropped and a new item substituted. In 
addition, the 15 Noble items which had the 
lowest M values were paired with TL items 
and were rated as to comparative meaning- 
fulness by advanced undergraduate psychol- 
ogy students. Again, several TL items were 
dropped and others substituted before lists 
of approximately equivalent M were devel- 
oped. Within the limits imposed by a sample 
of 128 items, formal and meaningful intralist 
similarity were kept as low as possible. 

Procedure.—Each of 96 Ss was presented 
80 items for a single trial. The first 10 items 
as well as the last 6 items were “buffer” 
items taken from Noble’s 32 middle-M items 
and were not scored. Each item was pre- 
sented once for a 3-sec. period with an inter- 
item interval of 1 sec. by a Kodak Carousel 
projector. The test items (i.e., nonbuffer
items) were either the 64 Noble items or the 
64 TL items described above. Retention was 
measured by a recognition task in which S 
was presented all 128 Noble and TL test 
items in either of two forms of a check list 
(Form A or B) and asked to check a Yes 
or a No to indicate whether or not the item 
had been presented. In addition S was asked
to rate on a 1 to 4 scale (from “positive” to 
“not at all sure”) how confident he was of 
his judgment. Half of the Ss were tested 
for retention immediately after presentation 
of the items while the second half returned 
after a 24-hr. retention interval. 

Thus, the design of the experiment con- 


tained three between-Ss variables: Items pre- 
sented (Noble or TL), Retention interval 
(Immediate or 24-hr.), and Check list (Form 
A or B) and two within-Ss variables: Item
sample (Noble or TL) and Meaningfulness
(high or low). These were combined in a
2 × 2 × 2 × 2 × 2 factorial design (N = 12
per cell). Another variable of interest, the
P (presented) vs. P̄ (not presented) variable,
is in reality the interaction of the two
samples of items on the check list with the
two samples of items originally presented.
Thus when Noble’s items were presented,
these were the P items and the TL items
were the P̄ items. When the TL items were
presented, then the P and P̄ items were reversed.
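The design can be made concrete with a short enumeration. The sketch below is illustrative only (the variable and function names are hypothetical): it lists the between-Ss cells, confirms that 12 Ss per cell gives the 96 Ss tested, and shows how the presented vs. not-presented status of a check-list item follows from crossing the item's sample with the sample that S was shown.

```python
from itertools import product

# The three between-Ss variables described in the text.
between_ss = {
    "items_presented": ["Noble", "TL"],
    "retention_interval": ["Immediate", "24-hr."],
    "check_list": ["Form A", "Form B"],
}

cells = list(product(*between_ss.values()))
total_ss = len(cells) * 12  # 12 Ss per between-Ss cell


def presentation_status(item_sample, presented_sample):
    """An item is P if its sample was the one presented, otherwise P-bar."""
    return "P" if item_sample == presented_sample else "P-bar"


print(len(cells), total_ss)
print(presentation_status("Noble", "Noble"), presentation_status("TL", "Noble"))
```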

The Ss were run in squads of four with 
the order of presentation of the 64 test items
being rearranged randomly from one squad 
to the next. Immediately after presentation 
of the test items, the two Ss to be tested 
after 24 hr. were dismissed and told to return 
the next day. The two remaining Ss were 
then tested for retention. At each retention 
interval, one S was given Form A and the 
other Form B of the check list. Preliminary
analyses supported the assumption that Ss 
within a squad were independent of each 
other and for this reason Check list and Retention
interval were analyzed as between-Ss
variables.

The Ss were told that they were to be 
presented with a large number of words each 
for 3 sec. and that they were to remember 
as many as they could. They were told that 
some would be familiar words and some 
would be words they had not seen before. 
They were told not to discuss the experiment 
with anyone. At the start of the retention 
portion of the experiment, the check list was 
explained. At the end of the experiment E 
made sure all items were checked and rated. 
The Ss used in the experiment were taken 
from introductory psychology classes and 
assigned randomly to groups. None had 
served previously in a verbal learning experiment.


RESULTS 


The numbers of correct responses 
given by each S under each of the 
several conditions were analyzed.2 The


2 No correction for guessing such as R-W 
for each condition was attempted because 
analysis of these transformed data would 
yield identical Fs, when available, as would

TABLE 1

MEAN CORRECT RESPONSES AS A FUNCTION OF MEANINGFULNESS,
RETENTION INTERVAL, AND PRESENTATION

                              Meaningfulness
                      High                        Low
Retention         P            P̄            P            P̄
Interval        M     SD     M     SD     M     SD     M     SD
Immediate     24.54  4.26  27.73  3.61  25.10  3.65  24.48  5.38
24-hr.        23.48  5.71  18.48  6.16  22.81  4.37  20.83  5.28


basic results are presented in Table 1. 
The maximum possible correct within 
each condition is 32, and it can be seen 
that the means varied from a low of 
18.48 in the 24-hr. high-P̄ group to
a high of 27.73 in the Immediate high-P̄
group.

In the analysis, Retention interval 
(R) was an important variable. After 
24 hr. fewer items were checked correctly
than immediately after presentation,
F (1, 88) = 84.55, p < .001, indicating
forgetting took place during
the retention interval. In addition,
more high-M items were correct than
low in the immediate test while more
low-M items than high were correct
after 24 hr. This R × M interaction
was also significant, F (1, 88) = 12.46,
p < .001.

Retention interval also interacted
with the P variable, F (1, 88) = 13.75,
p < .001. The meaning of this interaction,
as well as that of the R × P × M
interaction, which was also significant,
F (1, 88) = 9.91, p < .001, can
be determined from Table 1. More
P̄ items than P items were correct in
the immediate test, but after 24 hr.
this relationship was reversed and


be found in the analysis of correct responses
alone, e.g., R − W = R − (32 − R) = 2R − 32.
Addition of constants or multiplication
by a constant results in the same F values
being obtained.


more P than P̄ items were correct.
Similarly, the R × P interaction under
high-M conditions differed from
that found under low-M conditions.
That is, in the high-M conditions more
P̄ than P items were correct in the
immediate test but this relationship
was reversed after 24 hr. On the
other hand, in the low-M conditions
more P than P̄ items were correct for
both the immediate and 24-hr. retention
intervals. It would appear that
the significant R × P and R × P × M
interactions were in large part attributable
to a guessing bias by Ss in the
24-hr. groups where the familiar high-M
items were indicated as having appeared.
Under these conditions the
number of P items correct would increase
while the number of P̄ items
correct would decrease.

Additional analyses were performed 
to describe the data more fully. A 
significant difference between the high- 
and low-M conditions was found in 
the immediate retention groups, F (1, 
44) = 11.40, p < .01, indicating better 
retention of the high-M items immedi- 
ately after presentation. After 24 hr., 
however, the direction of difference 
had reversed and more low-M items 
were retained than high. With F (1, 
44) = 3.40, p < .10, the significance
of this difference was marginal.

The Ss rated their confidence in
their individual judgments and a Type 
II Receiver Operating Characteristic 
(ROC) curve, as described by Egan 
and Clark (1966), has been drawn for 
each of the four major groups. These 
data are presented in Fig. 1. The abscissa
represents the probability of
being incorrect, the ordinate represents
the probability of being correct, and 
the individual data points represent 
the probability of being correct and in- 
correct at some level of Ss’ confidence. 
If the probability of being correct and 
incorrect are the same, then chance per- 
formance would result in data points 
lying along the positive diagonal of 
Fig. 1. The essential results of the 
experiment can be seen in the figure. 
That is, position on the negative diag- 
onal represents the degree of discrim- 
inability (the ability of S to discrimi- 
nate between his correct and incorrect 
responses), with a greater distance 
from the positive diagonal represent- 
ing a greater degree of discriminability. 
Thus, it can be seen that correct responses
were best discriminated from
incorrect responses for the high-Immediate
items, and that discriminability
was poorest for the two 24-hr. groups.

Fig. 1. ROC curves showing the four
major groups as a function of Ss’ confidence
ratings given a correct response or an incorrect
response. (Curves: High Immediate, Low Immediate,
High 24 hr., Low 24 hr. Ordinate: P(CONFIDENCE ≥ K | CORRECT);
abscissa: P(CONFIDENCE ≥ K | INCORRECT).)

Notice, however, that the performance
of both 24-hr. groups is
above chance performance. The de- 
gree of discriminability, as measured 
by d_s, is .82 for high Immediate, .62
for low Immediate, .40 for high 24 hr., 
and .39 for low 24 hr. Again the in- 
teraction between Retention interval 
and Meaningfulness is apparent: the 
greater retention or discriminability of 
the high-M groups which is observed 
when retention is first measured dis- 
appears after the 24-hr. retention in- 
terval. 
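The construction of a Type II ROC curve from confidence ratings can be sketched as follows. The counts are made up and the function name `type2_roc` is hypothetical; the only features taken from the text are the 4-point scale running from "positive" (1) to "not at all sure" (4) and the axes of Fig. 1. Each point accumulates the responses given at that confidence level or higher, separately for correct and incorrect responses. The d_s index itself is not computed here.

```python
def type2_roc(correct_counts, incorrect_counts):
    """Return (P(conf >= k | incorrect), P(conf >= k | correct)) points.

    Both lists give response counts per rating, ordered from the most
    confident rating (1, "positive") to the least (4, "not at all sure").
    """
    n_correct = sum(correct_counts)
    n_incorrect = sum(incorrect_counts)
    points = []
    cum_correct = cum_incorrect = 0
    for c, i in zip(correct_counts, incorrect_counts):
        cum_correct += c
        cum_incorrect += i
        points.append((cum_incorrect / n_incorrect, cum_correct / n_correct))
    return points


# Hypothetical rating counts for one group.
points = type2_roc(correct_counts=[20, 10, 6, 4], incorrect_counts=[2, 4, 6, 8])
for x, y in points:
    print(round(x, 2), round(y, 2))
```

Points lying above the positive diagonal, as all of these do, indicate that high confidence is given more often to correct than to incorrect responses.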

Finally, the amount forgotten can be 
compared to the 20% forgetting which 
is taken as the usual loss in the meas- 
urement of retention of a single list. 
In the present experiment, S could 
guess and get items correct. To esti- 
mate the “true” number of correct re- 
sponses the number of wrong responses 
was subtracted from the number of 
right responses. The corrected num- 
ber correct for the high- and low-M 
immediate retention conditions was 
taken as the basis for comparison. 
Under these conditions 49% retention 
was found in the high-M conditions 
(i.e., retention after 24 hr. was 49%
of what it was immediately after learning)
and 66% retention was found for
the low-M conditions. Clearly, forgetting
was greater in the present experiment
than in the usual experiment
measuring retention of a single list. 
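The two retention percentages can be reproduced from the Table 1 means. The sketch below is a check under stated assumptions rather than the authors' documented computation: presented and not-presented items are pooled within each M level (64 items), the correction is right minus wrong, and the Immediate high-M not-presented mean is taken as 27.73, the value given in the text. The names `means`, `corrected`, and `retention` are hypothetical.

```python
# Mean correct out of 32 for (presented, not-presented) items.
means = {
    ("high", "immediate"): (24.54, 27.73),
    ("high", "24-hr."): (23.48, 18.48),
    ("low", "immediate"): (25.10, 24.48),
    ("low", "24-hr."): (22.81, 20.83),
}


def corrected(total_right, n_items=64):
    """Right minus wrong, where wrong = n_items - right."""
    return total_right - (n_items - total_right)


def retention(level):
    """Corrected 24-hr. score as a proportion of the corrected immediate score."""
    immediate = corrected(sum(means[(level, "immediate")]))
    delayed = corrected(sum(means[(level, "24-hr.")]))
    return delayed / immediate


high_pct = round(100 * retention("high"))
low_pct = round(100 * retention("low"))
print(high_pct, low_pct)
```

Under these assumptions the ratios come out at 49% and 66%, matching the figures reported above.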


DISCUSSION


A large number of studies have failed 
to find that retention is related to mean- 
ingfulness. This negative finding has 
been so consistent that it has been 


3 Assuming chance performance to be 32 
(out of 64) correct, the mean correct for the 
high 24-hr. group was 41.96 and t (47) = 
13.83, p < .001.
stated as a general conclusion (Keppel,
1968). However, the present data, as 
well as those obtained from a short-term 
memory experiment by Turnage (1967), 
indicate that high-M items are not as well 
retained as low-M items. Thus, under 
some conditions at least, retention is 
related to meaningfulness; the boundary 
conditions under which the general con- 
clusion holds will have to be more closely 
examined and specified. 

In contrast to the position taken by 
Bahrick (1964) and others, the results 
of the present experiment suggest that 
recognition and recall measures are not 
necessarily the same. The reason for 
this, of course, is the differential for- 
getting as a function of meaningfulness 
obtained by a recognition measure in the 
present experiment and the absence of 
such differential forgetting when reten- 
tion is measured by recall (cf. Keppel, 
1967). 

The design of the present experiment 
attempted to reduce the possibility of 
both letter sequence and unit sequence 
interference. The effects of these two 
sources of interference should have been 
minimal because the responses did not 
have to be reproduced and because only a 
single trial was given. In addition, 
buffer items were used since it was as- 
sumed that extraexperimental associations 
were more likely to interfere with weak 
contextual associations than strong. 

The present results give some support 
to the general concept of extraexperimental
sources of interference as proposed
by Underwood and Postman
(1960), but explanation of the results
either in terms of unit sequence or letter
interference does not appear to be appropriate.
Instead, explanation in terms of
another source of interference could be 
defined as the interference which occurs 
between the association formed between 
an item and the experimental context and 
other more permanent associations exist- 
ing between the same item and other con- 
texts and words. The present results 
clearly show greater forgetting of the 
high- than of the low-M items and this 
would appear to be a function of the


greater use high-M items have in the life 
of the average college sophomore. In 
contrast to the low-M items, the high-M 
items are conditioned to more extraex- 
perimental contexts and are experienced 
more often. Under such conditions the 
association between the item and the ex- 
perimental context for a high-M item 
is more likely to be interfered with by 
these extraexperimental associations than 
would a similar low-M association. The 
reason for this is that the low-M item is 
conditioned to few if any contexts other 
than the one in which the experiment took 
place. 

There are greater numbers of interitem 
associations among a sample of high-M 
items than among a sample of low-M 
items (Deese, 1960). Incorrect identi- 
fication of P items as P items could thus 
occur through associations between P 
items and the experimental context which 
are mediated by certain of the P items. 
Some evidence in support of this was 
found in the response bias data of the 
high-M conditions. While a prediction 
of poorer retention for high-M items 
could be based on interitem associations, 
the mechanism involved in the differential 
forgetting of high- and low-M items 
remains difficult to specify. Under these 
conditions it would appear that an inter- 
ference explanation of the results of the 
present experiment would still have to 
rely heavily on the idea of context inter- 
ference. 

It would appear that a simple decay 
theory of forgetting would have some 
difficulty explaining the present results. 
Such a theory predicts that retention 
of the high- and low-M items would de- 
crease at approximately the same rate. 
On the other hand, Conrad’s (1967) 
modified decay model appears to have 
little difficulty with the present data. 
Although specifically designed to explain 
the results of short-term memory studies, 
Conrad states that “decay involves those 
characteristics of an item that make it 
discriminable . . . from other items in 
an available set [p. 52].” Thus those 
characteristics which make the high-M 
items discriminable immediately after
presentation, according to the theory,
decay during the 24-hr. retention inter- 
val until the high- and low-M items are 
no longer discriminable. Modified decay 
theory would have greater difficulty in 
accounting for the differential forgetting 
found in the present experiment if there 
had been no differences in retention im- 
mediately after learning. In this case 
it would have been difficult to determine 
which class of items was more discrimin- 
able. 

Both interference and decay theories 
can predict the results of the present 
study. However, the results are of con- 
siderable importance to an interference 
theory because forgetting occurs follow- 
ing the learning of a single list. Such a 
theory must identify some mechanism 
such as extraexperimental interference 
to explain this forgetting. Little evi- 
dence favorable to an explanation such as 
this has been found up to the present time. 
The differential forgetting found in the 
present experiment has been predicted by 
interference theory and as such the results
support the theory.


CHOICE REACTION TIMES WITH EQUALLY AND
UNEQUALLY PROBABLE ALTERNATIVES 1


RAYMOND H. HOHLE AND BARRY GHOLSON 2
Institute of Child Behavior and Development, University of Iowa 


Choice reaction times (RTs) were obtained from 2 groups of adult 
female Ss under 2 conditions. Ss in 1 group (n=9) were presented 
3 sets of stimuli (geometrical forms) with 2, 4, and 8 alternatives, 
where the alternatives were presented with equal relative frequencies. 
A 2nd group (n =8) received a sequence of trials on 4 different stimuli 
presented with relative frequencies of .50, .25, .125, and .125. Choice 
RT was an increasing, approximately linear, function of the negative 
logarithm of the relative frequency of occurrence of a stimulus for 
each group, but the slope of the function was significantly greater for 
the group receiving stimuli with equal relative frequencies. Choice 
RTs on trials on which the stimulus was the same as that on the 
immediately preceding trial were significantly faster than on nonrepeat
trials.


Choice reaction time (RT) has been 
shown by a number of investigators to 
be an increasing function—usually 
linear—of stimulus uncertainty, where 
stimulus uncertainty is defined as the 
negative logarithm (Base 2) of the
probability of occurrence of the stimu- 
lus (Hick, 1952; Hyman, 1953; Kauf- 
man & Lamb, 1966; Kaufman & Levy, 
1966; Lamb & Kaufman, 1965). Al- 
though this general relation tends to 
hold whether the choice stimuli are 
presented with equal probabilities (EP
conditions) or with unequal probabil- 
ities (UP conditions), different in- 
vestigators have obtained conflicting 
results concerning differential slopes 
of curves relating RT to uncertainty 
under EP compared to UP conditions. 
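
The definition of stimulus uncertainty given above can be made concrete with a short sketch. The probabilities .50, .25, and .125 are those used in the present study; the function names are illustrative, and the probability-weighted average is added only for orientation and is not part of the analysis reported here.

```python
import math


def stimulus_uncertainty(p: float) -> float:
    """Uncertainty in bits: negative base-2 logarithm of the probability."""
    return -math.log2(p)


def average_uncertainty(probs) -> float:
    """Probability-weighted mean uncertainty of a stimulus set, in bits."""
    return sum(p * stimulus_uncertainty(p) for p in probs)


for p in (0.50, 0.25, 0.125):
    print(f"p = {p}: {stimulus_uncertainty(p)} bits")

up_set = [0.50, 0.25, 0.125, 0.125]  # one unequal-probability set of four stimuli
ep_set = [0.125] * 8                 # eight equally probable stimuli
print(average_uncertainty(up_set), average_uncertainty(ep_set))
```

A stimulus presented on half of the trials thus carries 1 bit, and each halving of its probability adds 1 bit, which is why the three probabilities map onto uncertainty values of 1, 2, and 3.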

Hyman (1953) reported that, for
stimuli with low uncertainty (i.e., high
probability of occurrence), mean choice
RT was higher under conditions of
UP than EP, while the reverse was
true with high stimulus uncertainty.
Similarly, Stone and Calloway (1964)
and Kaufman and Lamb (1966) pre- 
sented data indicating steeper slopes 
for RT-uncertainty curves under EP 
conditions than under UP conditions. 
On the other hand, Lamb and Kauf- 
man (1965) and Kaufman and Levy 
(1966) obtained just the opposite re- 
sults: the slopes of comparable curves 
were steeper with UP than EP con- 
ditions. 

The present paper reports an inves- 
tigation of the RT-uncertainty rela- 
tionship under EP and UP conditions 
with identical levels of stimulus uncer- 
tainty under the two conditions. A 
direct comparison of RTs obtained 
under EP vs. UP conditions at several 
levels of uncertainty was thus possible. 


METHOD 


Subjects —The Ss (paid volunteers) were 
17 college women, 21-23 yr. of age. 

Apparatus and stimulus materials —The 
stimuli consisted of eight outline geometrical 
figures printed in black ink on white back- 
grounds. The figures were square, rectangle, 
circle, oval, triangle, diamond, crescent, and 
“cone” (an inverted isosceles triangle with 
the base curved), with areas ranging .093- 
.102 sq. in.
These figures were presented in a two-field
tachistoscope, with the reaction stimuli pre- 
sented on the primary field. A new trial 
was initiated every 7.5 sec. by a click serving 
as a “ready” signal. Seven hundred and fifty 
milliseconds following the click the tachisto- 
scope was switched to the primary field ex- 
posing the stimulus figure. The Ss were 
instructed to name the figure orally into a 
microphone as quickly as possible, without 
making errors, when it appeared. This re- 
sponse activated a voice key which termi- 
nated the stimulus exposure and stopped a 
chronoscope that had been activated coin- 
cidentally with appearance of the stimulus 
figure.

Design and procedure—Nine of the Ss 
were assigned to an EP condition in which 
lists of 2, 4, and 8 equally probable stimuli 
were presented in separate blocks of trials on 
each of three consecutive daily sessions. The 
stimulus probabilities for the three lists were 
thus .50, .25, and .125 corresponding to stimulus
uncertainty values, as defined above, of
1, 2, and 3. Within each session the three 
different lists were presented in separate 
blocks of trials, with order of presentation 
of these blocks counterbalanced across Ss 
within sessions and for individual Ss over 
the 3 days. The stimuli within a given block 
(list size) were presented in a quasi-random 
sequence, with each stimulus appearing 20 
times. The Ss were assigned different, ran- 
domly selected, sets of stimuli for the two- 
and four-item lists, with the restriction that 
the four-item list always included the stimuli 
in the two-item list for a given S. Each S 
received the same stimulus lists during each 
of the three sessions. Sixteen warm-up
trials on the eight-stimulus list were given at 
the beginning of each session. At the begin- 
ning of each trial block containing the differ- 
ent list sizes, S was told exactly which stim- 
uli could occur during the ensuing block of 
trials. Only those two stimuli common to all 
three list sizes were included in the data 
analyses.

Eight Ss were assigned to a UP condition 
in which a single set of four stimulus figures 
with unequal probabilities was presented to 
each S. The stimulus probabilities were 
chosen to be equal to those in the EP con- 
dition, viz., .50, .25, .125, and .125 for the 
four stimuli. The eight stimulus figures 
were assigned to Ss such that each was the 
.50 stimulus for one S, the .25 stimulus for 
one S, and was one of the two .125 stimuli 
for each of two Ss. Each S participated in 
three consecutive daily sessions with the 
same set of stimuli presented in each session. 


During each of these sessions the .50 stimulus 
appeared a total of 140 times, the .25 stimu- 
lus appeared 70 times, and each of the .125 
stimuli appeared 35 times. At the beginning 
of a session S was told the relative frequen- 
cies of occurrence of the four stimuli, and 
16 warm-up trials were given with these 
relative frequencies. A 30-sec. rest period 
followed each 40 trials for both EP and UP 
groups. 
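
The trial counts given above can be checked with a few lines of arithmetic. The sketch below assumes that warm-up trials and repeated error trials are excluded from the totals; the variable names are illustrative.

```python
from fractions import Fraction

# UP condition: presentations per session of the .50, .25, .125, and .125 stimuli.
up_counts = [140, 70, 35, 35]
up_total = sum(up_counts)
up_freqs = [Fraction(count, up_total) for count in up_counts]

# EP condition: lists of 2, 4, and 8 stimuli, each stimulus shown 20 times per block.
ep_list_sizes = [2, 4, 8]
ep_total = sum(size * 20 for size in ep_list_sizes)

print(up_total, [str(f) for f in up_freqs], ep_total)
```

Both conditions therefore comprise 280 scored trials per session, and the UP counts reproduce the intended relative frequencies exactly.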

If S gave a wrong response on any given 
trial, the stimulus figure presented on that 
trial was repeated at the end of the session. 
No S made more than one error during any 
280-trial session. This slightly lower error 
rate than that reported by Morin, Konick, 
Troxell, and McPherson (1965) was probably
due to a strong emphasis in the instructions
to S that he respond accurately.

Medians were obtained from each S’s dis- 
tributions of RTs to individual stimuli for 
each session at each level of stimulus proba- 
bility. Means were computed of the medians 
for the two stimulus figures common to all 
three of the EP lists and for the two .125 
stimuli in the UP list, and these values, along 
with the medians of the RTs to the .50 and 
.25 stimuli under the UP condition, were 
entered into an analysis of variance. Within-Ss
factors in this analysis were sessions
and stimulus uncertainty, and a between- 
Ss factor was EP vs. UP. 
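
The reduction from raw RTs to the values entered into the analysis (a median per stimulus, then a mean of medians where two stimuli share a probability level) can be sketched as below. The RT values and stimulus labels are hypothetical; only the two-step summary follows the description above.

```python
from statistics import mean, median


def summarize(rts_by_stimulus):
    """Return the median RT for each stimulus and the mean of those medians."""
    medians = {name: median(rts) for name, rts in rts_by_stimulus.items()}
    return medians, mean(list(medians.values()))


# Hypothetical RTs (msec.) for the two .125 stimuli of one S in one session.
session = {
    "square": [520, 540, 610],
    "circle": [500, 530, 560, 700],
}
medians, mean_of_medians = summarize(session)
print(medians, mean_of_medians)
```

Using medians within each S limits the influence of occasional very long RTs, such as the 700-msec. value in the hypothetical data, before scores are averaged.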

Because of the different numbers of trials 
from which the dependent variates were derived
(varying from 40 for all EP conditions
to 140 for the .50 stimulus under the UP
condition), the homogeneity of variance assumption
necessary for analysis of variance
was not strictly satisfied; however, the extent 
of nonhomogeneity introduced by this source 
is probably not sufficient to invalidate any 
inferences based on the analysis of variance 
(cf. Scheffé, 1959, pp. 351 ff.). 


RESULTS AND DISCUSSION 


Means of the individual median 
choice RTs obtained under EP and 
UP conditions are plotted in Fig. 1 as 
functions of stimulus uncertainty. Pre- 
liminary analyses of variance indicated 
that RT varied significantly as a func- 
tion of uncertainty for both the EP 
group, F (2, 16) = 56.32, p < .01,
and the UP group, F (2, 14) = 15.56,
p < .01; in the main analysis, the
Group (EP vs. UP) X Uncertainty
interaction was significant, F (2, 30) =
8.84, p < .01. The differentially posi- 
tive slopes of the two functions in Fig. 
1 are thus considered reliable. 

In general, mean choice RT de- 
creased significantly over days, F (2, 
30) = 36.66, p < .01, but this practice 
effect did not interact significantly with 
task (EP vs. UP), F (2, 30) = 1.04, 
or uncertainty, F (4, 60) = 1.56, nor 
was the triple interaction, Practice X 
Uncertainty X Task, significant, F (4, 
60) =.92. Thus no evidence was 
found that the different functions in 
Fig. 1 are due to different effects of 
practice under the two conditions. 

The Task X Uncertainty interaction 
is in accord with the expectation—but 
not the data—reported by Lamb and 
Kaufman (1965). These investigators 
proposed that an S can decrease his
overall choice RT by heightening his 
readiness to respond to those stimuli 
with higher probabilities of occurrence. 
This strategy is effective, however, 
only to the extent that S’s subjective 
probabilities associated with the various
stimuli correspond to the objective


Fig. 1. Choice RT with equally probable
and unequally probable stimuli as functions
of stimulus uncertainty, where stimulus uncertainty
is equal to the negative logarithm
(Base 2) of the probability of occurrence of
the stimulus on any given trial.


probabilities. In these terms, the re- 
sults reported here suggest that, al- 
though S tends to reduce his overall 
RT under UP conditions by adopting 
a set to respond to the most likely 
alternatives, he tends not to take full 
advantage of objectively different prob- 
abilities: He tends to treat the stimuli 
as having more nearly equal probabil- 
ities of occurrence than is objectively 
true. Data reported by Kaufman and 
Lamb (1966) suggest that this ten- 
dency breaks down at extreme stimu- 
lus probabilities, however. When they 
introduced a stimulus with relative 
frequency of .90, choice RT to that 
stimulus showed a sharp decrease com- 
pared to those at slightly lower relative 
frequencies. Under EP conditions, of 
course, the efficiency of S’s set to re- 
spond to particular stimuli simply de- 
creases with greater numbers of al- 
ternatives. 

The present results are of interest in 
connection with an hypothesis pro- 
posed by Kornblum (1967). Korn- 
blum noted that choice RTs on trials 
on which a stimulus is the same as on 
the immediately preceding trial tend to 
be shorter than those on nonrepeat 
trials (Bertelson, 1963), and suggested 
that, since the proportion of repeat 
trials decreases as the number of EP 
alternatives increases, the overall 
longer mean RTs observed with longer 
list lengths might be due simply to 
smaller relative frequencies of stimulus 
repetitions on successive trials. In its
simplest form, this hypothesis predicts 
that the relation of choice RT to un- 
certainty should be the same under EP 
and UP conditions, since the expected 
relative frequencies as functions of 
probabilities are the same under the 
two conditions. The interaction shown 
in Fig. 1 is contrary to this prediction, 
however, suggesting that variation in
the proportions of repeat trials is not
the only factor determining the mean
RT-uncertainty relation.
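
The repetition argument can be illustrated numerically. The sketch below assumes that stimuli are sampled independently from trial to trial, which only approximates the quasi-random sequences actually used; the function name is illustrative.

```python
def repeat_probability(probs):
    """Chance that a trial repeats the preceding stimulus: the sum of squared probabilities."""
    return sum(p * p for p in probs)


# Equally probable lists of 2, 4, and 8 alternatives.
for n in (2, 4, 8):
    print(f"EP, {n} alternatives: {repeat_probability([1 / n] * n)}")

# Unequal-probability list used in the present study.
up_probs = [0.50, 0.25, 0.125, 0.125]
print(f"UP list: {repeat_probability(up_probs)}")
```

Under the same assumption, a trial showing a stimulus of probability p is a repeat with probability p whether that stimulus belongs to an EP or a UP list. This is the sense in which the simplest form of the hypothesis expects the same RT-uncertainty relation under the two conditions.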

Differences between repeat and non- 
repeat trials were explored further by 
computing separate means for the two 
types of trials within each Uncertainty 
X Session combination for each S. 
These means were then entered into 
a four-factor analysis of variance with 
repeat vs. nonrepeat, uncertainty, ses- 
sions, and EP vs. UP as factors. All 
effects significant in the analysis de- 
scribed above were also significant in 
this second analysis, of course, and in 
addition, there was a significant re- 
peat vs. nonrepeat effect, F (1, 15) = 
58.01, p< .01 (mean RT for repeat 
trials was 590 msec. compared to 632 
msec. for nonrepeat trials) ; a signifi- 
cant Repeat vs. Nonrepeat X Uncer- 
tainty interaction, F (2, 30) = 10.35, 
p <.01, was found; and the triple in- 
teraction, Repeat vs. Nonrepeat X Un- 
certainty X EP vs. UP, was signifi- 
cant, F (2, 30) =7.43, p < .01. 

The nature of the two interactions 
may be seen from the data in Table 1: 
The obtained average slope of the RT- 
uncertainty function was steeper for 
nonrepeat trials than for repeat trials, 
and this interaction was due almost en- 
tirely to the results from the EP group. 


TABLE 1


MEAN CHOICE RT AS FUNCTIONS OF
STIMULUS UNCERTAINTY ON REPEAT
AND NONREPEAT TRIALS UNDER
EP AND UP TASK CONDITIONS


Task | Repeat Trials (msec.), Uncertainty | Nonrepeat Trials (msec.), Uncertainty
     | 1   | 2   | 3   | 1   | 2   | 3
EP   | 535 | 616 | 658 | 538 | 666 | 728
UP   | 537 | 593 | 601 | 583 | 618 | 659

From these data it thus appears that 
(a) stimulus probability, or uncer- 
tainty, affects choice RT differentially 
under EP and UP conditions, and (b) 
the form of this differential effect is 
the same for repeat and nonrepeat 
trials, though the quantitative details 
differ for the two types of trials.
RECALL AND RECOGNITION MEASURES OF THE
VON RESTORFF EFFECT IN SERIAL LEARNING* 


JOHN P. McLAUGHLIN 


University of Delaware 


To determine whether the von Restorff effect occurs in the response-
acquisition stage of learning, a vivid item (V) was included in 9-item
lists which were learned by the serial-anticipation method for 1, 2, or
3 trials. After 2 and after 3 trials, V was facilitated relative to control
items in tests of free recall. No such effect was found in recognition
tests which substituted for or followed the recall tests. The
data indicated that vividness affects search and retrieval processes.


Since von Restorff’s (1933) original 
report that a vivid item (V) was facili- 
tated in retention tests, numerous in- 
vestigators (cf. Wallace, 1965) have 
found that the learning of V is en- 
hanced in paired-associate and serial- 
anticipation tasks. Horowitz (1962) 
has suggested that this facilitation due 
to vividness occurs primarily in the 
associative stage rather than in the re- 
sponse-acquisition stage of learning, 
citing evidence that low similarity 
among responses inhibits response ac- 
quisition, but facilitates the formation 
of S-R associations. 

While there is evidence that the von 
Restorff effect occurs when necessary 
response acquisition is minimal (Jensen,
1962), other investigators have
reported data which suggest that vividness
also enhances response acquisition.
Using a low meaningful (m)
paralog as V in a series of high m disyllables,
Newman and Saltz (1958)
reported that V appeared as an overt
intrusion error on the first trial more
often than its control. In a later study,
Saltz and Newman (1959) presented
Ss with one serial-anticipation training 
trial and then tested for learning with 
a controlled association test. The 
stimulus terms in the test were list 
members and Ss were instructed to re- 
spond with any other item from the 
series. V was elicited more frequently
than other items, leading the investi- 
gators to the conclusion that the effect 
of vividness was to increase the ten- 
dency to emit V as a response uncon- 
nected to any specific stimulus (Saltz 
& Newman, 1959). 

McLaughlin (1966), however, us- 
ing a red upper-case consonant-vowel- 
consonant trigram (CVC) in a series 
of black lower-case CVCs, failed to 
find sufficient intrusions of V to support
Saltz and Newman’s (1959) contention.
It might have been that intrusion
errors were not a sufficiently
sensitive measure of the acquisition of 
CVCs as responses, or it may be that 
all operations defining vividness do 
not in fact produce the same effects on 
learning. 

To pursue this problem, the present 
experiment used tests of response acquisition
other than intrusion errors.
The purpose of the present study was 
to assess the effects of vividness on the 
early stages of the serial learning of 
CVC material using free recall and
recognition measures after serial training,
in addition to the more standard
anticipation responses. 


METHOD 


Subjects—Ninety-six undergraduate vol- 
unteers served as Ss in the experiment. 
None had previously participated in such 
experiments. 

Apparatus—Nine CVCs with an average 
association value of 32% were selected from 
Archer’s (1960) standardization. Radio- 
mats were used to prepare a 2 X 2 in. slide 
for each item which was exposed by a Ko- 
dak Carousel projector, controlled by Hunter 
timers. Size and shape of the letters were 
varied to produce vividness; the projected 
nonvivid items were lower-case pica symbols 
averaging 1.5 in. in height and the V was 
in 4-in. high upper-case bold type. 

Procedure—Seventy-two Ss were divided 
equally into three groups. Each group re- 
ceived serial-anticipation instructions and 
then learned a nine-item list. One group re- 
ceived one training trial, the second group 
two trials and the third group three trials. 
In each group, 12 Ss (experimental Ss) 
learned lists with V in the sixth position 
and the remaining 12 Ss (control Ss) learned 
lists in which no item was vivid. This de- 
sign then includes six groups of 12 Ss each, 
two conditions by three amounts of training. 

Immediately after the last training trial 
each S was instructed to recall as many 
items from the list as possible, disregarding 
order. After a 1 min. period, they were 
given a list of 50 CVCs selected from 
Archer’s (1960) study including the original 
nine items. This list was typed in upper- 
case pica symbols. Each S was given 1 
min. to circle any items they recognized
from the original list. Anticipation re- 
sponses were also obtained from the four 
groups receiving two and three training 
trials.

To minimize any idiosyncratic effects of 
specific items, four random orders of the 
nine-item list were used. One order was 
given to each subset of three Ss in every 
group. All Ss were familiarized with the 
anticipation procedure by being required to 
learn a list of four common words before the 
experiment proper. The Ss were individually 
tested and were assigned to groups on the 
basis of their order of appearance in the 
laboratory. Order of assignment was coun- 
terbalanced with respect to group and or- 
der of items. 


In the presentation of the
nine-item series, each item was exposed for
2.6 sec., with an interitem interval of .7 sec.,
and an intertrial interval of 6.6 sec.

An additional 12 experimental and 12 con- 
trol Ss were given two training trials, fol- 
lowed immediately by the recognition test 
to assess the effects of prior recall on recog- 
nition in this situation. These groups served 
after the completion of the first part of the 
experiment, but were treated in the same 
manner as the original 72 Ss except for the 
absence of the recall test. 


RESULTS 


Figure 1 shows the frequency of 
correct responses for the recall and 
recognition measures for each of the 
groups as a function of item serial 
position. Recall (RL) frequency is 
presented in the coordinates to the left 
and recognition (RN) frequency to 
the right. The numbers following the 
RL or RN notation in each cell repre- 
sent the number of preceding training 
trials. Inspection of the recall func- 
tions indicates that V was not particu- 
larly favored after one training trial, 
but that its recall considerably ex- 
ceeded that of its control item after 
two and after three presentations of 
the series. Both differences reached 
statistical significance, χ² (1) = 4.69,
p < .05, and χ² (1) = 4.29, p < .05,
for the RL-2 and RL-3 cells, respec- 
tively. 
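
A comparison of this kind, recall of V by experimental Ss against recall of the control item by control Ss, can be sketched as a chi-square test on a 2 X 2 table. The counts below are hypothetical, since the cell frequencies are not given in the text, and the sketch omits a continuity correction; whether one was applied in the reported tests is not stated.

```python
def chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Pearson chi-square (1 df, no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Hypothetical counts for two groups of 12 Ss (illustrative only, not the reported data).
recalled_v, missed_v = 10, 2               # experimental Ss and the vivid item
recalled_control, missed_control = 5, 7    # control Ss and the corresponding item

chi_square = chi_square_2x2(recalled_v, missed_v, recalled_control, missed_control)
CRITICAL_05_DF1 = 3.841  # critical value of chi-square for 1 df at the .05 level
print(f"chi-square (1) = {chi_square:.2f}, significant at .05: {chi_square > CRITICAL_05_DF1}")
```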

No such effect is apparent in the 
recognition data. Values for Vs ex- 
ceeded those for the controls only for 
the groups receiving two training trials 
and those differences were not signifi- 
cant. 

Anticipation responses were ana- 
lyzed to determine if V or its control 
item had become associated to any 
intralist stimulus during training. 
Neither V nor its control appeared as 
correct or incorrect anticipation responses
on the second trial for those
Ss receiving either two or three training
trials. In the experimental group

Fig. 1. Total correct responses as a function
of item serial position for recall (RL)
and recognition (RN) measures taken after
1, 2, or 3 serial-anticipation training trials
for experimental and control groups.


receiving three training trials, two Ss 
anticipated V correctly and one S used 
V as an incorrect response on the third
trial.

Other items in the experimental 
series were compared to their control
counterparts to determine if V’s presence
affected the learning of the remainder
of the list. No significant
differences in frequency of correct re- 
sponse between corresponding items 
were found for any of the response 
measures. In addition, there were no 
differences between the experimental 
and control groups in total correct re- 
sponses, recall or recognition, when 
the values for V and its control item 
were excluded. 

The RN-2 cell of Fig. 1 presents 
the recognition scores taken immedi- 
ately after two training trials and those 
taken after the 1-min. recall period. 
No differences in total correct recog- 
nitions were detected between the two 
experimental nor between the two con- 
trol groups. 


Discussion 


The major finding of this study was 
that V was more available in free recall 
than its control item. It was also found 
that V and its control were equally 
available in recognition. These results 
indicate that vividness affects processes 
of search and retrieval and may not affect 
the strength of stored information. 

The facilitation of V in recall could 
indicate that the intraitem associative 
bonds of the V trigram are stronger than 
those of the corresponding control item. 
These data could also indicate that the 
search processes used in free recall are 
such that V is retrieved more efficiently 
than its control. 

The absence of a facilitation of V in 
the recognition data does not support the 
former interpretation. If a recognition 
test is more sensitive than recall, one ex- 
pects that the existence of weaker as- 
sociative bonds would lead to more cor- 
rect responses on recognition tests than 
on recall tests. Such an increase in 
correct response on the recognition tests 
was found for the control item, but a 
similar increase was not obtained for V. 
It could be argued that the frequency of 
correct recognition of V was retarded by 
a ceiling effect, but the obtained values
for V rarely approached their maximum
possible values. It seems more likely 
that the recognition test retarded the 
retrieval of V. The appearance of V 
and the other items changed on the recog- 
nition test in that all items on that test 
were printed in uppercase pica symbols. 
Either the change in shape of V per se, 
or its similarity in shape to the other 
items on the test may have prompted in- 
appropriate search processes for that 
item. 

The facilitated recall of V is consistent 
with the essentials of the conclusion 
reached by Saltz and Newman (1959). 
Vividness enhanced the availability of a
response before that response was associated
with its intralist stimulus. Horowitz’s
(1962) suggestion was not supported
by the data of the present experiment.


RECALL OF A SERIAL LIST AS A FUNCTION OF
AROUSAL AND RETENTION INTERVAL


BARBARA S. UEHLING AND ROBERT SPRINKLE
Emory University 


Although arousal level, varied during learning, has been shown to 
influence retention of verbal materials, arousal effects only at the time 
of recall have not yet been investigated. In the present study arousal 
level, immediately prior to recall of a serial list, was manipulated by 
(a) instructions to relax, (b) induction of muscle tension, and (c) 
presentation of bursts of white noise (80 db.). 3 retention intervals 
(immediate, 24 hr., and 1 wk.) were employed for each arousal condition.
No differences were found among arousal conditions for immediate
recall; however, white noise facilitated recall when measured
at 24 hr. or 1 wk. Results are interpreted in terms of general arousal 
produced by activity of the brainstem reticular formation which may
be expected to facilitate performance in many situations.


The concept of arousal was initially 
introduced into psychological literature 
to refer to the neurophysiological pat- 
tern of increased activity in the brain- 
stem reticular formation and EEG ac- 
tivation and to the behavioral arousal 
accompanying the neural change 
(Lindsley, 1951). Since its introduc- 
tion, the concept of arousal has been 
variously applied. Stimulus input 
variables such as auditory and visual 
stimulation, which have been shown to 
produce neurophysiological and be- 
havioral arousal, provide one opera- 
tional definition of arousal. Arousal 
has also been used to refer to affec- 
tively toned material which is judged 
to be arousing in its referent or asso- 
ciative connections. Still another use 
of arousal is in reference to peripheral 
measures which vary with stimulus
inputs shown to produce the neural 
pattern of arousal. Illustrative of the 
latter are muscle action potential and 
skin resistance, both of which have 
been shown to vary with white-noise 
input (Berlyne & Lewis, 1963; Davis, 
1948).


The effects of arousal during acquisition
of verbal materials on immediate
and later recall have been investigated 


using all of the above definitions of 
arousal. Berlyne and Lewis (1963), 
in varying white-noise (WN) presen- 
tation during acquisition of verbal ma- 
terials, have shown equivocal effects on 
immediate retention (IR). One study 
(Berlyne, Borsa, Craw, Gelman, & 
Mandell, 1965) resulted in poorer IR 
of items paired with WN during ac- 
quisition, while a later study (Berlyne, 
Borsa, Hamacher, & Koenig, 1966), 
which differed procedurally, showed 
no difference in IR between WN and 
non-WN items. However, in both in- 
vestigations, retention of WN-accom- 
panied items was superior to items 
not paired with WN when retention 
was measured 24 hr. later.

Similar findings suggesting an inter- 
action between arousal and time of 
retention have been found in studies 
defining arousal as affectively toned 
material and as GSR to these materials 
and to nonsense syllables (Kleinsmith 
& Kaplan, 1963; Kleinsmith & Kaplan, 
1964; Walker & Tarte, 1963). Again, 
poorer IR but superior later retention 
was found for high arousal materials. 

In none of the studies reported 
above have the effects of arousal only 
at the time of recall been studied. It
is possible that arousal at recall is sufficient
to produce retention differences
which would make unnecessary the 
postulation of arousal effects on con- 
solidation of memory traces during 
learning. The present investigation 
varied arousal immediately prior to 
recall where arousal was defined as 
auditory input (WN) and as induced 
muscle tension. The rationale for de- 
fining muscle tension as arousal was 
twofold. First, as suggested above, 
muscle tension has been shown to vary 
with WN, and secondly, peripheral 
nerve stimulation has been shown to 
directly produce neurophysiological 
and behavioral arousal (Segundo, 
Arana, & French, 1955). In view of 
the earlier findings suggesting an in- 
teraction of arousal during learning 
with retention interval, another variable,
retention interval, was included to
determine if arousal conditions might 
have differential effects related to time 
of measurement. 


METHOD 


Subjects and materials.—A total of 108
introductory psychology students from the 
winter and spring quarters served as Ss. A 
10-item serial list consisting of disyllabic 
English adjectives selected for their low 
arousal properties comprised the learning 
task. Selection of items was made on the 
basis of a rating procedure using 38 Ss from 
the fall introductory psychology class. Each 
of 50 disyllabic English words was rated on 
five semantic differential scales, the scales 
used being chosen for their apparent rela- 
tionship to arousal and activation. Nine 
points were used for each scale, the ends 
of which were denoted as (a) strong-weak, 
(b) good-bad, (c) pleasant-unpleasant, (d) 
angular-rounded, (e) fast-slow. The 10 
most neutral items on the basis of ratings 
on all five scales were selected. Obtained 
mean ratings for these items were 4.8, 4.8, 
4.9, 5.2, and 5.6 on the above scales, respec- 
tively. 

Procedure.—All experimental sessions
were conducted in a small, sound-attenuated 
room with a large desk on which a memory
drum was placed. A screen blocked the
view of E by S during learning and recall 
trials and during arousal conditions. Pre- 
sentation rate for both learning and recall 
was 2 sec., with a 4-sec. intertrial interval.
Learning to the criterion of one perfect an- 
ticipation trial was used. All experimental 
sessions were conducted in the afternoon. 
To establish equivalence of initial learning, 
Ss were assigned to retention groups on the 
basis of trials to criterion. 

Three retention intervals were employed: 
(a) immediate recall (IR), (b) 24-hr. recall 
(24 R), and (c) 1-wk. recall (Wk R). 
An interval of 2-3 min. elapsed between 
learning and IR. Those Ss in the 24 R and 
Wk R reported for the second phase of the 
experiment at the same time on the appro- 
priate day. Within each of the three re- 
tention intervals, three arousal conditions 
were used, thus providing nine groups (12 
Ss each). Arousal was manipulated by the 
activity during the first 3 min. of the reten- 
tion session, this activity being either (a) 
low arousal (LA), (b) muscle tension 
arousal (MTA), or (c) white-noise arousal 
(WNA). Instructions to LA Ss were: 

Sit in this chair and relax for the next
few minutes. Most people find it easiest
to relax by getting in a comfortably
erect posture, closing their eyes, and
letting their minds wander with muscles
at ease. I’ll tell you when to stop relaxing.

Instructions to MTA Ss were:

Sit in this chair and exercise at a moderately
fast pace with these muscle tensors
for a few minutes. Don’t tire yourself
out, but keep active. I’ll tell you
when to stop.

The Ss in the WNA conditions were given 
the same instructions as used in the LA con- 
dition, but during the interval a tape re- 
cording of irregular intervals of WN (80 
db.) was played. At the end of 3 min. all 
Ss were instructed to recall the words 
learned in the previous session. Relearning 
trials were given until Ss reattained the cri- 
terion of one perfect trial or for a maximum 
of five trials. 
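
The design just described can be checked with a little arithmetic. The sketch below is only an illustration, not part of the original procedure: it enumerates the nine groups and derives the degrees of freedom for a two-way between-subjects analysis of variance with equal cell sizes. The function name `factorial_df` and the variable names are assumptions introduced here; the group labels and the 12 Ss per group come from the text.

```python
from itertools import product

# Factor levels as named in the text.
RETENTION = ["IR", "24 R", "Wk R"]
AROUSAL = ["LA", "MTA", "WNA"]
SS_PER_GROUP = 12


def factorial_df(levels_a, levels_b, n_per_cell):
    """Degrees of freedom for a two-way between-subjects ANOVA
    with equal numbers of Ss in every cell (hypothetical helper)."""
    cells = levels_a * levels_b
    total_n = cells * n_per_cell
    return {
        "A": levels_a - 1,
        "B": levels_b - 1,
        "AxB": (levels_a - 1) * (levels_b - 1),
        "error": total_n - cells,
        "total_n": total_n,
    }


groups = list(product(RETENTION, AROUSAL))
df = factorial_df(len(RETENTION), len(AROUSAL), SS_PER_GROUP)
print(len(groups), df)
```

Under these assumptions the total of 108 Ss is recovered, and the error term carries 99 df, which agrees with the F (2, 99) values reported for the two main effects.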


RESULTS 


All Ss required at least two trials
to reattain the criterion. Hence, two
measures of recall were analyzed: (a)
mean number correct items on Recall
Trial 1 and (b) mean number correct
on Recall Trials 1 and 2. Since nearly 
identical relationships and statistical 
results were found using both response 
measures, only analyses employing 
mean number correct on Recall Trial 1 
(see Fig. 1) will be reported. An 
analysis of variance performed on 
these data resulted in a significant 
main effect of time of retention, F (2, 
99) = 23.98, p < .001, and of arousal 
conditions at the time of recall, F (2, 
99) = 4.80, p < .025; however, the
interaction was not significant. Pairs 
of treatment means within the overall 
analysis were compared by the New- 
man-Keuls method (Winer, 1962). 
As expected from inspection of the 
figure, no differences in IR occurred 
among arousal conditions. For 24 R, 
WNA differed significantly from both 
MTA and LA conditions (p < .01), 
while for Wk R, it differed signifi- 
cantly only from LA (p < .01). The 
apparent difference between MTA Wk 
and LA Wk is not significant, however.
Thus, it appears that WNA, when
compared with LA, has a significant
facilitatory effect on later retention
but not on IR, while MTA does not
differ from LA for either IR or later
retention.

Fig. 1. Mean number correct responses on Recall Trial 1 for each arousal condition. (Ordinate: mean correct recall, Trial 1; abscissa: retention interval, IR, 24 hr., and 1 wk.)


Discussion 


Failure to find effects of arousal condi- 
tions presented immediately prior to re- 
call upon IR is consistent with those 
previous studies reporting no effect on 
IR of manipulations of arousal during 
learning. Facilitation of later retention 
measures by WN presentation found in 
the present study is consistent with those 
studies reporting greater recall of high 
arousal items in later retention tests. 
However, present results suggest that 
arousal immediately prior to the reten- 
tion test is sufficient to produce this 
facilitation, although arousal during 
learning or immediately after has not dif- 
fered. These data would seem to best be 
interpreted in a general arousal frame- 
work where sensory input to the brain- 
stem reticular formation can be ex- 
pected to facilitate performance in a 
variety of tasks. That facilitation is 
evident only in later retention measures 
can be accounted for by two factors.
First, performance for IR is already near 
an optimum so demonstration of facilita- 
tion is unlikely. However, with later 
retention, forgetting will have occurred 
so facilitative effects on the lowered re- 
sponse strength can be demonstrated. In 
addition, in the IR situation, attention or 
arousal should already be high since S has 
just undergone a learning procedure; 
whereas with later measures, arousal 
on entering the experimental room may 
be expected to be lower, hence facilitation 
due to sensory input would be expected. 

In designing the study, it was reasoned 
that stimulation from muscle tension even 
without precise quantification of the de- 
gree of tension would be sufficient to 
produce neurophysiological and behavioral
arousal. Failure to find increased
recall with MTA, however, may be
due to lack of quantification of the degree
of tension. This explanation is
supported by a comparison of the findings 
of the present study with those of Bourne 
(1955) who studied effects of muscle ten- 
sion induced by squeezing a hand dyna- 
mometer during learning, retention, dur- 
ing both, or neither. Facilitation of 
IR was found when tension was varied 
during recall, although no facilitation in 
recall occurred when tension was varied 
during learning. Of the several pro- 
cedural differences between Bourne’s 
study and the present one, perhaps the 
most relevant is that each S in the 
Bourne study was well practiced on the 
hand dynamometer and was required to 
squeeze to one-third of his maximum 
grip on each tension induction. In the 
present study, Ss had no prior practice 
and varied considerably in their apparent 
effort to squeeze the dynamometer. 
PAIRED-ASSOCIATE ACQUISITION:


SOME EFFECTS OF INTER- AND INTRAPAIR SIMILARITY * 


CHARLES P. THOMPSON AND DEAN E. FRITZLER 


Kansas State University 


that the effect of formal intrapair similarity can be modified by the
predictability of the repeated letter. The location of the within-pair
repeated letter does not affect the total number of errors but it does
affect the distribution of the types of errors made by S.


Although a large number of experi- 
ments have investigated the effects of 
similarity among stimulus or response 
members on paired-associate (PA) ac- 
quisition, relatively little effort has 
been expended on investigation of the 
effects of similarity between stimulus 
and response members on PA acquisition.
Nevertheless, there is substantial
evidence to suggest that S-R similarity
has powerful effects which may
either markedly facilitate or markedly 
inhibit PA acquisition. Further, the 
evidence suggests that there are at 
least three variables which may deter- 
mine the direction and magnitude of 
the effects produced by S-R similarity. 

The first, and probably the most
powerful, variable is the location (within
or between pairs) of the S-R similarity.
Interference may be expected
when items of different pairs are similar,
but facilitation may be expected
when items of the same pair are similar.
For example, it is clear that complete
intrapair item similarity (i.e.,
identity) would make a list trivially
easy. On the other hand, a double-function
PA list (in which each item
serves, in different pairs, as both a
stimulus and a response) has been
demonstrated (Umemoto & Hilgard,
1961) to be much more difficult than
a single-function PA list containing
the same number of pairs.

The second variable which may
modify the effects of S-R similarity is
the location in the stimulus term of the
component which is repeated (or approximated)
in one of the response
terms. For an illustration, consider
a trigram-trigram PA list with the
stimulus trigrams picked to insure
stimulus selection and with one of the
letters in each stimulus term also appearing
as a letter in one of the response
terms. Since the evidence (e.g.,
Jenkins, 1963; Postman & Greenbloom,
1967) rather overwhelmingly suggests
that the first letter of the stimulus trigram
will be selected as the functional
stimulus, it seems reasonable to assume
that repetition of this letter will
produce the most facilitatory (letter
repeated within pairs) or inhibitory
(letter repeated between pairs) effects.
Repetition of the third letter of the
stimulus, on the other hand, should
have relatively little effect.

The third variable which may determine
the magnitude of S-R similarity
effects is the predictability of
the repeated element(s). Several experiments
(e.g., Brogden & Schmidt,
1954) have shown that task difficulty 
increases as the degree of response un- 
certainty increases. In the case of in- 
trapair S-R similarity, response un- 
certainty is determined, in part, by the 
predictability of the location in the 
stimulus and response terms of the re- 
peated element(s). Again, consider a 
trigram-trigram PA list for illustra- 
tive purposes. If the S-R similarity 
is located within pairs, it is clear that 
increasing the predictability of the re- 
peated letter(s) should increase the 
ease with which the list is learned. If 
the S-R similarity is located between 
pairs, the concept of predictability is 
meaningless except for the special case 
in which two or more letters from a 
single stimulus term appear in a sin- 
gle response term. In this case, in- 
creased predictability should again de- 
crease response uncertainty and, there- 
fore, increase the rate at which the 
PA list is learned. 

The experiment reported in this 
paper is an initial investigation of the 
three variables hypothesized above. 
Trigram-trigram PA lists with formal 
similarity between the stimuli and re- 
sponses were used in the study. 


METHOD


Subjects.—One hundred and twenty undergraduates
enrolled in general psychology
classes at Kansas State University served 
as Ss. 

Materials and experimental design.—Lists 
of 10 CVC trigram pairs were used in each 
of the five experimental conditions and the 
one control condition in the experiment. For 
the experimental conditions, all letters of 
the alphabet were used in generating the 
10 CVC stimuli with each of the consonants 
used once and each of the vowels (including 
Y) used at least once but no more than 
twice. The response trigrams were then
generated by using all letters in the same
location that they had in the stimulus terms
(i.e., first, second, or third letter) with the
restriction that no two letters appearing in
any one S term appear in any one R term.
The five experimental conditions consisted
of one condition in which all the S-R simi- 
larity was located between, rather than 
within, pairs (Cond. B) and four conditions 
in which one of the letters in the S term 
was repeated in the R term of that pair.
The four intrapair similarity conditions 
were: Cond. F, first letter of S occurred as 
the first letter of R; Cond. L, last letters 
of S and R were identical; Cond. FL, half 
the pairs had the first, and half the last, let- 
ters of S and R identical; and Cond. U, 
either the first or the last letter of S oc- 
curred as either the first or the last letter 
of R with all four possible combinations 
occurring approximately an equal number of 
times.2 The stimuli and responses for the 
five experimental conditions are given in 
Table 1. The mean association values 
(Archer, 1960) for the stimuli and responses 
in these lists were 25.8 and 40, respectively. 
The control condition (Cond. C) stimuli
were constructed using all the vowels (including
Y) and 10 consonants. Each consonant
was used once as a first letter and
once as a last letter with the restriction that 
the consonant was not repeated in a single 
trigram. The responses for the control list 
were constructed in the same manner using 
all the vowels and the remaining 10 con- 
sonants. The mean association values for 
the stimuli and responses in the control list 
were 33.4 and 31.5, respectively. 
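
The construction rules for the experimental lists lend themselves to a mechanical check. The following sketch is offered only as an illustration under stated assumptions: the two trigram lists are copied from Table 1 as they survive in this text, and the helper names (`stimuli_follow_rules`, `max_shared_letters`, `pair_by_position`) are hypothetical. Pairing by first or last letter is an assumed reading of how Cond. F and Cond. L could be formed from one shared set of items; it is not a statement of the authors' actual procedure.

```python
from collections import Counter

VOWELS = set("AEIOUY")  # the text counts Y as a vowel
CONSONANTS = "BCDFGHJKLMNPQRSTVWXZ"

# Items as they survive in Table 1 of this text.
STIMULI = ["XID", "WOJ", "SYG", "PAZ", "NEB", "LOF", "MAH", "REK", "CUQ", "TYV"]
RESPONSES = ["PYD", "LEJ", "XAG", "NIZ", "MIB", "CEF", "RYH", "WUK", "TOQ", "SUV"]


def stimuli_follow_rules(trigrams):
    """True if every item is CVC, each consonant is used exactly once,
    and each vowel is used once or twice across the list."""
    for t in trigrams:
        if len(t) != 3 or t[0] in VOWELS or t[1] not in VOWELS or t[2] in VOWELS:
            return False
    counts = Counter("".join(trigrams))
    consonants_ok = all(counts[c] == 1 for c in CONSONANTS)
    vowels_ok = all(1 <= counts[v] <= 2 for v in VOWELS)
    return consonants_ok and vowels_ok


def max_shared_letters(stimuli, responses):
    """Largest number of letters any one S term shares with any one R term."""
    return max(len(set(s) & set(r)) for s in stimuli for r in responses)


def pair_by_position(stimuli, responses, position):
    """Pair each stimulus with the response carrying the same letter
    at the given position (0 = first letter, 2 = last letter)."""
    lookup = {r[position]: r for r in responses}
    return [(s, lookup[s[position]]) for s in stimuli]


print(stimuli_follow_rules(STIMULI))
print(max_shared_letters(STIMULI, RESPONSES))
print(pair_by_position(STIMULI, RESPONSES, 0)[:3])
```

With these lists, no S term shares more than one letter with any R term, which is the restriction stated above, and every stimulus finds exactly one response with a matching first letter and one with a matching last letter.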
Procedure.—Group data collection procedures
were used and all 20 Ss in Cond.
L, FL, and U were run simultaneously. 
Due to scheduling difficulties, Ss in Cond. 
F, B, and C were run in two groups of 10 
Ss each. All groups of Ss were given stan- 
dard instruction for study-test (recall) PA 
acquisition (e.g., Battig & Brackett, 1961).
When it was clear that all Ss understood the 
instructions, acquisition trials were begun. 
Each pair in the study portion of the trial 
and each stimulus in the test portion of the 
trial were presented separately by E on a 
3 × 5 card at a 4-sec. rate with an intertrial


2 Owing to the manner in which the
stimuli and responses were constructed, it 
was possible to use the same set of stimuli 
and responses for Cond. F, L, and B. Fur- 
ther, since, on an a priori basis, stimulus 
selection was deemed to be possibly a most 
important factor, great care was taken in 
generating a set of low-association value 
stimuli and this set of stimuli was used for 
all the experimental conditions. 


TABLE 1
STIMULI AND RESPONSES FOR EACH OF THE
FIVE EXPERIMENTAL CONDITIONS

Column headings: Stimuli; Responses (Cond. F, Cond. L, Cond. FL, Cond. U, Cond. B)

Stimuli:   XID  WOJ  SYG  PAZ  NEB  LOF  MAH  REK  CUQ  TYV
Responses: PYD  LEJ  XAG  NIZ  MIB  CEF  RYH  WUK  TOQ  SUV
Unplaced entry: NUD


interval of 12 sec. A G. C. Wilson Model
4 repeat cycle timer was used to pace the
presentation of the cards. Twelve different
orders of presentation were used to minimize
serial learning. The order of presentation
of the items was the same for all Ss within
a condition. The lists were presented for
30 trials and Ss wrote the responses for
each trial on a separate page in a small data
booklet.

The data were tabulated in terms of trials
and errors to a criterion of one errorless
trial. In those instances in which S did not
achieve criterion, total errors were tabulated
and S was assigned the arbitrary score of
31 trials to criterion.


RESULTS


Analyses of variance performed on
the trials and errors data demonstrated
that the differences among the groups
were highly significant, F_trials (5, 114)
= 4.72, p < .001; F_errors (5, 114) =
3.79, p < .01. The means for both
trials and errors are given in Table 2
along with the results of subsequent
range analyses (Duncan, 1955). These
results show clearly that Cond. B was
more difficult than Cond. F, L, or FL
with Cond. U and C roughly midway
between these two extremes. The authors
interpret this to mean that location
of similarity (within or between
pairs) was an extremely effective variable
(Cond. B vs. Cond. F, L, FL,
and U), that predictability was also
an effective variable (Cond. U vs. FL),
and that the location in the stimulus
trigram of the repeated letter (Cond.
F vs. L) was not, at least in terms of
these measures, an effective variable.
In order to look more closely at the
nature of the errors made by Ss, the
authors also tabulated the number of
misplaced (but otherwise correct) responses
for each S for each condition
and the number of overt errors on the
repeated letter for each S in the four
intrapair similarity conditions. An
analysis of variance performed on the
misplaced response data demonstrated
that the differences among the groups


TABLE 2

MEAN NUMBER OF TRIALS, ERRORS, AND IMPROPERLY PLACED RESPONSES
AS WELL AS THE MEAN PROPORTION OF IMPROPERLY PLACED
RESPONSES FOR EACH CONDITION



Condition 


L FL Cc U B 
mo | 476. | 479 | 684 | 84i 
[133 | m6 | 173 
se Merle es 2 
| 4,55 | 3.65 | 8.75 
40 | 285 | 250 
Percentage 91 3.5 55 | (9.3 5.2 10.1 
| 


Note—Means underlined with the same line do not differ significantly. 


F 
Mean Errors 38.6 $ 5 
Mean Trials 9.0 | 10.0 | 95 
Improperly Placed Ri | 
Mean ly Placed Responses sea 2.85 2.50 
i 
were highly significant, F (5, 114) =
9.99, p < .001. The mean number of 
misplaced responses in each condition 
is also given in Table 2 along with the 
results of the Duncan range analysis 
on that data. As can be seen, the pat- 
tern of these results almost exactly 
mimics the results for the total errors 
and total trials data. However, it is 
probably more meaningful to consider 
the number of misplaced responses rela- 
tive to the total number of errors for 
each S. Accordingly, the number of 
misplaced responses were converted 
into a percentage-of-total-errors score
for each S and an analysis on these data 
was performed. Again, the means for 
these data and the results of the Dun- 
can range test performed on them are 
given in Table 2. As before, the anal- 
ysis demonstrated significant differ- 
ences among the groups, F (5, 114) = 
5.46, p < .001. Inspection of the or- 
dering of these means strongly suggests 
that the proportion of misplaced re- 
sponses decreases as the number of 
pairs having the first letter of the stimu- 
lus and response in common increases. 
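
The conversion described in this paragraph is a simple per-S ratio. A minimal sketch follows, assuming each S contributes a count of misplaced responses and a count of total errors; the function name and the three example Ss are hypothetical and are not data from the study. Scoring an S with no errors as 0 is likewise an assumption made here to avoid division by zero.

```python
def percent_of_total_errors(misplaced, total_errors):
    """Misplaced responses expressed as a percentage of one S's total errors.
    An S with no errors at all is scored 0.0 (an assumption of this sketch)."""
    if total_errors == 0:
        return 0.0
    return 100.0 * misplaced / total_errors


# Hypothetical (misplaced, total errors) counts for three Ss.
example_ss = [(2, 40), (5, 50), (0, 30)]
scores = [percent_of_total_errors(m, t) for m, t in example_ss]
mean_score = sum(scores) / len(scores)
print(scores, mean_score)
```

The condition means entering the analysis would then be means of such per-S percentages rather than a single ratio of pooled counts.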

The number of overt errors on the 
repeated letter was also converted into 
percentage of total errors for each S
in each of the four intrapair similarity 
conditions, and an analysis of variance 
was performed on the resulting data. 
Again, the differences among the 
groups were highly significant, F (3, 
76) = 15.43, p<.001. The means 
for these data, along with the results 
of a Duncan range test, are given in 
Table 3. 
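
The degrees of freedom attached to the F ratios in this section follow from the group sizes given under Method. The sketch below assumes a one-way between-subjects analysis with 20 Ss per condition; `one_way_df` is a hypothetical helper introduced for illustration.

```python
def one_way_df(n_groups, n_per_group):
    """(between-groups df, within-groups df) for a one-way ANOVA
    with equal group sizes."""
    between = n_groups - 1
    within = n_groups * n_per_group - n_groups
    return between, within


all_conditions = one_way_df(6, 20)   # F, L, FL, U, B, and C
intrapair_only = one_way_df(4, 20)   # F, L, FL, and U
two_groups = one_way_df(2, 20)       # one experimental group vs. control
print(all_conditions, intrapair_only, two_groups)
```

These values agree with the F (5, 114), F (3, 76), and F (1, 38) entries reported in the text.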

The reliable difference between 
Cond. FL and U demonstrates that 
the proportion of overt errors made on 
a repeated letter increases as the pre- 
dictability of the repeated letter de- 
creases. 

The difference between Cond. F and 
L cannot be adequately evaluated with- 

TABLE 3

MEAN PROPORTION OF OVERT ERRORS ON THE REPEATED LETTER FOR THE
REPEATED-LETTER EXPERIMENTAL CONDITIONS AND THE MEAN PROPORTION OF
OVERT ERRORS ON THE FIRST AND LAST POSITION IN THE CONTROL CONDITION

Conditions   |   F   |   L   |  FL   |   U
Experimental |  2.87 | 10.10 | 11.52 | 26.32
Control      | 17.30 | 26.20 |       |

Note.—Means underlined with the same line do not
differ significantly.


out reference to performance on a con- 
trol condition. Therefore, the number 
of overt errors at both the first and 
last positions of the trigram were con- 
verted into percentage of total errors 
for each S in the control condition. 
The mean percentage of overt errors 
for the first and last positions in the 
control condition are also given in 
Table 3. The means for both experi- 
mental conditions were reliably lower 
than the comparable control mean, 
F_first (1, 38) = 21.32, p < .001; F_last
(1, 38) = 15.08, p < .001. However,
the magnitude of the reduction was ap- 
proximately the same for both condi- 
tions since comparison of the difference 
scores (the appropriate control condi- 
tion mean was subtracted from the 
score for each experimental S) for the 
two conditions yielded no reliable dis- 
parity (F <1). Thus, while the re- 
sults demonstrate that the introduction 
of a repeated letter at either the first 
or last position of the trigram reliably 
reduced the proportion of overt errors 
at that position, it does not appear that 
there was a differential reduction in the 
proportion of errors made at the two 
positions. 
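
The size of the two reductions can be read off the means in Table 3. The sketch below works only from those group means, so it is a rough arithmetic check and not the authors' analysis, which compared difference scores computed for each experimental S. The dictionary names are assumptions.

```python
# Mean percentages of overt errors taken from Table 3.
EXPERIMENTAL = {"F": 2.87, "L": 10.10}
CONTROL = {"first": 17.30, "last": 26.20}

# Reduction from the comparable control position, in percentage points.
reduction_first = CONTROL["first"] - EXPERIMENTAL["F"]
reduction_last = CONTROL["last"] - EXPERIMENTAL["L"]

print(round(reduction_first, 2), round(reduction_last, 2))
```

On these means the first-position reduction is about 14.4 points and the last-position reduction about 16.1 points, which fits the statement that the two reductions were of approximately the same magnitude.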


Discussion 


The results demonstrate that similarity 
between the stimulus and response items
of a PA list can, depending on the ar- 
rangement of pairs, either facilitate or 
inhibit PA acquisition. The type and 
magnitude of the effect depends, at least 
in part, on the location of S-R similarity 
(between or within pairs) and, in the case 
of intrapair similarity, the predictability 
of the repeated element. 

In the experimental conditions of the 
present study, all the letters of each stim- 
ulus term appeared in the same loca- 
tion (i.e., first, second, or third letter) 
in the response terms. The authors ex- 
pected this arrangement to produce con- 
siderable interference when all the letters 
were duplicated between pairs. Under 
that condition, incorrect forward and 
backward associations should be gen- 
erated within each response trigram and 
between the stimulus and response tri- 
grams. This expectation was given sup- 
port by the very poor performance re- 
corded for the condition (B) in which 
all the S-R similarity was located between 
pairs. The Ss serving in this condi- 
tion took more trials, made more errors, 
and more frequently misplaced an other- 
wise correct response while learning the 
PA list than did Ss in any other condition.
There can be no question that, in
this case, interpair S-R similarity
retarded acquisition of the PA
list.

It was also expected that location of 
one of the three repeated letters (in each 
stimulus trigram) within, rather than 
between, pairs would produce a relative 
facilitation in PA acquisition. It should 
perhaps be noted at this point that this 
manipulation involves both the removal 
of a between-pair repeated letter and the 
addition of a within-pair repeated let- 
ter. This experiment was not designed 
to distinguish between these two effects 
but, rather, it was designed to demonstrate 
the marked differences in rate of acquisi- 
tion which may occur among lists having 
the same degree of S-R similarity. 

As predicted, performance improved
when repeated letters are located within,
rather than between, pairs, but the degree
to which performance is improved depends
on the predictability of the within-pair
repeated letter. In the two conditions
(FL and U) designed to evaluate the ef- 
fect of predictability, the repeated letter 
could occur in either the first or last posi- 
tions of both the stimulus and response. 
However, in Cond. U the repeated letter 
could occur as one of four possible combi- 
nations whereas in Cond. FL it could oc- 
cur as one of two possible combinations. 
The superiority of Cond. FL is quite clear 
in that Ss took fewer trials and showed a 
tendency to make fewer errors in learn- 
ing the list than Ss in Cond. U. One 
would expect that increased predictability 
would decrease the proportion of errors 
made on the repeated letter; the results 
demonstrate that this is the case. How- 
ever, one would also expect that in- 
creased predictability of the repeated letter 
would reliably reduce the number and pro- 
portion of misplaced responses. The data 
do not support this expectation. 

The data for the two conditions (F 
and L) designed to evaluate the effect 
of within-pair repeated-letter location 
suggest that location also has an effect 
during PA acquisition. To begin with, 
the data for these conditions clearly show 
the facilitatory effect of a completely pre- 
dictable letter. The Ss serving in Cond. 
F and L made fewer errors, took fewer 
trials, and less frequently misplaced other- 
wise correct responses while learning the 
list than did Ss in any other condition 
except Cond. FL. In addition, their data 
show a marked reduction, from the level 
of comparable control positions, in the 
proportion of errors made on the repeated 
letter. The magnitude of the reduction
from the control level is approximately 
the same for both conditions. In the 
above data there is no evidence to suggest
that location of the completely predictable
letter is an effective variable. If
one considers the proportion of mis- 
placed responses, however, quite a dif- 
ferent picture emerges. The proportion 
of misplaced responses in Cond. F is 
reliably lower than the proportion of misplaced
responses in Cond. L. In addition,
considering all conditions, it is
clear that the proportion of misplaced
responses decreases as the number of repeated
letters in the first position increases.
Thus, although location of the
repeated letter does not appear to affect 
total errors, it does affect the distribution 
of the types of errors made. 

In summary, this study demonstrates 
that the location of S-R similarity (be- 
tween or within pairs) is a powerful vari- 
able which can have marked effects on the 
ease of PA acquisition. In the case of 
intrapair S-R similarity, the results show 
that ease of acquisition also depends on 
the predictability of the repeated letter. 
Finally, the data show that the location 
of an intrapair repeated letter determines, 
at least in part, the distribution of the 
types of errors made by S. Whether 
the relative facilitation produced by intra- 
pair similarity results from the addition 
of a repeated letter or from reduced in- 
terpair similarity (or from both), and 
whether the effects due to location of 
the intrapair repeated letter may be at- 
tributed to stimulus location or to re- 
sponse location (or to both), is a matter 
for future investigation. 
A-B, B-C, A-C MEDIATION PARADIGM:


RECALL OF A-B FOLLOWING VARYING NUMBERS 
OF TRIALS OF A-C LEARNING* 
Recall of A-B was measured after 0, 3, 7, or 15 trials of A-C acquisition
in an A-B, B-C, A-C mediational chaining (C) or an A-B, B-E,
A-C nonchaining (NC) verbal paired-associates paradigm. Theo- 
retically, these manipulations were designed to trace the mediational 
process throughout the course of A-C learning by using A-B recall as 
an index of the “state” of the A-B-C associative chain which was pre- 
sumed to have been used in learning A-C under C conditions. The 
results indicated, as shown previously, that performance on A-C under 
C conditions was reliably superior to that under NC conditions—medi- 
ated facilitation. Also, recall of A-B did not differ for these condi- 
tions prior to A-C learning, 0 A-C trials. Thereafter, A-B recall as 
a function of A-C trials remained constant under C conditions and de- 
creased under NC conditions. Additional analysis showed, among 
other things, that initially weak A-B associations were strengthened 
while those which were strong initially first increased and then decreased
in strength (mediator “drop-out”) as a result of their having been 
used in acquiring A-C rapidly. Under NC conditions, unlearning 
of A-B was the rule regardless of the initial strength of A-B or the 
ease with which A-C was acquired. The results were interpreted as
supporting an associative model of the process of mediation.


Several recent investigations of the
phenomenon of mediation in the paired
associates A-B, B-C, A-C chaining
paradigm have demonstrated that useful
inferences regarding mediational
processes can be based on the results
of a test for the recall of A-B following
Stage III A-C acquisition (Horton &
Wiley, 1967a, 1967b; Schulz, Liston,
& Weaver, in press). The present
study represents a continuation of this
line of investigation.

More specifically, the studies cited
above have shown explicitly, among
other things, that the use of the A-B-C 
chain of associations during the learn- 
ing of A-C under chaining conditions 
(C) preserves or maintains the strength 
of the A-B association and contrasts 
with the state of affairs that obtains 
under nonchaining (NC) control conditions
(e.g., A-B, D-C, A-C) where
A-B is unlearned and/or forgotten 
during the course of A-C acquisition. 
While the overall effects of A-C learn- 
ing on A-B associative strength under 
C and NC conditions appear reasonably 
well established, very little appears to 
be known, however, as to what happens 
to a mediating association during the 
course of Stage III A-C acquisition in 
Cond. C relative to Cond. NC. It has, 
e.g., been suggested that mediating as- 
sociations might “drop-out” as the de- 
gree of A-C learning reaches a high 
level (e.g., Barnes & Underwood,
1959). Similarly, it would be anticipated,
according to an associative interpretation
of this process, that the
strength of a mediating association 
might increase during the initial por- 
tion of A-C learning in Cond. C, pro- 
vided it is not at full strength at the 
time A-C learning is initiated. 

The present study attempted to trace 
the mediational process throughout the 
course of A-C acquisition. Recall of 
A-B following varying numbers of 
trials on A-C was used to index the 
strength of the A-B-C mediating chain 
in Cond. C while recall of A-B in a 
suitable NC condition (in this instance 
A-B, B-E, A-C) served as a baseline. 
Thus, the present experiment paral- 
leled, for associations acquired entirely 
in the laboratory, Barnes and Under- 
wood’s (1959) work with the A-B, 
A-B’ and the A-B, A-C paradigm 
where, in the case of the former, media- 
tion was inferred to have occurred via 
the semantic link (synonymity) between
B and B’ to form an A-B-B’ associative
chain.

Based on the foregoing discussion 
and the results obtained by Barnes and 
Underwood, it is possible to entertain 
two reasonably firm expectations: (a) 
as degree of A-C learning increases, 
A-B recall will first increase and then 
decrease in Cond. C, and (b) in Cond. 
NC, A-B recall should be a decreasing 
function of the degree of A-C learning. 


METHOD


Design—As noted above, the A-B, B-C, 
A-C three-stage chaining paradigm defined 
Cond. C while the A-B, B-E, A-C nonchain- 
ing control paradigm defined Cond. NC. 
The latter paradigm was chosen rather than 
the usual A-B, D-C, A-C control in an at- 
tempt to more closely equate the factors in- 
fluencing A-B recall. Thus the present con- 
trol reduces differences in second-stage list 
difficulty and provides for approximately 
equal amounts of A-B forgetting due to the 
elicitation and extinction of the backward 
B-A associations during Stage II learning 
(cf. Mandler & Earhard, 1964). Moreover,
the appropriateness of the present paradigm 
as a control for mediated facilitation during
A-C learning has been previously established 
in an experiment employing similar materials 
and procedures (Schulz, Weaver, & Gins- 
berg, 1965). 

Suitable preliminary work was conducted 
to determine four points at which to intro- 
duce A-B recall (Stage IV) so that a satis- 
factory sample of degrees of A-C learning 
would be involved. The representative points 
selected were: immediately following Stage 
II (List B-C or B-E) learning or after 3, 
7, or 15 Stage III (List A-C) learning 
trials. These latter points were picked to 
represent, respectively, the initial rapid in- 
crease in A-C performance, a point inter- 
mediate in this performance, and the ter- 
minal, ceiling level after which performance 
increases little, if any, with further trials. 
The combination of the two paradigms and
the four independent samples of A-B recall
results in a 2 × 4 factorial design.

Materials—The A terms in the eight-pair 
PA lists defining the above conditions con- 
sisted of low m Taylor (1960) paralogs. 
Low m Archer (1960) CVC trigrams served 
as C and E terms. Common nouns with 
Thorndike-Lorge (1944) frequencies of AA, 
having minimal associative overlap in the 
Russell and Jenkins (1954) norms, were 
used as B terms. Intralist and interlist for- 
mal similarity were as low as possible in 
both paradigms. 

Subjects—The Ss were 160 University of 
Iowa undergraduate students enrolled in an 
elementary psychology course for which par- 
ticipation in experiments was required. All 
Ss were naive with respect to PA learning 
experiments. A total of 20 Ss, 12 males and 
8 females, were assigned randomly to each 
of the eight conditions in order of appear- 
ance at the laboratory. The restriction was 
made that the nth S was not assigned to a 
condition until n-1 Ss had been assigned to 
all other conditions. A total of 14 Ss failed 
to reach the learning criterion of either 
Stage I or II in the required 25 trials and 
were thus dismissed from the experiment 
and replaced by the next S to arrive. Since 
all Ss learned a common first list and since 
only 1 S failed criterion on Stage II, there 
was no problem with differential S selection. 
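
The assignment restriction described above (no condition receives its nth S until every other condition has n-1) amounts to randomizing within successive blocks of eight arrivals. A minimal sketch in Python follows; the function name, the seed, and the tuple labels for the eight cells are illustrative assumptions, and the replacement of dismissed Ss is not modeled.

```python
import random
from collections import Counter
from itertools import product


def blocked_assignment(conditions, n_per_condition, seed=None):
    """Return conditions in order of S arrival, randomized within blocks.

    Each block holds every condition exactly once, so no condition can
    run more than one S ahead of any other.
    """
    rng = random.Random(seed)
    order = []
    for _ in range(n_per_condition):
        block = list(conditions)  # one slot per condition in this block
        rng.shuffle(block)        # random order of conditions within the block
        order.extend(block)
    return order


# Eight cells of the 2 x 4 design: paradigm x number of A-C trials before recall.
CONDITIONS = list(product(("C", "NC"), (0, 3, 7, 15)))
schedule = blocked_assignment(CONDITIONS, n_per_condition=20, seed=1)
counts = Counter(schedule)  # 20 Ss in every cell, 160 Ss in all
```

With 20 blocks of eight, the schedule covers the 160 Ss of the design.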

Procedure—The basic procedure used for 
the first three learning stages has been de- 
scribed previously (Schulz, Weaver, & Gins- 
berg, 1965). The study-test method was 
employed for all three lists and the usual 
recall procedure was used for Stage I test 
trials while, as in the previous study, test 
trials on Stages II and III took the form 


TABLE 1

MEANS AND SDs FOR TOTAL NUMBERS OF TRIALS TO REACH CRITERION ON STAGES I AND II
AND FOR TOTAL NUMBERS OF CORRECT RESPONSES ON STAGE III

                          Number of A-C Trials
Cond. |       0       |       3       |       7       |      15
      |  Mean |  SD   |  Mean |  SD   |  Mean |  SD   |  Mean |  SD

Stage I Trials to Criterion
C     | 10.30 |       | 10.75 |  5.19 |  9.75 |  4.89 |  9.90 |  4.85
NC    |  9.25 |       |  9.60 |  5.48 | 10.80 |  4.51 | 10.50 |  5.30

Stage II Trials to Criterion
C     |  8.25 |       | 10.45 |  5.01 |  9.90 |  4.56 |  9.15 |  4.76
NC    |  7.15 |       |  9.60 |  5.27 |  7.30 |  3.84 |  7.60 |  4.51

Stage III Total Number Correct
C     |   —   |   —   | 15.10 |  4.38 | 40.20 |  9.46 | 96.00 | 11.55
NC    |   —   |   —   | 10.45 |  4.59 | 30.35 |  7.60 | 79.90 | 18.84



of a four-alternative multiple-choice test for 
each pair. The test-trial alternatives on 
these lists had the numbers 1 to 4 printed 
below them and Ss responded with the num- 
ber of the alternative they had chosen. A 
2-sec. rate/item was used both on study and 
test trials for Stage I while for Stages II 
and III the study trial presentation rate was 
2 sec. and that for test trials 3 sec./item. 
Learning in both Stages I and II was to a
criterion of two successive correct trials. 

Appropriate instructions for A-B recall 
were given either immediately following 
completion of Stage II learning or after 
completion of the predetermined number of 
A-C trials. The Ss were not told before- 
hand that recall of A-B would be required 
and in all conditions the instructions and 
procedure for recall were the same. The
eight A terms were presented once, one at a
time, at a 1.5-sec. rate/item. No study
trial was given and S was encouraged to
guess if not sure of the correct response.
A rapid rate of presentation was employed 
in testing A-B recall so as to minimize the 
likelihood that Ss in C conditions would be 
able to mediate at the time of recall since 
Stages II and III, in conjunction with recall, 
conform to the B-C, A-C, A-B stimulus 
equivalence paradigm (cf. Schulz & Weaver, 
1968). In addition, as pointed out pre- 
viously (cf. Schulz, Liston, & Weaver, in 
press), there are several other considerations
which make it highly unlikely that Ss
can successfully employ an A-C-B associa- 
tive chain to mediate the recall of A-B un- 
der the present conditions. 

Four different random presentation orders
of A-C study and test trials were constructed
with one-fourth of the Ss in each condition being
assigned to each order. When A-C learning
was terminated, the next random order of 
test items was used for A-B recall (the mul- 
tiple-choice alternatives being covered by a 
shutter). A corresponding procedure pro- 
duced four different start orders for A-B 
recall in the conditions in which no A-C 
trials were given. Four different random 
presentation orders were also used for both 
study and test trials on the first and second 
stages to prevent serial learning. 


RESULTS 


The means and standard deviations 
summarizing performance in the first 
three stages of this study have been 
presented in Table 1 for each of the 
conditions. Considering each stage in 
turn, it may be seen first that the 
means for the numbers of trials re- 
quired to attain a criterion of two suc- 
cessively correct trials in Stage I (List 
A-B) are highly similar for the eight 
conditions ranging 9.25-10.80. The F 
for the between-cells variance was less 
than 1 providing statistical evidence 
that Ss in these conditions were com- 
parable in their ability to learn the 
first list. 

The means for the numbers of trials 
to reach criterion on Stage II (List 
B-C or B-E) also failed to differ significantly,
F (7, 152) = 1.27, p > .20.
There was a consistent trend favoring 
List B-E as slightly less difficult to 
learn than List B-C. The overall 
means for total trials to learn were 
8.06 and 9.44 for Lists B-E and B-C, 
respectively. However, a test of this 
difference did not reach significance at 
the .05 level, F (1, 152) = 3.45. 

In Stage III, performance in the C 
conditions is seen to be superior to that 
in the appropriate NC conditions at all 
levels of A-C learning, providing evi- 
dence for the presence of mediated 
facilitation. The total numbers of cor- 
rect responses in Stage III (List A-C) 
for Cond. C vs. NC were analyzed 
separately for each level of A-C train- 
ing. These differences are significant 
beyond the .01 level with t’s (38) of
3.28 and 3.63 for 3 and 7 A-C learning
trials, respectively, and t (33) of 3.26
after 15 trials. In referring the last
comparison of this series to the t distribution
a correction of the number of
degrees of freedom was employed to 
approximate the Behrens-Fisher dis- 
tribution (Winer, 1962, p. 37) since 
Cochran’s test indicated a significant 
deviation from homogeneous variances, 
C (2, 19) = .7267, p < .05. 
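
The three Stage III comparisons can be checked from the means and SDs in Table 1. The sketch below is illustrative only: it assumes 20 Ss per cell, and it writes the degrees-of-freedom correction in the Welch form with n + 1 and - 2, which is an assumption about the approximation taken from Winer (1962) rather than something stated in the text. The function names are hypothetical.

```python
import math


def t_equal_n(m1, s1, m2, s2, n):
    """t for two independent groups of equal size n, with df = 2n - 2.

    With equal n the standard error is the same whether or not the
    two variances are pooled.
    """
    se = math.sqrt((s1 ** 2 + s2 ** 2) / n)
    return (m1 - m2) / se, 2 * n - 2


def corrected_df(s1, s2, n1, n2):
    """Approximate df under unequal variances (assumed Welch form)."""
    a, b = s1 ** 2 / n1, s2 ** 2 / n2
    return (a + b) ** 2 / (a ** 2 / (n1 + 1) + b ** 2 / (n2 + 1)) - 2


# Cond. C vs. Cond. NC, total correct on Stage III (Table 1).
t_3, df_3 = t_equal_n(15.10, 4.38, 10.45, 4.59, 20)     # 3 A-C trials
t_7, df_7 = t_equal_n(40.20, 9.46, 30.35, 7.60, 20)     # 7 A-C trials
t_15, _ = t_equal_n(96.00, 11.55, 79.90, 18.84, 20)     # 15 A-C trials
df_15 = corrected_df(11.55, 18.84, 20, 20)              # variances differ here
```

Under these assumptions the three t values round to 3.28, 3.63, and 3.26, and the corrected df for the 15-trial comparison rounds to 33, in agreement with the values reported above.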

The mean numbers of correct re- 
sponses in Stage IV (A-B recall) are 
shown for each paradigm in Fig. 1. 
The standard deviations ranged 1.91- 
2.18 in the C conditions and 1.76-2.35 
in the NC conditions. It may be seen 
that the recall performance in the two 
paradigms is almost identical follow- 
ing Stage II learning; thereafter, the 
curves diverge rapidly with recall un- 
der C conditions remaining constant 
while recall under NC conditions is a 
decreasing function of the number of 
trials on A-C. Statistically, this trend 
toward interaction is a reliable one, F 
(3, 152) = 2.85, p < .05. Also, the
differences among C conditions are not 

Fig. 1. Recall of A-B following varying
numbers of trials of A-C learning in the
A-B, B-C, A-C (C) and A-B, B-E, A-C
(NC) paradigms.


significant, F (3, 76) < 1, while those 
among NC conditions result in F (3, 
76) = 5.38, p < .01. Thus, the evi- 
dence is unequivocal that A-B associa- 
tions are maintained under C but not 
under NC conditions. However, the 
additional expectation that A-B recall 
might first increase and then decrease 
with trials on A-C under C conditions 
is not supported by these data. Therefore,
the analysis was carried one step
further. 

The A-B and A-C items were ranked 
for each S in both Cond. C and NC 
to index their strength based on the 
number of times a given item had been 
correct during A-B or A-C acquisi- 
tion. Then A-B recall performance 
was tabulated as a joint function of 
A-B and A-C strength. To stabilize 
the percentages, an S’s four highest 


3 Unfortunately, it was not possible to employ
in the analysis a three-factor contingency
including B-C strength because the
item frequencies conforming to various 
strength categories were not large enough 
to give stable estimates of the recall per- 
centages associated with a given category. 
However, it has been shown elsewhere, in 
accord with an associative mediational model, 
that A-C performance is a function of both 
A-B and B-C strength (Weaver, Hopkins, 
and Schulz, 1968). 

Fig. 2. Recall of A-B after varying numbers
of trials of A-C learning as a function
of Stage I A-B strength in the A-B, B-C,
A-C (C) and A-B, B-E, A-C (NC) paradigms
when A-C was strong (upper panel)
and when A-C was weak (lower panel).


ranking items were included in the
“strong” category; the remaining four
defined the “weak” category. Ties in
rank were resolved randomly. The results
of this analysis are shown in Fig.
2. The upper panel of Fig. 2 depicts
recall performance for those A-B items
associated with the best A-C performance
while the lower half of Fig. 2
represents a parallel treatment of A-B
items associated with the poorest A-C
performance. Considering the upper
half of Fig. 2 first, it may be seen that 
the effects of increasing numbers of 
A-C trials were again dramatically dif- 
ferent under C and NC conditions. 
Moreover, the trend for strong A-B 
items is clearly the expected one of an 
increase followed by a decrease in A-B
recall. Weak A-B items seem, on the 
other hand, to have had their strength 
increased throughout A-C acquisition. 
Contrariwise, the A-B recall curves 
for NC conditions reflect merely un- 
learning or extinction of A-B both 
when A-B was strong and when it was 
weak. 
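
The strong/weak classification used for Fig. 2 (rank each S's eight items by number of times correct during acquisition, take the four highest as strong, resolve ties randomly) can be sketched as follows. The item labels and counts are hypothetical and serve only to show the procedure; they are not data from the experiment.

```python
import random


def split_strong_weak(times_correct, n_strong=4, seed=None):
    """Split one S's items into strong and weak by acquisition performance.

    times_correct maps each item to the number of trials on which it was
    correct. Ties in rank are resolved randomly.
    """
    rng = random.Random(seed)
    items = list(times_correct)
    rng.shuffle(items)  # random starting order, so tied items end up in random order
    # sorted() is stable, so items with equal counts keep their shuffled order
    ranked = sorted(items, key=lambda item: times_correct[item], reverse=True)
    return ranked[:n_strong], ranked[n_strong:]


# Hypothetical counts for one S's eight A-B pairs.
example_counts = {"p1": 9, "p2": 7, "p3": 7, "p4": 6,
                  "p5": 5, "p6": 3, "p7": 2, "p8": 1}
strong, weak = split_strong_weak(example_counts, seed=0)
```

Applying the same split to A-B and to A-C counts gives each item a pair of labels, from which recall can be tabulated as a joint function of A-B and A-C strength.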

In the lower half of Fig. 2, all of the 
curves reflect a tendency for A-B to be 
unlearned; however, this trend is most
apparent in NC conditions. Following 
7 and 15 trials of A-C learning, the 
superior maintenance of A-B under C 
conditions is again apparent. Also, 
after the initial decline at 3 A-C trials 
the trends, albeit less prominent, in the 
curves for C conditions are similar to 
those obtained in the upper half of 
Fig. 2 for this condition. 

Unfortunately, it is not possible, for 
a variety of reasons, to perform a sta- 
tistical analysis of the data in Fig. 2. 
Nevertheless, the major trends observed
here seem sufficiently compelling
to merit serious consideration.


DISCUSSION


The results obtained in the first three 
stages of the present experiment provide 
yet another replication of the now com- 
mon finding that facilitation of A-C 
learning occurs in the A-B, B-C, A-C PA 
chaining paradigm as compared to a non- 
chaining control (in this case A-B, B-E, 
A-C). This evidence for the presence 
of mediated facilitation is important for 
the present investigation in that it al- 
lows for differential predictions regard- 
ing changes in A-B associative strength 
(as indexed by A-B recall) in the two 
respective paradigms. The fact that there 
was no significant difference in difficulty 
for Lists B-C and B-E provides additional 
evidence that until the differential transfer 
relationships are introduced in Stage III,
the associative and learning variables 
which might influence A-B recall are es- 
sentially equivalent for the two paradigms. 
As expected, A-B recall in the C conditions
was superior to that in the NC
conditions after 3, 7, and 15 A-C learning 
trials. The failure of recall performance 
to differ immediately following Stage II 
is taken as particularly strong evidence 
that the associative relationships involved 
in A-C learning produced the subsequent 
differences in recall. Furthermore, since
List A-C was identical for all conditions, 
it may be inferred that it was the presence 
or absence of the possibility of forming a 
mediating chain which was the significant 
factor in these A-B recall differences. 
Thus, it seems firmly established, based 
on the present evidence and that ob- 
tained recently in other studies employing 
similar techniques (Horton & Wiley, 
1967a, 1967b; Schulz, Liston, & Weaver, 
in press) that A-B associations are 
maintained as the result of their being 
used as part of the A-B-C associative 
chain employed to mediate the learning 
of A-C in C conditions. 

Though it had been anticipated that 
A-B recall might first increase and then 
decrease during the course of A-C learn- 
ing under C conditions, this expectation 
failed to be confirmed by the “raw” re- 
call data (cf. Fig. 1). Rather, the ef- 
fectiveness of the presence of a media- 
tional relationship in maintaining A-B as- 
sociations appears to have increased as a 
function of A-C trials by off-setting the 
cumulative effects of extinction of A-B 
along with the effects of other factors 
producing forgetting under NC condi- 
tions. In short, the constancy of A-B 
recall over A-C trials under C conditions 
represents, relative to recall in NC conditions,
an increase in the effective maintenance
of A-B. Furthermore, while
there was constancy overall, further 
analysis of the recall data revealed that 
several diverse trends combined to pro- 
duce this result (cf. Fig. 2). Hence, 
this analysis which took into account 
jointly the differences in the strengths 
of A-B associations and the strengths of 
their A-C “counterparts,” permits one to 
gain, at least tentatively, additional in- 
sight into the process of mediation under 
the present conditions. 

The interpretation of the analysis displayed
in Fig. 2 requires, as indicated
earlier, the assumption that A-B recall 
under C conditions provides an index of 
the “state” of the A-B-C associative 
chain. Accordingly, the increase in recall
of A-B where the association between
A and C was acquired rapidly (i.e.,
A-C was strong) implies the effective and
continued utilization of the A-B-C associative
chain, of which A-B is one portion,
during the course of A-C acquisition.
The strengthening of A-B was
particularly apparent if A-B was weak
initially. Where the association between
A and C was acquired less rapidly (i.e.,
A-C was weak) it may be inferred that
the use of the A-B-C chain was only par- 
tially successful on early trials as there 
was an initial decrement prior to an in- 
crease in A-B recall. The likelihood that 
the use of this chain will be halted and fol- 
lowed by the “drop-out” of the mediator 
seems to depend on the presence of a 
strong A-B association which can be em- 
ployed, in conjunction with the existing 
association between B and C, to acquire an 
association between A and C rapidly; 
thus, only in the case where both A-B and 
A-C were strong was there evidence for 
the predicted increase followed by a de- 
crease in A-B recall under C conditions. 
Since it was not possible to evaluate 
statistically the data which have just been 
discussed, it is of special interest to 
contrast them with the results of a simi- 
lar analysis reported by Schulz, Liston, 
and Weaver in a study alluded to earlier. 
These investigators employed 10 A-C 
trials, a value not used here, the lengths 
of their lists differed from the present 
ones, and their NC condition conformed 
to the A-B, D-C, A-C rather than the 
present A-B, B-E, A-C paradigm; hence, 
direct comparison is not possible. It is 
possible, nevertheless, to compare the 
major trends in their data with those in
this study. Such a comparison reveals 
complete accord between the two studies: 
(a) A-B recall was directly related to 
both A-B and A-C strength; (b) the dif- 
ference between C and NC conditions was 
greatest when A-C was strong; and (c)
weak A-B associations “benefited” more 
than strong ones from A-C strength. 
These trends appear to be, therefore, reproducible
ones. Clearly, the next step
is the one of confirming further the im- 
plications of the interpretation set forth 
above by means of experiments suitably 
designed to afford direct manipulation of 
the factors involved. 

Lastly, a brief comment seems in order 
regarding A-B recall under NC condi- 
tions. Both the overall recall (cf. 
Fig. 1) curve and the curves for the 
items of varying A-B or A-C strength 
(cf. Fig. 2) show the “classic” unlearning 
decrement as a function of number of 
A-C trials (Barnes & Underwood, 1959; 
McGovern, 1964).⁴ This is, of course,
by itself not surprising. What is sur- 
prising, at least to the authors, is that 
these decrements occur in spite of the fact 
that negative transfer was never ob- 
served under NC conditions during the 
acquisition of A-C even though Stages I 
and III are in an A-B, A-C relation in 
these conditions. 

A control appropriate for the assess- 
ment of the presence or absence of nega- 
tive transfer was not included in the 
present design; however, such controls 
(e.g., X-B, B-C, A-C or A-B, B-C, E-F)
have been employed on several occasions 
under conditions identical to the present 
ones in all essential respects and they in- 
dicated a total absence of negative trans- 
fer under NC conditions (e.g., Schulz, 
Liston, & Weaver, in press). Evidently 
then, an association may be unlearned 
even if negative transfer fails to occur 
during the process. 


4 It may be inferred that the recall decrement
(greater than 50%) under NC conditions
following 3 to 15 trials of A-C learning
is “specific” to the A-B, A-C relation
between Stages I and III since Schulz, 
Liston, and Weaver (in press) observed 
A-B recall to be 65% following 10 trials of 
Stage III performance in an A-B, B-C, E-F 
paradigm.
CONTINUING AND REVERSING THE DIRECTION
OF RESPONDING MOVEMENTS:

SOME EXCEPTIONS TO THE SO-CALLED “PSYCHOLOGICAL
REFRACTORY PERIOD” ¹


JOHN BREBNER 
University of Dundee 


Delays in responding to a signal to continue or to reverse the direction
of a response movement were calculated where the signal arrived
shortly before the response movement to a prior signal began. The
results indicate that the processing of signals conveying instructions
to continue a responding movement is not delayed even when the 2nd
signal arrives during RT1. More surprisingly, without being entirely
conclusive, the data suggest that signals to reverse the direction of
movement may also be processed without delays. This evidence indicates
the selective nature of the “gating process,” by which ongoing
activity of the decision mechanism is assumed to be protected from the
arrival of further signals.


Apart from the case of “grouping,”
exceptions to the so-called “psychological
refractory period” have rarely been
noted in the sizeable literature on this
phenomenon. Vince (1950) did find
evidence which she interpreted as
showing that the execution of one response
and the organization of a second
could overlap. It seems possible,
however, that in her experiment the
organization of the second response
(R2) might, in part, at least, have
taken place during the reaction time to
the first signal (RT1).

Other data reported by Vince
(1950) showed that responding movements
could occasionally be curtailed
without apparent delay, and that, infrequently,
the undelayed substitution
of one response for another could be
effected. The modification of a response
by a further signal presented
during RT1 has received little attention,
and the tenor of work on the single-channel
hypothesis has been that
exceptions to the psychological refractory
period are unimportant if they
occur at all.


1 The author acknowledges the technical
assistance of W. Crabb and M. Attivissimo
in carrying out this study.

Recently, however, the whole ques- 
tion has been raised again by Gotts- 
danker (1966) and by Vince and Wel- 
ford (1967) who tentatively propose 
an important refinement in Welford’s 
(1952, 1959) statement of the single- 
channel hypothesis. This modification 
is that, given two rapidly successive 
signals for response, only responses to 
the second signal which require the 
initiation of new patterns of muscular 
action will be delayed. 

More specifically, it is hypothesized 
post facto that signals conveying in- 
structions merely to increase or to re- 
duce the activity of those muscles al- 
ready in action are not excluded by the 
“gating mechanism” which is assumed 
to protect the single-channel decision 
mechanism. The present experiment 
tests this hypothesis. 


METHOD 


Subjects—Seven Ss, five men and two 
women, took part in the experiment. All 
Ss were members of the second year psychology
class at the University of Dundee
and all were naive with respect to the aims 
of the experiment. 

Apparatus—The display consisted of four 
small neon lamps grouped horizontally in two 
pairs with a distance of 1 in. between pairs. 
The neons were attached to a hardboard 
screen at eye level, and were viewed from 
a distance of 10 ft. 

The S responded to the onset of any neon 
by moving a short lever with his right hand 
to a like position on a five-way Maka-Switch. 
The vertical lever position, corresponding to 
the small central gap between neon pairs, 
was the neutral starting point for each trial. 

A keying unit, designed for this purpose, 
allowed E silently to introduce controlled 
intervals between signals, and to select the 
order of signals (see below). The S’s response
times (TR), i.e., the time taken to
reach the correct response position which 
includes the brief time spent in moving the 
lever to that position, were recorded from 
two digital readout timers with an accuracy 
of 10 sec. 

Each S provided 80 pairs of TR1, TR2 
readings evenly spread over four Signal 1- 
Signal 2 intervals in four experimental 
conditions. The intersignal intervals used 
were 100, 175, 250, and 325 msec. The four 
display/response conditions may be desig- 
nated: (a) RL, S moved the response lever 
from the neutral vertical position to the 
nearer right-hand response position in re- 
sponse to Signal 1 then back, through the 
vertical position, to the nearer left-hand posi- 
tion in response to Signal 2; (b) LR, repre- 
sents a similar left-right movement; (c) 
RR, stands for a move to the nearer right- 
hand position in response to Signal 1, fol- 
lowed by a continuation of that movement to 
the further right-hand position in response 
to Signal 2; and (d) LL, stands for a corresponding
left-left movement.

In addition to these four response possibilities,
the nearer signal lamp on one side
was presented with no following signal on
20 occasions, i.e., 10 L and 10 R.

In brief, there were six possible signal 
combinations in two of which (L and R) S 
made only one response, in two of which 
(LL and RR) S could simply continue the 
movement begun in response to Signal 1, and
in two of which (RL and LR) S was required
to make a movement in response to
Signal 2 in the opposite direction to the first
response movement. The muscles mainly involved
in these responses are, in L movements,
pronator teres and pronator quadratus
and, in R movements, supinator and
biceps brachii.

Signals were delivered discretely, E giving 
a verbal “Ready” warning signal before each 
presentation. The Ss returned the response 
lever to the neutral vertical position before 
each presentation. Two small filament lamps 
indicated when the lever reached either of 
the further positions, and readings on which 
S overshot the nearer position to this extent 
were discounted. 

Procedure—The S sat at a table facing 
the display at a distance of 10 ft. The re- 
sponse lever was located directly in front 
of S. The S was informed about the experi- 
ment and instructed to respond as quickly as 
possible by moving the lever to the approp- 
riate position as shown on the display. 

The S was specifically asked to respond 
as quickly as possible to the first signal 
rather than adopting the strategy of “waiting 
to see” if a second signal was delivered. 
Precise instruction was also given to fixate 
a vertical line midway between the two cen- 
ter lights in the display when E said “Ready,” 
and to expect a signal(s) about 14-4 sec. 
after the warning signal. 

The S then tried out the response lever, 
finding the four response positions and neu- 
tral vertical position. Using the two center 
lights presented simultaneously as the signal, 
S practiced each of the four R1 — R2 possi- 
bilities in turn, LL, LR, RR, RL, and five 
pairs of response times TR1—TR2 were 
recorded when practice was completed. A
further five readings were obtained at the 
end of the experiment. The means of these 
sets of response times were used in calculating
TM, the time spent in moving the response
lever between the positions.

The experiment lasted for about 45 min., 
a brief rest pause being allowed at the 
halfway stage. Throughout the test session 
E was completely screened from S. The
sequences of signals and the values of I (interval
between S1 and S2) were randomly
ordered. 


RESULTS 


Table 1 shows the median response 
times obtained for each S at each in- 
terval. The appropriate estimated 
movement-time component of TR2 
(TM2) for each S is also shown in this 
table.

Movement time (TM2) was esti- 

TABLE 1

MEDIAN VALUES OF TR1 AND TR2 (MSEC.) OBTAINED

  |                      LL                       |                      RR
S |    100    |    175    |    250    |    325    |    100    |    175    |    250    |    325
  | TR1 | TR2 | TR1 | TR2 | TR1 | TR2 | TR1 | TR2 | TR1 | TR2 | TR1 | TR2 | TR1 | TR2 | TR1 | TR2

1 | 398 | 322 | 378 | 276 | 448 | 249 | 430 | 152 | 348 | 276 | 452 | 295 | 421 | 276 | 457 | 265
  | TM2 = 21 msec.                                | TM2 = 21 msec.

2 | 491 | 454 | 471 | 346 | 481 | 302 | 479 | 258 | 481 | 470 | 457 | 380 | 470 | 361 | 467 | 474
  | TM2 = 67 msec.                                | TM2 = 69 msec.

3 | 423 | 432 | 523 | 511 | 449 | 423 | 409 | 427 | 345 | 352 | 373 | 357 | 436 | 363 | 478 | 407
  | TM2 = 19 msec.                                | TM2 = 28 msec.

4 | 570 | 501 | 623 | 487 | 568 | 351 | 524 | 242 | 507 | 453 | 506 | 367 | 561 | 354 | 549 | 272
  | TM2 = 29 msec.                                | TM2 = 36 msec.

5 | 524 | 476 | 518 | 391 | 412 | 306 | 519 | 513 | 460 | 417 | 459 | 343 | 513 | 354 | 475 | 498
  | TM2 = 34 msec.                                | TM2 = 54 msec.

6 | 435 | 390 | 434 | 314 | 425 | 317 | 407 | 328 | 489 | 450 | 527 | 416 | 474 | 296 | 562 | 354
  | TM2 = 47 msec.                                | TM2 = 54 msec.

7 | 472 | 411 | 558 | 434 | 546 | 334 | 563 | 353 | 554 | 494 | 508 | 372 | 509 | 298 | 544 | 352
  | TM2 = 46 msec.                                | TM2 = 39 msec.


mated from the means of the response 
times obtained before and after the test 
session by subtracting the first response 
time from the second response time. 
For clarity it may be noted again that 
in these particular sets of response 
times, S knew in advance which move- 
ments to make, RR, LL, RL, and LR, 
and that the timing of both responses 
began simultaneously on presentation 
of the signal.

Error data have not been included 
since, e.g., no S made more than five 
“overshoot” errors. 


DISCUSSION


The hypothesis that no delays in TR2
will be found where the second signal
arrived during RT1,² and which conveys
instructions to continue the movement
made in response to the first signal, can
most simply be tested in the following
manner. Given the nature of the task already
described, TR2 will be accurately
predicted from the formula TR2 = TR1
+ TM2 − I, where TR1 is the time taken
to initiate and complete a response to the
first signal.


2 RT is used to mean reaction time, while
TR stands for time to complete the response,
i.e., TR = RT + TM; thus TR2 represents
the time elapsing between the arrival of S2
and the completion of the appropriate responding
movement.


Fig. 1. Diagram showing rationale for formula TR2 = TR1 + TM2 − I. (TM = time
spent in response movement, TO = time spent in organizing the response, and Delay: where
TR1 > TO2 + I the start of the response movement to S2 will be delayed until TR1 is
completed.)


Figure 1 shows the rationale for the
formula. It can be seen that there is
precise prediction of TR2 where I < TR1
− TO2. Where I > TR1 − TO2 the
predicted values will fall short of the obtained
values. Thus greater discrepancies
between TR2 predicted and obtained
might be anticipated at longer
values of I.

The main feature of the formula is,
however, that TR2 is predicted on the
AT THE FOUR INTERSIGNAL INTERVALS


LR 


100 175 250 325 


100 175 250 325 


TR1 | TR2 | TR1 | TR2 | TR1 | TR2 | TR1 | TR2


TR1 | TR2 | TR1 | TR2 | TR1 | TR2 | TR1 | TR2 


407 | 670 | 509 | 482 | 373 | 373 | 428 | 426 
487 | 734 | 525 


552 | 761 


516 | 497 | 548 | 538 | 529 
TM2 = 284 msec. 
517 | 749 | 600 | 671 | 469 | 551 | 472 | 473 
TM2 = 239 msec.


542 | 756 


486 | 717 | 548 | 723 | 644 | 750 | 510 | 698 
TM2 = 302 msec.

485 | 673 | 457 | 582 | 438 | 461 | 430 | 485 
TM2 = 245 msec.

545 | 707 | 496 | 605 | 520 | 552 | 513 | 493 
TM2 = 318 msec. 

551 | 663 | 478 | 538 | 469 | 495 | 576 | 527 


peels 194 ieee 


assumption that the start of the organiza- 
tion of a response to the second signal is 
not delayed even though the decision 
mechanism is already engaged in organiz- 
ing a response to the first signal. This 
denies the single-channel hypothesis. The 
differences between TR2 predicted in this 
way and the observed value of TR2, for 
the LL and RR conditions, were cal- 
culated and are shown in Table 2. 
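
The prediction and the discrepancy tallied in Table 2 can be illustrated with the formula TR2 = TR1 + TM2 − I. The sketch below applies it to the median values for S 1 in the LL condition from Table 1; note that the differences in Table 2 were based on individual readings, so medians are used here only as an illustration, and the function names are hypothetical.

```python
def predict_tr2(tr1, tm2, interval):
    """Predicted TR2 (msec.) if organizing R2 is not delayed by R1."""
    return tr1 + tm2 - interval


def discrepancy(tr1, tr2_observed, tm2, interval):
    """Observed minus predicted TR2; positive when the prediction is too short."""
    return tr2_observed - predict_tr2(tr1, tm2, interval)


# S 1, LL condition: (TR1, TR2) medians at each intersignal interval, TM2 = 21 msec.
INTERVALS = (100, 175, 250, 325)
S1_LL = ((398, 322), (378, 276), (448, 249), (430, 152))
diffs = [discrepancy(tr1, tr2, 21, i) for (tr1, tr2), i in zip(S1_LL, INTERVALS)]
# diffs == [3, 52, 30, 26]
```

At the 100-msec. interval the predicted value (319 msec.) falls within 3 msec. of the observed median, while the prediction falls further short at the longer intervals.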

It may be seen from Table 2 that, al- 
though there are some large discrep- 
ancies between predicted and observed 
TR2 values, most of the large discrepan- 
cies occur, as anticipated, at the longer 
intervals. The majority of the differ- 
ences fall between as many as 10 msec. 
too short and 20 msec. too long. This 
level of agreement implies that, in the 
case of a continued movement, either the
organization of the second response occurs
during TM1 or the central processes
concerned with the organization of responses
to the first and second signals
can run concurrently. TM1 was not
measured on each occasion in this experiment,
but it would seem that TM1 is
probably too short to cover the organization
of R2. This possibility was tested
directly by running two additional naive
Ss, one male and one female, in the experiment
and timing the movement time
of the first response.

The mean value of TM1 and σ were
calculated for both Ss and were found to
be: mean TM1 = 23 msec., σ = 12.9 msec.
for the first S, and mean TM1 = 39 msec.,
σ = 27.1 msec. for the second S.

While these data underestimate S’s true 


TABLE 2

DIFFERENCES BETWEEN TR2 PREDICTED AND OBSERVED (MSEC.) IN THE LL AND RR CONDITIONS AT EACH I VALUE

Difference               I (msec.)
(msec.)        100   175   250   325   Total

TR2 Predicted < TR2 Observed

>150             1     7     8    25      41
141-150          1     0     2     2       5
131-140          1     2     5     2      10
121-130          0     0     2     1       3
111-120          0     0     0     0       0
101-110          0     0     1     0       1
 91-100          2     0     0     0       2
 81- 90          1     2     0     1       4
 71- 80          0     0     2     0       2
 61- 70          1     0     2     1       4
 51- 60          0     2     2     2       6
 41- 50          3     3     1     5      12
 31- 40          1     1     0     3       5
 21- 30          5     3     5     5      18
 11- 20         15     6    12    10      43
  0- 10         27    29    19    10      85

TR2 Predicted > TR2 Observed

[entries for the smaller differences are illegible]
>30              0     0     0     0       0


TABLE 3

MEAN DIFFERENCE OF TR1 MEDIANS (MSEC.)


movement time by the time required to 
take up the initial inertia of the switch, 
and also by any brief period spent moving 
the switch before breaking the timing 
circuit, it is still highly unlikely that the 
true movement time would be sufficiently 
lengthy for the organization of R2 to 
occur during TM1. 

A further doubt remains to be dis- 
pelled. Welford’s original proposed ex- 
planation of the phenomenon of the psy- 
chological refractory period was ex- 
pressed in terms of the central processes 
concerned with separate stimuli not being 
able to coexist (Welford, 1952). This 
allowed the case of “grouping” to be 
dealt with satisfactorily although the 
question of what makes for the “separate- 
ness” of signals has not, as yet, received 
systematic study. 

The data so far discussed could be the 
result of grouping. Although Ss were instructed
not to “wait and see if a
second signal is given,” such an
instruction risks sensitizing S
to this possible strategy. The start of
the processing of S1 may have been de- 
layed until the second signal arrived or 
for some period before S decided that the 
second signal would not be presented. 
It requires to be shown that this did not 
happen before accepting the alternative 
that, in the case of a continued movement, 
the central processes concerned with the 
organization of a response to the second 
signal are not delayed even where the sec- 
ond signal arrives during RT1. 

If grouping is to explain the results, it 
would be expected that TR1 should show 
a systematic increase as I increased. On
the “wait for the second signal” strategy,
the increases in TR1 should parallel the
increases in I. To test this, the differences
between median TR1 values at succeeding
intervals were calculated. On the
hypothesis that there is no systematic change
in TR1 as I increases, these differences
should be distributed around a mean of 
zero; on the grouping hypothesis, the dif- 
ferences should fall around a mean of +75 
msec. Table 3 gives the mean difference 
between the TR1 medians for each S. 
The results in Table 3 offer no support for 
the grouping hypothesis. 

Given what, for the most part, is an 
extremely good prediction of TR2 shown 
in Table 2, it may be worth looking more 
closely at the predictive failures. In 60 


TABLE 4

DIFFERENCES BETWEEN TR2 PREDICTED AND OBSERVED (MSEC.) IN THE LR AND RL CONDITIONS AT EACH I VALUE

[Table body not recoverable. Rows: difference in msec. (TR2 Predicted < TR2 Observed, 0-10 through >150; TR2 Predicted > TR2 Observed, 1-10 through >100). Columns: I = 100, 175, 250, and 325 msec., and Total.]

cases TR2 observed exceeds the predicted
value by more than 100 msec. Twenty- 
eight of these cases are contributed by one 
S—S3. There is no additional evidence 
to suggest that this S performed the task 
in a different manner from other Ss. It 
can only be said that any increase in TR2 
beyond the predicted level must be in- 
curred after TR1 and, if S was following 
the instruction to respond as quickly as 
possible, it seems unlikely to be due to a 
slowing of TM2. It remains an open 
possibility that the initiation of R2 was 
slowed or prevented, perhaps by the 
processing of feedback from R1 or possibly
the organization of R2 after TR1.
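
The count of 60 large underestimates can be checked against the Total column of Table 2 (LL and RR conditions). The following is a minimal sketch in Python; the variable names are illustrative and are not part of the original report.

```python
# Totals from Table 2 for the rows in which observed TR2 exceeded
# predicted TR2 by more than 100 msec. (keys are difference bands in msec.).
table2_totals_over_100 = {
    ">150": 41,
    "141-150": 5,
    "131-140": 10,
    "121-130": 3,
    "111-120": 0,
    "101-110": 1,
}

# Number of cases with an underestimate larger than 100 msec.
large_failures = sum(table2_totals_over_100.values())

# Share of those cases contributed by S3 (28 cases, as stated in the text).
share_s3 = 28 / large_failures

print(large_failures)
print(round(share_s3, 2))
```

On these figures, a little under half of the large predictive failures come from a single S.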

On the hypothesis put forward by Vince 
and Welford (1967), the “gating process,” 
which protects the single-channel decision 
mechanism, excludes signals conveying 
instructions to bring other muscles into 
play in a fresh pattern of response. Ac- 
cordingly, in the LR and RL conditions, 
one would expect TR2 predicted from 
the formula TR2 = TR1 + TM2 - I to
fall short of TR2 observed. The extent 
of this underestimation would reflect the 
time taken to organize the initial response 
to the second signal after RT1. This can- 
not be predicted with any precision since 
there is no way of knowing if R2 will be 
(partially) organized during TM1, but it 
would be expected that the organization 
of R2 would take place after RT1. 


Table 4 shows the differences between 
TR2 observed and predicted from the 
formula TR2 = TR1 + TM2 - I for the
LR and RL conditions. 
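
The prediction used for Tables 2 and 4 can be written out as a short Python sketch. The function names and the numerical values in the usage lines are hypothetical illustrations, not data from the experiment; only the formula TR2 = TR1 + TM2 - I and the condition I < TR1 - TO2 come from the text.

```python
def predict_tr2(tr1: float, tm2: float, interval: float) -> float:
    """Predicted TR2 (msec.) from the formula TR2 = TR1 + TM2 - I."""
    return tr1 + tm2 - interval


def discrepancy(tr2_observed: float, tr1: float, tm2: float, interval: float) -> float:
    """Observed minus predicted TR2; positive values mean the formula fell short."""
    return tr2_observed - predict_tr2(tr1, tm2, interval)


def precise_prediction_expected(interval: float, tr1: float, to2: float) -> bool:
    """The text expects precise prediction only where I < TR1 - TO2."""
    return interval < tr1 - to2


# Hypothetical values, chosen only to show the arithmetic.
example_prediction = predict_tr2(tr1=450, tm2=300, interval=100)
example_discrepancy = discrepancy(tr2_observed=700, tr1=450, tm2=300, interval=100)
```

A positive discrepancy falls in the "TR2 Predicted < TR2 Observed" half of the tables, and a negative one in the "TR2 Predicted > TR2 Observed" half.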

It is clear from Table 4 that some 
underestimation of TR2 has occurred as 
predicted. The extent of this underesti- 
mation is, however, less than might have 
been expected on the single-channel hy- 
pothesis, and on 58 occasions the pre- 
dicted value exceeds that observed. This 
latter result must reflect the fact that 
TM2, in these cases, is shorter than the 
estimated value of TM2 employed in pre- 
dicting TR2. But the important question 
raised by the data in Table 4 is that of 
when R2 is organized. Even allowing 
that R2 may be organized during TM1 
plus the time taken to overcome the 
inertia of the switch, it appears that the 
organization of R2 might start during 
RT1. One can check this possibility from 
the data of the two Ss for whom RT1 
and TM1 were measured separately. 

If it is assumed that the organization 
of R2 (TO2) does not take place during 
RT1, then in the present experiment the
time available for TO2 + TM2 may be
calculated from (TR2 + I) - RT1. This
allows that TO2 may occur during TM1. 
These values should exceed the estimated 
TM2 in the RL and LR conditions by the 
amount of TO2. Table 5 shows the mean 
times calculated from (TR2 + I) - RT1


TABLE 5

COMPARISON OF TO2 + TM2 (MSEC.), CALCULATED FROM (TR2 + I) - RT1, AND ESTIMATED VALUES OF TM2 (MSEC.)

RL Condition
S1 TM2 Estimated = 332 msec.          S2 TM2 Estimated = 318 msec.
I (msec.)   100   175   250   325     100   175   250   325
Mean        332   322   [remaining entries illegible]

LR Condition
S1 TM2 Estimated = 336 msec.          S2 TM2 Estimated = 323 msec.
I (msec.)   100   175   250   325     100   175   250   325
Mean        383   339   347   396     310   327   388   390

for both Ss and compares these with TM2
estimated from the pre- and posttest 
sessions. 

Table 5 shows that, for the most part, 
the tabled readings are not markedly 
longer than TM2. It would appear, there- 
fore, that the organization of R2 begins 
during RT1 even where R2 involves mus- 
cle groups antagonistic to those involved 
in R1.
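
The comparison behind Table 5 can be sketched for the LR condition, where the tabled values are legible. This is an illustrative Python sketch; it assumes that the first four LR entries belong to S1 and the last four to S2, which is how the flattened table reads, and the helper name `available_time` is hypothetical.

```python
INTERVALS = (100, 175, 250, 325)  # intersignal intervals I, in msec.


def available_time(tr2: float, interval: float, rt1: float) -> float:
    """Time available for TO2 + TM2, calculated as (TR2 + I) - RT1."""
    return (tr2 + interval) - rt1


# LR condition: estimated TM2 and the tabled mean values of (TR2 + I) - RT1.
tm2_estimated = {"S1": 336, "S2": 323}
tabled_means = {
    "S1": (383, 339, 347, 396),
    "S2": (310, 327, 388, 390),
}

# Excess of the available time over estimated TM2. If R2 were organized
# only after RT1, each excess should be about as large as TO2.
excess = {
    s: [value - tm2_estimated[s] for value in values]
    for s, values in tabled_means.items()
}

for s in excess:
    print(s, dict(zip(INTERVALS, excess[s])))
```

Several of the excesses are only a few msec., and one is negative, which is the pattern the text describes as "not markedly longer than TM2."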

Returning to the data from the first 
part of the experiment, the grouping hy- 
pothesis has already been shown to derive 
no support from the TR1 medians which 
showed no systematic increase as I in- 
creased. This point may be reinforced 
by noting that if grouping had occurred, 
not only would TR1 be expected to in- 
crease as I increased, but TR2 would 
show no systematic decrease as I in- 
creased. If, on the other hand, TR1 does 
not change with I, the step decreases in 
TR2 should equal the differences in I, i.e., 
TR2 should decrease by 75 msec. at suc- 
ceeding intervals. Table 6 shows the 
means of the TR2 medians for each inter- 
val in the LR and RL conditions. The 
data in Table 6 confirm that grouping is 
not a possible explanation of the results.
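
The expected 75-msec. step can be compared with the Table 6 means. The sketch below is illustrative Python; it uses only the entries for I = 100, 175, and 250 msec., because the 325-msec. entries are damaged in the source and are left out rather than guessed.

```python
# Means of median TR2 (msec.) from Table 6 at I = 100, 175, and 250 msec.
tr2_means = {
    "LR": (709, 612, 541),
    "RL": (665, 600, 540),
}

EXPECTED_STEP = 175 - 100  # successive intervals differ by 75 msec.

# Decrease in TR2 from each interval to the next.
steps = {
    condition: [earlier - later for earlier, later in zip(values, values[1:])]
    for condition, values in tr2_means.items()
}

all_steps = [step for condition_steps in steps.values() for step in condition_steps]
mean_step = sum(all_steps) / len(all_steps)

print(steps)
print(mean_step, EXPECTED_STEP)
```

The four legible steps average close to the 75 msec. expected if TR1 does not change with I.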

Where TR2 predicted is less than TR2 
observed, then either the true value of 
TM2 exceeds the estimated value em- 
ployed in predicting TR2, or, as antici- 
pated, there is a delay in the central 
processes concerned with the second sig- 
nal which is incurred after TR1. There 
is no way of choosing between these two 
alternatives from the present data. However,
even where TR2 predicted falls

TABLE 6
MEANS OF MEDIAN VALUES OF TR2 (MSEC.)

                        LR                                  RL
I (msec.) | 100 | 175 | 250 | 325         | 100 | 175 | 250 | 325
Mean      | 709 | 612 | 541 | [illegible] | 665 | 600 | 540 | [illegible]


short of TR2 observed, the smallness of 
the difference obtained in many cases 
makes it seem likely that some part of 
the organization of R2 has taken place 
during RT1. 

This unexpected finding would appear 
to require such a major revision of the 
single-channel hypothesis that it deserves 
to be scrutinized critically. There are 
two further possible explanations of the 
data. First, it could be true that R1 and 
R2 were organized successively, and that 
M1 was withheld until R2 was organized. 
In this case, TR1 in the L and R condi- 
tions would be expected to be considerably 
shorter than TR1 in any of the other 
conditions. Table 7 gives the observed 
mean TR1 in the various conditions, and
clearly refutes this expectation. 

Second, it is a possibility that in the 
RL and LR conditions TM2, as estimated 
in the pre- and posttest sessions, contains 
TO2. This could have come about if Ss 
had adopted a strategy of continuing a
movement beyond the R1 position, so that 
the organization of R2 took place partly 
during TM1, but also partly during an 
“overshoot” which continued in the 
wrong direction until modified by the 


TABLE 7
MEAN TR1 (MSEC.)

         L          LR          LL          R          RL          RR
S     N   Mean   N   Mean    N   Mean    N   Mean   N   Mean    N   Mean
1    10 | 422 | 20 | 432 | 20 | 407 | 10 | 410 | 20 | 403 | 20 | [illegible]
2    10 | 535 | 20 | 504 | 20 | 475 | 10 | 510 | 20 | 538 | 20 | 47?
3    10 | 453 | 20 | 451 | 20 | 472 | 10 | 434 | 20 | 431 | 20 | 434
4    10 | 579 | 20 | 572 | 20 | 565 | 10 | 541 | 20 | 562 | 20 | [illegible]
5    10 | 511 | 20 | 479 | 20 | 497 | 10 | 488 | 20 | 477 | 20 | 47?
6    10 | 508 | 20 | 514 | 20 | 439 | 10 | 507 | 20 | 539 | 20 | [illegible]
7    10 | 561 | 20 | 561 | 20 | 533 | 10 | 534 | 20 | 528 | 20 | 53?

organization of the reversal movement.
If this sort of overshooting stopped short 
of the further response position, it would 
be undetectable by E. The only factor 
against this as an explanation of the re- 
sults is that, with the small movements 
involved, it is unlikely that any fast-mov- 
ing overshoot could be reversed before 
reaching the further position. 

The conclusions from this experiment 
must be tentatively drawn, but the present 
results support the hypothesis of Vince 
and Welford (1967) that the gating 
mechanism does not exclude signals to 
increase or reduce the activity of muscles 
already in action. This would perhaps be 
expected from the common observation 
that in everyday situations reaching, 
grasping, touching movements, etc. do not 
typically show a jerkiness which would 
follow unless these movements were con- 
tinuously modified by information being 
fed back from monitoring the response. 

More surprisingly, where reversal of 
direction is required, the results from the 
present experiment may be interpreted as 
showing that a second signal can arrive 
after the processing of a previous signal 
has begun and still be dealt with during 


RT1. This interpretation hinges upon
the assumption that Ss, required to pro- 
duce a LR or RL response to a signal in 
the pre- and posttest sessions, prepare and 
emit the entire response as a unit, rather 
than emitting the first component movement
before preparing the second.


CONCEPT IDENTIFICATION AS A FUNCTION OF THE
TYPE OF TRAINING SERIES

The levels of the relevant dimension remained constant, alternated, or
varied randomly on successive training trials for 3 concept-identifica- 
tion series. 1 dimension was relevant and 3 dimensions were irrele- 
vant for each series. The number of dimensions whose levels varied 
on successive trials (1 or 3) was varied factorially with the series. 
All training series had the same random-variation test series. Fol- 
lowing 6 training trials, the alternating groups made more errors on 
the test series than the other groups. When the training trials were 
run to a criterion, the random group made the most errors during 
training and the fewest errors on the test trials. The constant group 
made fewest errors during training and approximately the same num- 
ber of errors as the alternating group during the test trials. Response 
biases, associated primarily with the alternating series, appeared to 


retard test-trial performance. The number of dimensions whose levels
varied on successive trials did not affect the results.


It is well known that the number 
and proportion of relevant dimensions 
as well as the total number of dimensions
influence performance on concept-identification
problems (e.g., Bulgarella & Archer, 1962; Walker &
Bourne, 1961). Furthermore, the se- 
quence of dimension attributes chang- 
ing on successive trials also affects per- 
formance even though the numbers of 
relevant and irrelevant dimensions are 
held constant (Kurtz & Hovland, 
1956; Peterson, 1962). 

In the latter experiments, the se- 
quence of training trials consisted of a 
relevant dimension that remained con- 
stant while the levels of the irrelevant 
dimensions varied from trial to trial, 
constant series (C), or the levels of 
the relevant and irrelevant dimensions 
varied randomly, random series (R). 
The C series yielded superior performance
on the training trials. These
results are open to a number of interpretations.
Underwood (1952) posited
that the responses to stimuli defining 
a concept must be contiguous for the 
concept to be learned. Thus, tempo- 
rally contiguous presentation of attri- 
butes of a concept should maximize the 
likelihood of eliciting contiguous re- 
sponses and facilitate acquisition of the 
concept. The results appear to be consistent
with this interpretation. However,
Anderson and Guthrie (1966)
suggested that these results might have 
been produced by an interaction be- 
tween the type of series and the pro- 
portions of irrelevant cues varying 
from trial to trial. Following Detam- 
bel and Stolurow (1956), Anderson 
and Guthrie assumed that concept identification
would be facilitated by (a)
maintaining the relevant cues at a con- 
stant level while varying the irrelevant 
cues and by (b) varying the relevant 
cues while holding constant the irrele- 
vant dimensions. Thus, they postu- 
lated that the variation of irrelevant 
cues would aid performance in a C
series but hinder it in an R series.
To test their predictions, they varied the
levels of either one or three dimensions 
within a C series, the level of the rele- 
vant dimension remaining the same, or 
in a series alternating the two levels 
of the relevant dimension on successive 
trials. They considered the alternating 
series (A) to be a special case of the 
R or mixed series. Six trials were 
given for each of the training combinations,
identified as C1, C3, A1, and
A3, respectively. A fifth training
series, R, allowed the levels of both
relevant and irrelevant cues to vary
randomly. They predicted that the C3
series would be superior to the C1
series whereas the A1 series would be
superior to the A3 series; these predictions
were supported by the mean
number of errors on the six training
trials. The series ranked, in order of
increasing difficulty, A1, A3, C3, C1,
and R, providing additional evidence 
that concept identification was influ- 
enced by cue changes on successive 
trials when the total number of rele- 
vant and irrelevant dimensions was 
held constant. However, the change 
in the ranking of the mean errors on 
the test series (C3, A1, C1, A3, and 
R, in terms of increasing numbers of 
errors) raised the question of what 
was learned during the training trials. 

The C and A series permitted the 
learning of response strategies (bi- 
ases), such as repeating the same con- 
cept name or alternating the names of 
the concepts, as well as the association 
of stimulus characteristics with the con- 
cept names. The R series could foster 
association of the stimulus character- 
istics with the concept names, although 
six trials may have been insufficient to 
allow the learning of these associations 
even though six trials provided logi- 
cally sufficient information. Thus, a 
third interpretation of the superiority 
on the training trials of a C series rela- 
tive to an R series was that successful 


performance on the training trials of 
a C series could be achieved by appli- 
cation of a single response bias whereas 
successful performance on the training 
trials of an R series required the more 
difficult discriminative association of 
stimuli and responses. If Ss learned 
response strategies during the training 
trials, their performance on an R series 
should be poorer than if they learned 
associative relationships during the 
training trials. 

Previous experiments have not ex- 
amined this interpretation. The pres- 
ent experiment was designed to study 
the possibility that learning associative 
relationships during the training series 
would aid test-trial performance and 
learning a response strategy would hinder
test-trial performance. Three
training series (C, A, and R) preceded
an R test series. Either one or three 
dimensions varied on successive trials 
of the training series, and the groups 
were given six training trials or else 
run to a training criterion before intro- 
duction of the test series. 

The following predictions were 
made: (a) There would be no differ- 
ence in the mean number of errors for 
the groups on the first six training 
trials. (b) The R series would pro- 
duce more errors and require more 
trials to attain the training criterion 
than the two other series. The C series 
would be expected to produce the few- 
est errors over all trials to the training 
criterion if it is assumed that the re- 
sponse strategy of saying the same 
concept name is easier than the strategy 
of alternating concept names. Fur- 
thermore, the relative ease of executing 
a response strategy on the C series 
would provide the opportunity to ob- 
serve associative relationships that 
might aid subsequent test-trial per- 
formance. The alternation strategy of 
the A series would be more difficult, 
so Ss would be less likely to notice the
associative relationships. (c) The response
strategies of repeating the same
concept name or alternating concept 
names should be determinable from the 
first three trials of the test series. 
More response strategies would be ex- 
pected with the A series than with the 
C series. The incidence of both strategies
on the R series would provide an
empirical estimate of the chance levels. 
(d) More test-series errors were ex- 
pected following a six-trial training 
series than following a criterion-train- 
ing series. (e) When the groups had 
been run to a criterion during the 
training series, the R groups should 
make the fewest errors, the A series 
the most, and the C series an intermediate
amount, assuming that differences
in the opportunity to observe associative
relationships did exist during the training
trials.


METHOD 


Design and materials—Stimulus materials 
for the training trials were prepared to show 
the three series (C, A, and R) with either 
one or three dimensions changing on suc- 
cessive trials. A stimulus trial presented 
four letters; the second from the left was 
always the relevant position. Letters X and 
F were the two levels of the second position. 
The other positions were tri-leveled: J, V, or 
G were used in the first position, Q, W, or 
K in the third position, and B, P, or Z in 
the fourth position. The concepts were 
labeled “1” or “2.” Fifty-four training trials 
were constructed to conform to the series and 
the number of positions varying. The type 
of series governed the characteristics of the 
relevant dimension: For the C series the 
relevant cue remained the same on all train- 
ing trials, for the A series the two relevant 
cues alternated on successive training trials, 
and for the R series the changing of the 
relevant cues was determined randomly 
within the restrictions given below. The 
number of irrelevant dimensions that changed 
levels on successive trials was determined 
by the total number of dimensions changing 
(1 or 3) and by the treatment of the rele- 
vant dimension dictated by the type of 
series. For the C1 series exactly one irrele- 
vant dimension changed levels on each trial 


and for the C3 series three irrelevant dimensions
changed. For the A1 series no
irrelevant dimensions changed levels (because
the relevant dimension did change) and
for the A3 series exactly two irrelevant dimensions
changed levels. The levels assumed
by the irrelevant dimensions were determined 
randomly. The one dimension that changed 
from trial to trial on the R1 series was 
chosen randomly with the restriction that 
over each block of 12 trials each dimension 
changed equally often. The level assumed 
by the changing dimension also was deter- 
mined randomly. The three dimensions that 
changed for the R3 series were chosen randomly
with the same restrictions.
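
The construction of the C and A training series described above can be sketched as follows. This is an illustrative Python sketch under stated assumptions: the function names, the seeding, and the random choice of the first stimulus are hypothetical, and the R series is omitted because its balancing restriction over blocks of 12 trials is not spelled out in enough detail here to reproduce.

```python
import random

RELEVANT = ("X", "F")  # two levels of the relevant (second) position
IRRELEVANT = {         # three levels for each irrelevant position (0-based index)
    0: ("J", "V", "G"),
    2: ("Q", "W", "K"),
    3: ("B", "P", "Z"),
}


def next_stimulus(prev: str, series: str, n_changing: int, rng: random.Random) -> str:
    """Return the stimulus that follows `prev` in a C or A series."""
    stim = list(prev)
    if series == "A":
        # The relevant cue alternates, so it uses up one of the changing dimensions.
        stim[1] = "F" if prev[1] == "X" else "X"
        n_irrelevant = n_changing - 1
    elif series == "C":
        # The relevant cue stays fixed; every change falls on an irrelevant dimension.
        n_irrelevant = n_changing
    else:
        raise ValueError("only the C and A series are sketched here")
    for pos in rng.sample(sorted(IRRELEVANT), n_irrelevant):
        # Pick a new level at random, excluding the level shown on the previous trial.
        stim[pos] = rng.choice([x for x in IRRELEVANT[pos] if x != prev[pos]])
    return "".join(stim)


def make_series(series: str, n_changing: int, n_trials: int = 54, seed: int = 0) -> list:
    """Build a list of four-letter stimuli for the C1, C3, A1, or A3 series."""
    rng = random.Random(seed)
    first = [
        rng.choice(IRRELEVANT[0]),
        rng.choice(RELEVANT),
        rng.choice(IRRELEVANT[2]),
        rng.choice(IRRELEVANT[3]),
    ]
    trials = ["".join(first)]
    while len(trials) < n_trials:
        trials.append(next_stimulus(trials[-1], series, n_changing, rng))
    return trials
```

For example, `make_series("A", 3)` changes the second letter on every trial together with exactly two of the other three letters, whereas `make_series("C", 1)` holds the second letter fixed and changes exactly one other letter.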

A 54-trial test series was established using 
the same relevant cue and concept name 
assignments as the training series; i.e., X
or F were the relevant cues that appeared 
in the second position. The variation of 
dimensions and of the particular level 
assumed by a dimension was determined 
randomly, subject to the restriction that 
each dimension change equally often over 
12 trials and that at least one dimension 
change. The result of the randomization 
was that the number of dimensions varying 
on successive trials ranged from 1 to 4. The same
test series was used for all training groups. 
The series were typed in capital letters on 
memory-drum tapes. 

A practice set of cards was prepared 
using three-digit numbers as the stimuli, “cat” 
and “dog” as the responses. The number in 
the first position was the relevant cue. 

Subjects—The Ss, 253 students from in- 
troductory courses at Indiana University, 
participated to fulfill a course requirement. 
They were assigned randomly to 10 replica- 
tions of the 24-cell design provided by the 
three types of series, the two levels of di- 
mensions changing, the number of training 
trials, and the sex of S. The protocols of
13 Ss, divided almost evenly among the cri- 
terion groups, were discarded because they 
did not meet the criterion of learning within 
54 training trials. An S who failed to meet 
the learning criterion was replaced by the 
next S. 
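
The subject counts follow from the factorial design. As a brief check in Python (the labels for the factor levels are illustrative):

```python
from itertools import product

# The four factors of the design, as listed in the text.
series = ("C", "A", "R")
dimensions_changing = (1, 3)
training = ("six-trial", "criterion")
sex = ("male", "female")

cells = list(product(series, dimensions_changing, training, sex))
n_cells = len(cells)

replications = 10
n_retained = n_cells * replications   # Ss whose protocols were kept
n_tested = 253                        # Ss reported in the text
n_discarded = n_tested - n_retained   # Ss who failed the learning criterion

print(n_cells, n_retained, n_discarded)
```

The 24 cells with 10 replications account for 240 Ss, which matches the 253 Ss tested less the 13 discarded protocols.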

Procedure and apparatus—The following 
instructions were read to S: 


In this experiment your task is to learn 
two concepts. You will see some fig- 
ures on a card and you are to decide 
whether the concept called “Cat” is being 
shown or the concept called “Dog” is 
being shown. At first you will be
guessing, but soon you will learn the
characteristics that belong to Cat and 
the characteristics that belong to Dog. 
To illustrate, I’ll show you the first 
figure. Do you think the name of this 
figure is Cat or Dog? Just guess. 
Next you will see the correct name. 
Here is another figure. Is this one 
Cat or Dog? We'll go on until you 
learn the concepts. The concepts are 
very simple—don’t try to look for com- 
plicated patterns. 


The practice cards were shown until S cor- 
rectly identified the two concepts three times. 
He was asked, “What went with Cat? 
What went with Dog? And did you notice 
that the numbers in the second and third 
positions didn’t matter?” If S indicated any
doubt about the irrelevancy of the numbers 
in the second and third positions, the cards 
were displayed, simultaneously, for his examination.
The training trials were then
presented using a memory drum set at a
4-sec. rate after he had been instructed:


Now let’s begin on the main part of the 
experiment. The figures will appear in 
the window for only a short period of 
time, and you are to call out the name 
before the correct name is shown. In 
the main part of the experiment, you will 
use easier names than Cat or Dog. 
You will use 1 and 2. A concept is 
named “1,” or it is named “2.” Re- 
member, the patterns are always simple 
ones. We will continue until you have 
seen quite a few figures. Remember, 
you say either 1 or 2 out loud as soon 
as you see the figures. The drum will 
be revolving at a steady rate, and you 
should give your answer as soon as you 
can. Do you have any questions? As
soon as the first figure appears, start 
responding. 


The training trials were presented for six 
trials if S had been assigned to the six-trial 
group, or until S correctly anticipated the 
concept on 12 consecutive trials if he had 
been assigned to the criterion group. If S 
did not reach the criterion of learning in 
54 training trials, he was dismissed. The 
entire 54-trial test series was presented to 
each S immediately after he completed the
training stage with the instructions: “Now 
we'll begin on this side of the paper. The 
relationships you have learned will continue 


to be the same. Do you have any ques- 
tions?” 
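
The training criterion (12 consecutive correct anticipations within 54 trials) can be expressed as a short Python sketch. The function name and the convention of returning the trial on which the criterion run was completed are assumptions made for illustration; the source does not say how "trials to the criterion" was scored.

```python
def trials_to_criterion(correct, run_length=12, max_trials=54):
    """Return the 1-based trial on which the criterion run was completed.

    `correct` is a sequence of booleans, one per training trial.
    Returns None if S did not reach the criterion within `max_trials`.
    """
    run = 0
    for trial, is_correct in enumerate(correct[:max_trials], start=1):
        # An error resets the count of consecutive correct anticipations.
        run = run + 1 if is_correct else 0
        if run == run_length:
            return trial
    return None


# Hypothetical protocols: three errors followed by an unbroken correct run,
# and an S who alternates correct and incorrect responses throughout.
learner = [False] * 3 + [True] * 12
non_learner = [True, False] * 27
```

An S for whom the function returns None corresponds to one who was dismissed and replaced.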


RESULTS 


The main effects of the levels of di- 
mensions changing on successive trials 
(1 or 3), sex of S, and of the interac- 
tions involving these two variables 
were not significant sources of variance 
in any of the following analyses and 
will not be discussed further. 

Training trials —The mean number 
of errors on the first six training trials 
were compared for all groups in the 
experiment. The C groups made fewer 
errors than the A groups and the R 
groups made the most errors (Column 
1, Table 1). Although the differences 
between the means for the six groups 
on the first six training trials were not 
statistically significant, the ordering of 
the criterion groups remained the same 
throughout the training trials. The 
mean number of errors in all blocks of 
six training trials and the mean number
of trials to the criterion also appear
in Table 1. An analysis of variance
was conducted over the first four
blocks of six trials shown in Table 1. 
The mean number of errors declined 
over trials, F (3, 324) = 65.12, p < .01,
and the overall rate of decline differed
for the three series as shown by
the Group X Trial Block interaction, 
F (6, 324) =2.90, p<.01. The R 
group made more errors for more trials 
than the other groups. The differences 
in the mean number of errors among 
the groups were assessed by applica- 
tion of the Newman-Keuls procedure. 
The mean errors made by the R group 
reliably exceeded the means of both the 
C and the A groups for the second and 
fourth trial blocks and also exceeded 
the mean of the C group for the third 
trial block. The number of trials to 
the criterion was analyzed for the cri- 
terion groups (last column, Table 1). 
The C group reached the criterion in 
the fewest number of trials, the A 
group next, and the R group the most.

TABLE 1
MEAN ERRORS AND TRIALS TO CRITERION ON BLOCKS OF SIX TRAINING TRIALS

                        Blocks of Six Training Trials                    Trials to
Groups         1-6     7-12    13-18   19-24   25-30   31-36   37-42     Criterion

Criterion
Constant       1.61    .14     .07     .04                                4.39
               1.0862  .4359   .2646   .2000                              4.3151
Alternating    1.79    .47     .21     .11     .14                        6.93
               1.3711  .7810   .6782   .4123   .5831                      7.1652
Random         1.98    .85     .44     .47     .40     .20     .14        9.71
               1.3038  1.3077  .7000   .9055   .5831   .5657   .5831     11.8567

Six Trial
Constant       1.65
               1.5376
Alternating    1.80
               1.8248
Random         1.82
               1.3928


Note.—The entries carried to two decimal places are the means and the entries carried to four decimal places
are the standard deviations for the means immediately above.


The results of the training series 
supported the first two predictions: 
(a) There would be no reliable differ- 
ences between the means for the first 
six training trials, and (b) the order- 
ing of both the mean number of errors 
and the mean number of trials to reach 
the training criterion would be C, A, 
and R, in order of increasing magni- 
tude of the scores. It should be noted 
that the ordering on the first six trials 
corresponded to the ordering shown 
by the groups throughout the entire 
training series.

Response strategies—The response
strategies of always saying the same 
concept name (C bias) or of alternat- 
ing the concept names (A bias) were 
determined from the responses on the 
first three test trials. On these test 
trials the sequence of relevant stimuli 
and their associated concept names 
changed on Trial 2 and remained the 
same on Trial 3 (e.g., the correct se- 
quence of concept names on the first 
three test trials was 2,1,1). Two test 
trials were sufficient to identify a C 
bias: If the correct concept name, “2,”
were given on Trial 1, the same con- 


cept name, “1,” should be given on 
Trial 2. If the incorrect concept name 
were given on Trial 1, the other con- 
cept name, “2,” would be given on 
Trial 2. Either concept name could 
be expected on Trial 3 because S’s re- 
sponse had been wrong on one of the 
first two test trials, but only two pat- 
terns were observed: 1, 2, 2 and 2, 2, 
1. Three test trials were necessary to 
differentiate an alternating bias from 
responding based on knowledge of the
associative relationships. An alter- 
nating bias would show the following 
sequence: If the correct concept name, 
“2,” were given on Trial 1, “1” should 
be given on Trial 2, and “2” on Trial 
3. If the incorrect concept name, “1,” 
were given on Trial 1, the same con- 
cept name, “1,” should be given on 
Trial 2, and “2” on Trial 3. If Ss 
were responding on the basis of the 
associative relationships, they would 
show the same sequence of responses 
on the first two test trials as Ss re- 
sponding on the basis of an alternating 
bias, but the response on Trial 3 would 
be the same as the response on Trial 
2, regardless of the correctness of the
Trial 1 response. The alternating bias
response on Trial 3 would differ from 
that of Trial 2, also regardless of the 
correctness of the Trial 1 response. 
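
The scoring rules above can be summarized in a short Python sketch. It is one reading of the rules, chosen to agree with the two C-bias patterns reported (1, 2, 2 and 2, 2, 1) and with the description of the alternating and associative patterns; the function name is hypothetical.

```python
# Correct concept names on the first three test trials, as given in the text.
CORRECT = ("2", "1", "1")


def classify(responses):
    """Classify one S's responses on the first three test trials.

    `responses` is a sequence of three concept names ("1" or "2").
    """
    _first, second, third = responses
    if second == CORRECT[0]:
        # S repeats the name that was correct on Trial 1: a C bias,
        # identifiable from the first two trials alone.
        return "C bias"
    if third == second:
        # S changes on Trial 2 and stays on Trial 3: associative responding.
        return "associative"
    # S changes on Trial 2 and changes again on Trial 3: an A bias.
    return "A bias"


patterns = {
    ("2", "2", "1"): classify(("2", "2", "1")),
    ("1", "2", "2"): classify(("1", "2", "2")),
    ("2", "1", "2"): classify(("2", "1", "2")),
    ("1", "1", "2"): classify(("1", "1", "2")),
    ("2", "1", "1"): classify(("2", "1", "1")),
    ("1", "1", "1"): classify(("1", "1", "1")),
}
```

Under this reading the Trial 1 response does not affect the classification, which agrees with the statement that the Trial 3 patterns hold regardless of the correctness of the Trial 1 response.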

The number of Ss who used each 
pattern of responding on the first three 
test trials is given in Table 2 for the 
C, A, and R groups with either six 
training trials or runs to the training 
criterion. When the groups had six 
training trials, the C bias and associa- 
tive-relationship responding occurred 
about equally often for the three groups 
(χ²s not significant) but the A bias
was used more frequently by the A
group than by either of the other
groups, χ²(2) = 9.00, p < .02, and
exceeded the use of the other response
biases by the A group. When the 
groups had been run to a criterion dur- 
ing the training trials, associative-rela- 
tionship responding increased for all 
groups, but the same pattern was ob- 
served: The use of the C bias did not 
differ among groups, χ²(2) = 3.80,
p > .05, but the A groups used the A
bias more often (26% of the time) 
than did the other groups (0% of the 
time for both groups), as predicted. 
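
The text does not say how the chi-square values were computed. One plausible reading, offered here only as a sketch, is a one-way test of the Table 2 counts for the three groups against equal expected frequencies; with that assumption the two printed values are reproduced. The function name `chi_square_equal` and the variable names are illustrative.

```python
def chi_square_equal(counts):
    """One-way chi-square against equal expected frequencies (df = len(counts) - 1)."""
    expected = sum(counts) / len(counts)
    return sum((observed - expected) ** 2 / expected for observed in counts)


# Number of Ss per group (C, A, R) showing each bias, summed from Table 2.
a_bias_six_trial = [3, 12, 3]   # patterns 1,1,2 and 2,1,2, six training trials
c_bias_criterion = [6, 1, 3]    # patterns 1,2,2 and 2,2,1, run to criterion

print(round(chi_square_equal(a_bias_six_trial), 2))
print(round(chi_square_equal(c_bias_criterion), 2))
```

That the results match the reported 9.00 and 3.80 supports this reading, but it remains an inference from the numbers rather than a statement in the source.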
The use of both C and A biases tended
to decline when the groups had been
run to a criterion during the training
trials. When the incidence of the
response biases shown by the R groups
was used as an empirical estimate of
the chance level, the A bias was used
more often than chance by the A
groups but the C bias was not used
more often than chance by the C
groups.

Test trials—The mean number of
errors was computed for each group
for nine six-trial blocks of test trials.
The groups with only six training trials
made more errors on the test trials
(X̄ = 1.05) than the groups run to a
criterion during training (X̄ = .46),
F (1, 216) = 14.74, p < .01, suggesting
that Ss had not completely mastered
the concepts within six training
trials, although the performance was
generally good. The mean test-trial
errors increased from the R (.49) to
the C (.75) to the A groups (1.02),
F (2, 216) = 3.93, p < .05, and the
errors decreased over the nine blocks
of trials, F (8, 1638) = 15.53, p < .01.
Both the type of training and the
groups interacted with the trial blocks,
as shown in Table 3. When six training
trials had been given, the A groups
made reliably more errors than the
other groups (as evaluated by the
Newman-Keuls method) for all trial
blocks except the ninth. The C groups
made reliably fewer errors than the R
TABLE 2
NUMBER OF Ss USING RESPONSE STRATEGIES ON THE FIRST THREE TEST TRIALS

                          Strategy and Response Patterns
                Constant Bias    Alternating Bias   Associative Responding
Groups          1,2,2   2,2,1     1,1,2   2,1,2        1,1,1   2,1,1
Criterion
  Constant        3       3         0       0            7      15
  Alternating     1       0         6       1            9      11
  Random          0       3         0       0            3      22
Six Trial
  Constant        4       4         3       0            3      14
  Alternating     4       2         8       4            5       5
  Random          8       1         3       0            6      10
TABLE 3
MEAN ERRORS AND TRIALS TO CRITERION ON BLOCKS OF SIX TEST TRIALS

                                Blocks of Six Test Trials                                Mean Trials
Groups          1-6    7-12   13-18   19-24   25-30   31-36   37-42   43-48   49-54    to Criterion
Criterion
  Constant      .98     .79     .85     .53     .55     .47     .46     .46     .39       13.36
             1.1662  1.2207  1.4629  1.0392  1.7776   .7746  1.6703  1.4422  1.2923     21.1866
  Alternating  2.15     .81     .56     .54     .38     .49     .46     .47     .40       12.82
             1.4663  1.3565  1.0724  1.3601   .7810  1.3267  1.1619  1.3748  1.0770     20.6461
  Random        .32     .11     .07     .04  [illegible] .07    .04     .04     .00        2.68
              .8485   .3162   .2646   .2000   .5568   .2646   .2000   .2000   .0000     10.1558
Six Trial
  Constant     1.25    1.00     .94     .80     .79     .67     .73     .60     .58       16.61
             1.2450  1.3379  1.8921  1.5492  1.5297  1.3820  1.0724  1.0344  1.0536     20.8806
  Alternating  2.10    1.76    1.56    1.52    1.49    1.58    1.39    1.15     .86       30.04
             1.2166  1.7436  1.1662  1.6062  1.3342  1.7321  1.7000  1.0863  1.1225     23.4367
  Random       1.72    1.35     .72     .86     .87     .75     .68     .62     .60       18.68
             1.4900  1.7720  1.2728  1.5033  1.1136  1.3784  1.3638   .9110  1.3675     21.3748

Note. The entries carried to two decimal places are the means and the
entries carried to four decimal places are the standard deviations for
the means immediately above.


groups on the first and second trial 
blocks; these groups made approxi- 
mately the same number of errors on 
the remainder of the trial blocks. 
When the groups had been run to a 
criterion during the training trials the 
R groups made reliably fewer errors 
than either of the other groups, with 
one exception. The difference between 
the means for the R and A groups on 
the fifth trial block, though in the ex- 
pected direction, was not statistically 
significant. The C group made fewer 
errors than the A group on the first 
trial block but the two groups did not 
differ on the rest of the test trials. 
The number of Ss who failed to meet 
a test-trial criterion of 12 consecutive 
correct responses within the 54 test 
trials was determined. When six 
training trials had been given the num- 
ber of Ss who failed to meet the cri- 
terion for the C, A, and R groups were 
7, 11, and 7, respectively. When the 
training trials had been run to a cri- 
terion comparable numbers were 4, 4, 
and 2. Finally, the mean number of 
trials required to meet a test-trial cri- 
terion was evaluated using analysis of 
variance. The test-trial criterion was 
the same as the training-trial criterion, 
12 consecutive correct concept choices. 


The Ss who did not reach the test-trial 
criterion were assigned the maximum 
number of test trials, 54, so the results 
of the analysis provided a conservative 
estimate of the differences. The groups
with six training trials required more
trials to reach the test-trial criterion
(X̄ = 20.58) than the groups which
were run to a criterion during the
training trials (X̄ = 9.62), F (1, 216)
= 13.82, p < .01, and the three groups
differed reliably, with the A groups
requiring the most trials (X̄ = 21.43),
the C groups the next (X̄ = 14.98),
and the R groups the fewest trials
(X̄ = 10.68), F (2, 216) = 8.70, p <
.01. The interaction between the training
conditions and the groups was also
reliable, F (2, 216) = 3.88, p < .05, as 
indicated in the last column of Table 
3. When the groups had six training 
trials, the mean trials to test-trial cri- 
terion was reliably higher for the A 
group than for the two other groups. 
The means for the C and R groups did 
not differ reliably. When the groups
had been run to a criterion during the
training trials, the R group required
reliably fewer trials to reach the test-trial
criterion than the C or A groups and 
the latter groups did not differ sig- 
nificantly. 
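
The marginal means quoted in this paragraph can be checked against the last column of Table 3. The sketch below assumes equal numbers of Ss in every cell, so that each marginal mean is the simple average of the relevant cell means; the names `cell_means` and `marginal` are illustrative and not taken from the source.

```python
# Mean trials to the test-trial criterion, last column of Table 3.
cell_means = {
    ("criterion", "C"): 13.36, ("criterion", "A"): 12.82, ("criterion", "R"): 2.68,
    ("six-trial", "C"): 16.61, ("six-trial", "A"): 30.04, ("six-trial", "R"): 18.68,
}


def marginal(index, level):
    """Average the cell means that share one level of one factor (equal n assumed)."""
    values = [mean for key, mean in cell_means.items() if key[index] == level]
    return sum(values) / len(values)


for group in ("A", "C", "R"):
    print(group, round(marginal(1, group), 2))
for training in ("criterion", "six-trial"):
    print(training, round(marginal(0, training), 2))
```

Computed this way, the A, C, and R marginals and the criterion marginal agree with the values in the text (21.43, 14.98, 10.68, and 9.62). The six-trial marginal comes out near 21.78 rather than the printed 20.58; the sketch does not resolve whether that difference reflects unequal cell sizes or a misprint.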




Comparison of Tables 1 and 3 indi- 
cated that the test series was more 
difficult than the training series. A 
rough estimate of the relative difficulty 
of the series was obtained by compar- 
ing the mean errors made to reach the 
training criterion by the criterion 
groups with the mean errors made to 
reach the test criterion by the six-trial 
groups. All of these differences were 
statistically significant, suggesting that 
the greater number of variations of the 
dimensions of the test series produced 
a more difficult task than any of the 
training series. The differences be- 
tween means for the total number of 
errors to the criterion were 3.59 for 
the R groups, 5.53 for the C groups, 
and 10.17 for the A groups. 


DISCUSSION 


The performance of the C groups on 
the training trials was superior to the 
performance of the other groups, although 
the differences were not reliable on the 
first six training trials alone. These re- 
sults supported the first two predictions 
and were consistent with previous reports 
by Kurtz and Hovland (1956) and by 
Peterson (1962). Furthermore, the 
groups with the three types of series per- 
formed differently on the test series, as 
predicted by the fourth and fifth predic- 
tions. On the test trials, following 
acquisition of the training criterion, the 
R group made fewer errors than the 
other groups; following six training 
trials, the A group made more errors 
than the other groups. 

These results could not be explained by 
variations in the number of dimensions 
changing levels on successive trials be- 
cause this variable did not differentially 
affect performance during either training 
or the test series. The results contra- 
dicted those of Anderson and Guthrie 
(1966) who reported an interaction be- 
tween the type of series and the number 
of dimensions changing levels. The most 
plausible reason for the discrepancy was 
the difference in the total number of
dimensions used in the concept-identification
problems; Anderson and Guthrie
used five while in the present experi- 
ment four were used. Presumably, the 
more dimensions, the greater the dif- 
ficulty in remembering past instances and 
their concept assignments. When de- 
mands on memory increased, Ss may 
have been more likely to concentrate on 
a few dimensions and were more sensi- 
tive to changes in these dimensions than 
they were when they could remember 
most of the dimension instances and con- 
cept assignments for a number of trials. 
Trabasso and Bower (1964) had their 
Ss recall the dimension instances and the 
concept assignments of problems with 
six dimensions. The recall for six trials 
back was only slightly better than 
chance, whereas when L. R. Peterson
and L. H. James² used three-dimension
problems, Ss were able to recall the di- 
mension instances and the concept assign- 
ments for six trials about 80% of the 
time if the concept assignments were un- 
ambiguous, as was the case in the present 
experiment. 

The superiority of the C group during 
the training trials appeared to support 
Underwood’s (1952) suggestion that the 
basic requisite for the perception and
acquisition of the relationships repre- 
sented by stimuli is the contiguity of 
the responses made to the stimuli. If the 
concept to be learned was a response bias, 
then the C series, which provided for 
maximum contiguity of the responses, 
was the most effective; however, the C 
series clearly did not produce the most 
efficient learning of the associative rela- 
tionships tested during the test trials. 
Hence, the use of a C series during 
training did not guarantee acquisition of 
the associative relationships. 

The factor related to the differences 
in performance appeared to be the oppor- 
tunity to learn a response strategy during 
the training trials afforded by the series. 
Application of a response strategy to the 
test trials would hinder learning the asso- 
ciative relationships between the stimulus 
dimensions and the concept names. The 


2 Personal communication, 1967.

criterion of learning on the training trials
could have been achieved using a re- 
sponse strategy by both the C and A 
groups but not by the R group. Pre- 
sumably, the R group learned the asso- 
ciative relationships during the training 
trials and transferred them to the test 
trials. It was assumed that the easier 
the response strategy, the more likely it 
was that associative relationships would 
be noticed during the training series. 
Thus, if the constant strategy were 
easier than the alternating bias, the C 
group should have learned more on the 
training trials and transferred more to 
the test trials than the A groups, but 
less than the R groups. The results of 
the first six test trials supported this 
prediction when the training trials had 
been run to a criterion. 

No prediction was made about the 
effects of six training trials because no 
estimate of the amount learned about the 
associative relationships in six trials 
was available prior to experimentation. 
The A group with six training trials 
showed the greatest evidence of an alter- 
nating response strategy on the first three 
test trials (43% of the Ss) and made 
the most errors on the entire test series. 
The C group with six training trials did 
not evince a reliable response bias and 
made reliably fewer errors on the test 
series. Apparently, the alternating re- 
sponse bias developed within six trials 
and interfered with successful test-trial 
performance to a greater extent than 
other response biases. The incidence of 
response biases declined for both groups 
when the training trials were run to a 
criterion. 

The R, A, and C series had different 
effects on training and test performance 
which could not be attributed solely to the 
opportunity to make contiguous responses 
to stimuli representing the same concept
nor to the number of dimensions varying
levels on successive trials. A plausible 
explanation appeared to be that the differ- 
ent series offered different opportunities to 
learn the associative relationships between 
the stimulus cues and the concept labels.
The results indicated that the three types 
of training series, C, A, and R, produced 
sufficiently different performance to war- 
rant separate consideration of the three 
series when sequential effects are being 
studied.


This study investigated the relationship of mediated priming of associative
responses to commonness of response meaning, stimulus familiarity,
and response familiarity. The results indicated that such priming
can be used successfully to increase the frequency of responses representing
alternate meanings of homographic stimuli. The priming effect
was found to be inversely related to commonness of response meaning,
but directly related to response familiarity. Several interaction effects
were noted. A supplementary study suggested that priming may be a
highly specific effect, focused on one particular word rather than on a
domain of semantically related responses.


Previous investigations have demon- 
strated that individual associative re- 
sponses (Rs) to noun and adjective 
stimuli can be primed. That is, it is 
possible to increase the frequency of 
occurrence of a desired associative R 
by prior presentation of that R word 
itself, direct priming (Segal, 1967b; 
Storms, 1958). It has also been dem- 
onstrated that successful priming can 
be accomplished when words which are 
associatively related to the desired R, 
but not that R itself, are presented just 
prior to the association test, mediated
priming (Coleman, 1964; Cramer, 
1964, 1966). This work recently has 
been reviewed by Cofer (1967). 

It has been characteristic of the design
of these studies that E has attempted
to prime only one R to each
stimulus. It might therefore be argued 
that the occurrence of priming has 
been a function of the particular sam- 
ple of Rs which were selected for 
study, rather than being a general prin- 
ciple of associative behavior. That is, 
it might be argued that the associative 
connections between these Rs and their 

1 This paper was presented in part at the
Eastern Psychological Association meetings,
1967. Appreciation is expressed to
Geoffrey Keppel for his critical reading
of the present manuscript.

stimuli were especially amenable to
priming, but that this effect was not 
generally representative of associative 
functioning. 

Recently, Clifton (1966) was suc- 
cessful in using direct priming to in- 
crease the associative frequency both 
of different Rs to the same stimulus 
and of the same R to different stimuli. 
The present study was designed to in- 
vestigate the effect of mediated priming 
of alternate Rs to the same stimulus 
where, in addition, the alternate Rs 
represent distinctly different meanings 
of the stimulus word. To accomplish 
this, polysemous homographs (words
which are spelled the same way but
have two or more distinct meanings)
were used as stimuli on a word-association
test.2 By varying the verbal
context immediately preceding each
homographic stimulus, E attempted to
increase the probability of occurrence 
of an R word previously selected to 
represent one or the other meaning of 
the stimulus. A secondary purpose of 
this study was to determine the influ- 
ence of stimulus and R familiarity on 


this type of priming. 


2 It should be noted that all but two of the 
stimuli were also homophones. 




EXPERIMENT I 
Method 


Experimental design—Six independent 
groups of Ss were tested individually. Half 
of these were primed for an R representing 
the more common meaning of the homo- 
graphic stimulus and half were primed for 
an R representing the less common meaning 
of that stimulus. The groups also varied 
in the familiarity of the stimulus word:
one third of the Ss were given stimuli of
Thorndike-Lorge (T-L) frequency of 1-49
per million, one third were given A stimuli,
and one third were given AA stimuli.
Within each group, half of
the primed-for Rs had a T-L frequency of 
1-49 per million, and half were of A and 
AA frequency. 

The experimental design is thus a 2 × 3 × 2
factorial design with two types of priming
(R_C: more common meaning and R_L: less
common meaning), three levels of stimulus
familiarity (S_L, S_A, and S_AA), and two levels
of R familiarity (low: 1-49 per million and
high: A-AA). (The latter variable is a
repeated measure within experimental
condition.)

Materials and apparatus—Six lists (10 
cue stimuli each) were constructed. The 
lists differed in terms of the familiarity of 
the cue stimulus and the commonness of the 
R meaning which was being primed. In 
each case, the priming words were selected 
on the basis of knowledge that when they 
were presented as word association test 
stimuli, they elicited the particular R word 
being primed as an associative R. For 
example, the homograph TAP was preceded
by ANGER and FIGHT when the R word hit
was being primed, and by WHISKY and ALE
when the R word beer was being primed.
ANGER and FIGHT, when presented as stimuli,
both elicit hit as an R, while WHISKY and ALE
both elicit beer. Several sources of normative
data (e.g., Bousfield, Cohen, Whitmarsh,
& Kincaid, 1961; Palermo & Jenkins, 1964)
were used to locate these priming words.
The desired Rs of any one list never ap- 
peared among the priming words or cue 
stimuli of that list. That is, the primed-for 
Rs were never presented overtly, in order 
to rule out the possibility of direct priming. 
Three practice sets were presented at the 
beginning of each list, to acquaint Ss with 
the general procedure. 


3 Commonness of R meaning was determined
from the Thorndike-Barnhart dictionary,
which lists meanings according to their
frequency of occurrence.


PHEBE CRAMER 


Procedure—All Ss were given a brief ex- 
planation of the standard word-association 
test procedure, and were further instructed 
that in the current experiment, prior to the 
(critical) cue stimulus, they would be pre- 
sented with two additional words followed by 
a row of asterisks (which separated the 
priming words from the cue stimulus). It 
was stressed to Ss that although they were 
to say the words aloud, they were to give an 
(oral) association to the last (cue stimulus) 
word only. 

The stimuli were presented visually via a 
Lafayette memory drum, at a 2-sec. rate with 
a 4-sec. R interval following the cue stimulus. 
The lists were constructed such that, within 
each level of stimulus familiarity, all 40 Ss 
were given the same 10 cue stimuli in one of 
two orders. However, half of the Ss were 
primed for one R meaning, and half were 
primed for the alternate R meaning. 

Subjects—Each of 120 Barnard College 
female students, naive to verbal behavior 
experiments, was tested individually. Within 
each level of stimulus familiarity, 4 Ss were 
randomly assigned to one of four lists, repre- 
senting the more and less common R mean- 
ing, with two orders of stimulus presentation. 
This procedure was followed subsequently 
for each of the other levels of stimulus fa- 
miliarity. The entire sequence was repeated 
until each of the six Stimulus Familiarity X 
Response Commonness cells had a total of
20 Ss. 
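
The assignment scheme just described can be sketched as follows. The sketch assumes that "the entire sequence" was repeated ten times, a number inferred from the totals (120 Ss, 20 per cell) rather than stated in the text; the function name `assign_subjects`, the level labels, and the seed are illustrative only.

```python
import random
from collections import Counter


def assign_subjects(n_cycles=10, seed=0):
    """Assign Ss in blocks of four to the four lists within each familiarity level."""
    rng = random.Random(seed)
    levels = ["1-49", "A", "AA"]
    # Four lists per level: two primed meanings by two orders of presentation.
    lists = [(meaning, order) for meaning in ("R_C", "R_L") for order in (1, 2)]
    assignments = []
    for _ in range(n_cycles):
        for level in levels:
            block = lists[:]
            rng.shuffle(block)  # random assignment of the next four Ss
            for meaning, order in block:
                assignments.append((level, meaning, order))
    return assignments


assignments = assign_subjects()
cell_sizes = Counter((level, meaning) for level, meaning, _ in assignments)
print(len(assignments), dict(cell_sizes))
```

With ten repetitions the scheme yields 120 Ss in all and 20 in each of the six Stimulus Familiarity × Response Commonness cells, consistent with the figures given above.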


Results 


The results of this experiment were 
assessed in two ways. To determine 
the effect of the three main variables 
(commonness of R meaning, stimulus 
familiarity, and R familiarity) on prim- 
ing, an analysis of variance of differ- 
ence scores was carried out. These 
scores were based on the Rs of 120 ex- 
perimental Ss in the different priming 
conditions, as compared to those of 
120 statistical Ss. These artificial Ss 
were created on the basis of the 
weighted probability of occurrence of 
the associative R in the normative 
data.4 That is, the associative Rs of
these statistical Ss were determined by
assigning a series of numbers to represent
every R word in each normative

4 The author is indebted to Geoffrey Keppel
for this helpful suggestion.

associative hierarchy. The extent of 
this series was based on the frequency 
of occurrence of that R in the norma- 
tive data. The R to each stimulus then 
was determined by reading from a 
table of random numbers and assigning 
as an R the word from the normative 
hierarchy with the corresponding num- 
ber. Using these normative-based Ss 
made it possible to determine the rela- 
tive influence of different variables on 
priming, as compared to the most rea- 
sonable estimate of the Rs which would 
be given in the absence of priming, 
i.e., in a standard word-association test.
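
The construction of a statistical S amounts to weighted random sampling from the normative hierarchy. A minimal sketch follows; the hierarchy for TAP and its frequencies are hypothetical values invented for illustration, and a pseudorandom generator stands in for the printed table of random numbers. The function names are likewise assumptions.

```python
import bisect
import random
from itertools import accumulate


def build_ranges(hierarchy):
    """Give each R word a run of numbers as long as its normative frequency."""
    words = list(hierarchy)
    upper_bounds = list(accumulate(hierarchy[word] for word in words))
    return words, upper_bounds


def draw_response(words, upper_bounds, number):
    """Return the word whose run of numbers contains `number` (counting from 1)."""
    return words[bisect.bisect_left(upper_bounds, number)]


def statistical_subject(hierarchy, rng):
    """Draw one R for one stimulus, weighted by normative frequency."""
    words, upper_bounds = build_ranges(hierarchy)
    return draw_response(words, upper_bounds, rng.randint(1, upper_bounds[-1]))


# Hypothetical normative hierarchy for the stimulus TAP (frequencies per 100 Ss).
tap_norms = {"water": 50, "hit": 13, "beer": 17, "dance": 20}
words, upper_bounds = build_ranges(tap_norms)

rng = random.Random(1)
sample = [statistical_subject(tap_norms, rng) for _ in range(20)]
print(sample)
```

In this sketch the numbers 1-50 stand for "water", 51-63 for "hit", and so on, so each word is drawn in proportion to its assumed normative frequency.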

In the second analysis of the data, 
the frequency of primed-for Rs occur- 
ring to each stimulus was compared 
to the normative frequency of such Rs. 
This second analysis makes it possi- 
ble to determine the direction(s) of 
the priming effect. That is, from the 
present experiment it is possible to de- 
termine both the influence of priming 
on the frequency of the primed-for R 
as well as its influence on the non- 
primed R. According to the hypothe- 
sis of this study, priming should in- 
crease the frequency of the primed-for 
R and decrease the frequency of the 
R from the other meaning domain. It 
should be pointed out, however, that
this result is not a logical necessity,
i.e., that an increase in the frequency of
R_C does not necessarily imply a decrease
in the frequency of R_L. Because
of the relatively low R frequencies
used it would be quite possible for
R_C priming to increase the frequency
of R_C without changing the R_L frequency.
Furthermore, it would be possible
for both the R_C and R_L frequencies
to be increased, or for both to be
decreased. This second comparison
with normative data, then, makes it
possible to determine the relative R
facilitation and/or R suppression effects
of priming.

Relationship of main variables to 
priming effect—In this analysis, the 
number of primed-for Rs given by each 
experimental S was compared with the 
number given by 120 statistical Ss. 
Within each of the six experimental 
conditions, experimental Ss were 
paired arbitrarily with statistical Ss. 
A summary of the mean number of 
primed-for Rs given by real Ss and by 
statistical Ss is given in Table 1. (The 
maximum possible score in any cell is 
five.) Difference scores were obtained 
by subtracting the number of primed- 
for Rs occurring for statistical S from 
the number of primed-for Rs given by 
matched experimental S. 

A three-way analysis of variance
performed on these difference scores
found significant main effects for Commonness
of R Meaning, F (1, 114) =
12.22, p < .001, and for R Familiarity
(T-L frequency), F (1, 114) = 69.08,
p < .001. The interaction Commonness
of R Meaning × R Familiarity
was also significant, F (1, 114) =
41.49, p < .001. When Duncan’s
NMRT (Edwards, 1960) was used to
test the differences among the four
means based on Commonness of R
Meaning × R Familiarity, it was found
that priming had a significantly greater
effect (p < .001) in the R_L-high familiar
R condition than in any of the
other three conditions, which did not
differ from each other.


TABLE 1
MEAN NUMBER PRIMED-FOR Rs: REAL AND STATISTICAL Ss

                 More Common R Meaning (R_C)              Less Common R Meaning (R_L)
Stimulus         R: A-AA            R: 1-49               R: A-AA            R: 1-49
Familiarity   Real  Statistical  Real  Statistical     Real  Statistical  Real  Statistical
1-49          1.40    (.80)      1.00    (1.00)        2.00    (.30)       .90    (.15)
A             1.70  [illegible]  1.15    (.60)         1.75    (.30)       .50    (.40)
AA            1.65    (1.05)      .45    (.40)         1.75    (.40)       .50    (.20)

The main effect of S Familiarity was
not significant, but the interactions S
Familiarity × Commonness of R Meaning,
F (2, 114) = 5.35, p < .01, and
S Familiarity × R Familiarity, F (2,
114) = 7.85, p < .001, were significant.
When the six means of the S
Familiarity × Commonness of R Meaning
were tested by Duncan’s NMRT,
it was found that the S_AA-R_C condition
showed significantly less priming effect
than the S_L-R_L condition (p < .01).
It also showed less effect than the
S_A-R_C condition, the S_A-R_L condition,
and the S_AA-R_L condition (all p’s <
.005).

A test of the six S Familiarity × R
Familiarity means, using the NMRT,
indicated that the S_L-high R and S_A-high
R conditions showed significantly
greater (p < .001) priming effect than
any other condition, but did not differ
from each other. In addition, the S_AA-high
R condition showed significantly
more priming effect than either the
S_L-low R (p < .01) or the S_AA-low R
condition (p < .05).

Primed vs. normative frequency — 
In this second analysis of the data, the 
frequency of the primed-for R (e.g.,
R_C) is compared both with its normative
frequency and with its frequency
when the alternate R (e.g., R_L) is
primed. As was pointed out above, the
occurrence of R_C and R_L is not reciprocal,
and the direction of change of
both Rs, as compared to normative 
data, is of theoretical significance. 
This present analysis makes it possible 
to consider the priming effect in rela- 
tion to a (normative) base line, and 
thus to determine the direction of 
change which priming produces. 
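
The base-line comparison can be sketched with the percentages printed in Table 2. Here "facilitation" is taken as the primed frequency of an R minus its normative frequency, and "suppression" as its frequency when the alternate R is primed minus the normative frequency; these labels, the layout of `table2`, and the function name are assumptions made for illustration. R_C denotes the more common meaning and R_L the less common meaning.

```python
# Table 2 percentages as (normative, when this R is primed, when the alternate R is primed).
table2 = {
    "1-49": {"R_C": (17.4, 24.5, 11.0), "R_L": (13.1, 28.5, 4.0)},
    "A":    {"R_C": (10.8, 28.5, 4.0),  "R_L": (10.9, 20.5, 2.0)},
    "AA":   {"R_C": (15.7, 11.0, 5.5),  "R_L": (6.6, 22.9, 2.5)},
}


def priming_effects(entry):
    """Return (facilitation, suppression) relative to the normative base line."""
    normative, own_primed, other_primed = entry
    return own_primed - normative, other_primed - normative


exceptions = []
for familiarity, responses in table2.items():
    for response, entry in responses.items():
        facilitation, suppression = priming_effects(entry)
        print(familiarity, response, round(facilitation, 1), round(suppression, 1))
        # Predicted direction: facilitation above zero, suppression below zero.
        if facilitation <= 0 or suppression >= 0:
            exceptions.append((familiarity, response))

print(exceptions)
```

With the values as read from the scanned table, every suppression score is negative and every facilitation score is positive except for R_C with AA stimuli.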

The results of this analysis can be
seen in Table 2. For example, for
stimuli with low T-L frequencies
(1-49/million), it can be seen that the
average normative frequency of the R
representing the more common meaning
of these stimuli is 17.4%. When
that R_C was primed, the frequency was
increased to 24.5%. When R_L was
primed, the R_C frequency was decreased
to 11%. The results are even
more striking for the less common
meaning R. The average normative
frequency of these less common Rs is
13.1%. When this R meaning was
primed, the frequency was increased to
28.5%. When R_C was primed, the
frequency was decreased to 4%. With
but one exception (S_AA-prime R_C), the
effects of priming are in the direction
predicted.

5 On a prior occasion, with different female
Ss, data were gathered on these stimuli
under standard word-association conditions.


TABLE 2
MEAN PERCENTAGE OF RESPONSE FREQUENCY: PRIMED-FOR RESPONSE

                 More Common Meaning (R_C)              Less Common Meaning (R_L)
Stimulus
Familiarity   Normative  Prime R_C  Prime R_L        Normative  Prime R_C  Prime R_L
1-49            17.4       24.5       11.0             13.1        4.0       28.5
A               10.8       28.5        4.0             10.9        2.0       20.5
AA              15.7       11.0        5.5              6.6        2.5       22.9

Note. Pairs of entries joined by brackets in the original differed
significantly: * p < .05. ** p < .01. *** p < .001. (The brackets
themselves are not legible here.)

These results subsequently were
tested for significant differences. Based
on the hypothesis that priming should
increase the frequency of the primed-for
R and decrease the frequency of the
nonprimed R, one-tailed t tests were
used throughout. Considering first
those conditions in which the more
common meaning (R_C) was primed,
the only comparisons which were statistically
significant were for A stimuli.
Priming R_C significantly increased the
frequency of R_C as compared to the
normative frequency, t (18) = 1.95,
p < .05, and as compared to the R_C
frequency when R_L was primed, t (18)
= 2.72, p < .01. Furthermore, the
frequency of R_C when R_L was primed
was significantly less than the normative
frequency, t (18) = 2.26, p < .05.

When the less common meaning was
primed, the following comparisons
were significant: (a) for 1-49/million
stimuli, when R_L was primed, the R_L
frequency was significantly greater
than the normative R_L frequency, t
(18) = 1.93, p < .05; when R_C was
primed, the occurrence of R_L was significantly
less than the normative frequency,
t (18) = 1.90, p < .05; and
the R_L frequency was greater when R_L
was primed than when R_C was primed,
t (18) = 3.34, p < .001; (b) for A
stimuli, priming R_L significantly increased
the frequency of R_L as compared
to its frequency following R_C
priming, t (18) = 3.19, p < .01, and
priming R_C also significantly reduced
the occurrence of R_L as compared to
the normative data, t (18) = 2.40, p <
.05; (c) for AA stimuli, the frequency
of R_L was significantly greater when
R_L was primed than in the normative
data, t (18) = 2.02, p < .05, and than
when R_C was primed, t (18) = 2.54,
p < .05.

This study demonstrated that the 
technique of mediated priming can be 
used successfully to increase the fre- 
quency of associative Rs representing 
alternative meanings of polysemous 
homographic stimuli. The results also
show that priming is related differently 
to R meaning commonness than it is 
to frequency of R occurrence (T-L fa- 
miliarity). Priming is more effective 
in increasing the probability of Rs 
representing the less common R mean- 
ing, but also more effective in increas- 
ing the probability of highly familiar 
Rs. In addition, within this group of 
highly familiar Rs, the effect of prim- 
ing is inversely related to S familiarity. 

These results will be discussed more
fully in the discussion following Exp.
II, which was conducted to determine
the specificity of the priming
effect.


EXPERIMENT II


In all of the preceding discussion, 
only the occurrence of one specific R 
to each stimulus has counted as data 
supporting the priming hypothesis. 
However, in terms of priming an area 
of meaning, it is clear that there are 
several other possible associative Rs 
which could represent that semantic
domain. In some ways, it is remarkable
that it is possible to prime one
single word from the large repertoire
of semantically related Rs available.
It might be, e.g., that although the
specifically primed-for R in the R_C-AA
stimulus condition of Exp. I did not
occur, priming did cause other Rs from
the R_C meaning domain to occur more
often than would be expected from 
normative data. Casual inspection of 
the data suggested that this might be 
the case. 

Following this line of reasoning, it 
was decided to do an additional analy- 
sis of the data to determine the fre- 
quency of occurrence of Rs from the 
meaning domain being primed. Such 
Rs would include both the specifically 
primed-for R and any other Rs which 
represent that meaning of the stimulus 
word. In order to obtain objective 
scoring criteria for the latter Rs, a 
supplementary study was conducted. 


Method 


Subjects—The Ss were 61 female Univer- 
sity of California students, naive to verbal 
behavior experiments. They were tested in 
small groups of four or five, and were
randomly assigned to condition.

Materials—Four different booklets were 
prepared. Each booklet consisted of a series 
of 30 pages, on each of which was printed 
a stimulus word, a definition of the stimulus, 
and a list of R words. In the first booklet, 
these words consisted of the Rs given by 
Ss in Exp. I who were primed to give the 
more common meaning R. The R words 
for a second booklet consisted of the Rs of 
Ss in Exp. I who were primed to give the 
less common R. In the first booklet, the 
stimulus word was followed by a definition 
corresponding to its more common meaning, 
while in the second booklet the same stimulus 
was followed by a definition corresponding to 
the less common meaning. 

The remaining two booklets consisted of 
the same 30 stimuli with a third set of asso- 
ciative Rs. These Rs were those given by 
the original sample of normative Ss. However,
in one booklet the stimulus word was
followed by the more common definition, 
while in the other booklet the less common 
definition followed the stimulus. These two 
booklets were included in order to be able 
to assess the change in the distribution of 
associative Rs which could be attributed to 
the priming procedure. 

Procedure—Each S was given one of the 
four booklets, with instructions to look at 
each stimulus word at the top of the page, 
consider the definition provided, and then 
check all of the (R) words in the list below 
which seemed to be related to that meaning. 

The results of the Ss in each of the four
groups (Primed Rc-Rc definition, Primed
Rl-Rl definition, Normative R-Rc definition,
and Normative R-Rl definition)
were combined to make four master rating
booklets. Subsequently, any R which was
rated by more than 50% of the Ss as being
related to the particular meaning of the
stimulus provided was considered to be a
criterion R, i.e., to be an R belonging to the
Rc (or Rl) meaning domain. The criterion
of 50% was chosen as a cutoff point after 
an inspection of the rating results. It ap- 
peared that this value would allow for in- 
clusion of a sufficient number of R words 
to make a differentiating range of scores 
possible, if in fact such a differentiation 
existed. 
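The 50% cutoff rule described above can be illustrated with a minimal sketch. The response words, tallies, and number of raters below are invented for illustration and are not taken from the study; only the "more than 50% of the Ss" rule comes from the text.

```python
def criterion_responses(check_counts, n_raters, cutoff=0.5):
    """Return the responses checked by MORE than `cutoff` of the raters.

    check_counts maps each response word to the number of raters who
    checked it as related to the definition printed on the page.
    """
    return sorted(
        word
        for word, count in check_counts.items()
        if count / n_raters > cutoff
    )


# Hypothetical tallies for one stimulus page rated by 16 Ss.
checks = {"money": 14, "river": 3, "save": 9, "teller": 8}
print(criterion_responses(checks, n_raters=16))
```

Note that a response checked by exactly half of the raters (here "teller", 8 of 16) does not qualify, since the rule requires more than 50%.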


Results 


The total number of criterion Rs
from the Rc (or Rl) meaning domain
was determined for each stimulus word
in each of the four S groups, and these
figures were converted to percentages.
As can be seen from Table 3, priming
resulted in the elicitation of more Rs
from the primed meaning domain than
were elicited from this domain under
standard, normative conditions.


TABLE 3


MEAN PERCENTAGE FREQUENCY OF
RESPONSES FROM MEANING DOMAIN


                 More Common          Less Common
Stimulus         Meaning (Rc)         Meaning (Rl)
Familiarity      Norm    Prime Rc     Norm    Prime Rl

1-49             44.5    55.0         29.9    53.0
A                31.3    52.0         35.8    53.0
AA               31.2    43.5         29.7    41.0

Testing for differences between the 
primed and normative means indicated 
that priming significantly increased the 
occurrence of Rs representing the more 
common meaning only with A stimuli, 
t (18) = 1.82, p < .05. (Since the
hypothesis predicted a priming effect,
one-tailed tests were used throughout.)
For the less common meaning, priming
significantly increased the number of
Rs from the meaning domain both for
1-49 stimuli, t (18) = 2.10, p < .05,
and A stimuli, t (18) = 1.18, p < .05.

As in the preceding experiment, the 
number of Rs from the primed mean- 
ing domain given by experimental Ss 
was compared to the number of such 
Rs given by statistical Ss. Using the 
same arbitrary matching as in Exp. I, 
difference scores between the two 
groups were obtained for each of the 
six experimental conditions. A three- 
way analysis of variance performed on 
these difference scores found a signifi- 
cant main effect for R familiarity, F 
(1, 114) = 8.15, p < .01. (R familiarity
is a repeated measure and refers
to the familiarity of the single primed- 
for R.) None of the other main effects 
or interactions was significant. 

The fact that the only variable which 
was significantly related to the prim- 
ing effect was based on the familiarity 
of the single primed-for R suggested 
that it might be the occurrence of this 
R alone which was responsible for the 
results in Table 3. That is, it is possi- 
ble that this table only reflects the in- 
crease in the primed-for R, rather than 
a general increase in Rs from the 
primed meaning domain. 

Accordingly, an additional analysis 
of criterion Rs from the two meaning 
domains was made, in which the 
primed-for R was eliminated from the 
count. The results of the analysis indicated
that although there was a tendency
for the mean percentage of Rs
from the experimentally primed mean- 
ing domain to exceed the mean per- 
centage of such Rs in the normative 
data, none of the primed vs. normative 
differences was significant. Subse- 
quently, a three-way analysis of vari- 
ance based on the difference between 
the criterion scores of the experimental 
and statistical Ss was performed (with 
the primed-for Rs omitted from the 
criterion scores). This analysis re- 
sulted in no significant main or inter- 
action effects, suggesting that the sin- 
gle significant variable of R familiarity 
of the preceding analysis of variance 
was in fact the result of the inclusion of 
the primed-for R among the criterion 
Rs. 


Discussion 


The supplementary study of the fre- 
quency of Rs from the primed meaning 
domain indicated that, apart from the 
occurrence of the primed-for R itself, 
the primed experimental Ss do not give 
significantly more Rs from the primed 
meaning domain than do the unprimed, 
normative Ss. This is a striking finding 
and suggests that priming is a highly 
specific effect which focuses on a single 
R. This specificity was noted in an 
earlier study (Cramer, 1965) where 
priming produced importation in recall 
of the specifically primed-for R word, 
but not of other words from the same
semantic category. 

As was pointed out above, priming generally
is more effective in increasing the
frequency of Rs representing the less
common meaning of the stimulus. The
additional finding that priming is more
likely to increase the occurrence of Rs
of higher (T-L) familiarity is in large
part due to the interaction R Familiarity
× Meaning Commonness. The maximum
priming effect occurs for high familiar,
less common meaning Rs.
However, there is also an interaction
effect between R familiarity and stimulus 
familiarity, such that the maximum prim- 
ing effect occurs with high familiar Rs 
in combination with low familiar stimuli. 
The next largest effect occurs for high 
familiar Rs to A stimuli, and the next 
largest effect occurs for high familiar 
Rs to AA stimuli. In other words, for 
Rs of high familiarity the effect of prim- 
ing is inversely related both to common- 
ness of R meaning, and to stimulus 
familiarity. 

The interaction Commonness of R
Meaning × S Familiarity is the result of
the unexpected finding in the Rc-SAA
condition. While each of the other five 
experimental groups showed an increase 
in the number of primed-for Rs as a re- 
sult of priming, this experimental condi- 
tion resulted in a decrease in the number 
of primed-for Rs—a finding which one 
might term a negative priming effect. 
This effect can also be seen in Table 1. 
While the normative frequency of Rs in 
the Rc-SAA condition is 15.7%, after
priming Rc the R frequency is decreased
to 11%. 

This unexpected finding of negative 
priming raises some questions about the 
limitations of mediated priming. The 
lack of priming effect does not appear to 
be due to a ceiling effect, since the norma- 
tive frequency of 15.7% is well below 
the upper limit of priming effectiveness 
(35-50%: cf. Clifton, 1966; Segal & 
Cofer, 1960). Rather, inspection of the 
data suggests that this unexpected result 
may indicate that it is difficult to prime 
some antonym Rs. That is, when the 
priming words are of antagonistic mean- 
ing to the S (but consonant with the 
antonym R), they may be ineffective. 
Findings by Segal (1967a) of no effect 
following priming via an incongruent 
context are consistent with this notion. 
Secondly, priming may be more dif- 
ficult when the primary R of the norma- 
tive R distribution is antagonistic in 
meaning to the R being primed. 
CHARLES N. COFER


Pennsylvania State University


3 experiments compared item recall and category clustering in free
recall of nouns presented in sentences and in various control conditions.
Item recall was always poorer for nouns presented in sentences
and category clustering was usually diminished by the sentence context.
An interpretation which suggests that context may temporarily
disrupt category membership or internoun associations seems consistent
with the findings.


Common category membership and
associative overlap have been found to
be powerful predictors of the number
of items recalled and of the amount of
clustering observed in free recall of
nouns (see Cofer, 1965; Tulving,
1968). The usual interpretations of
these findings involve processes akin
to coding: (a) the category names are
presumed to be used by S to aid him
in the retrieval of items, or (b) in the
case of interassociated items, the retention
of some of the items enables S
to reproduce other items with which
those recalled are associated.

Studies of the kind just alluded to
have employed lists of nouns. Although
the nouns used have been interrelated
conceptually or associatively, an
important feature of ordinary connected
discourse involving interrelations
of words has been omitted from
such studies. This is the grammatical
or syntactical organization which is
characteristic of ordinary language.
The question may be raised whether,


1 The results of Exp. I are taken from
Cofer and Segal (1959) and those of Exp.
II from Cofer (1961). Experiment III was
performed under a faculty grant of aid from
the University of California, Berkeley,
1962-63. Grateful acknowledgment is made
to the following for their aid in various
phases of this work: Erwin Segal, Gerald
M. Reicher, Nancy Scopetta D'Amato, Rona
Copen Fisher, and Linda Weingarten.



when syntactic constraints are in oper- 
ation, the recall of nouns will have the 
same properties that it has in the ab- 
sence of these constraints. This is not 
to say that conceptual and associative 
interrelations among items in a sen- 
tence or a passage will have no influ- 
ence on the recall of that sentence or 
passage; indeed, Rosenberg (1966a, 
1966b) has found evidence that asso- 
ciative relations do relate to the recall 
of sentences. However, the question 
being raised here concerns the extent 
to which the recall of nouns themselves 
is influenced by syntactic relations at 
the time of presentation. 

It would seem likely that syntactic
constraints on nouns should alter their 
conceptual and associative relation- 
ships at least temporarily. Sentences 
can readily be supposed to have the ef- 
fect of perhaps removing a noun from 
its general class. When one says or 
reads “The doctor . . . ,” e.g., the
reference is usually to an individual 
who has that title, rather than to the 
class of doctors or to the higher level 
classes, professions or occupations. 
Similarly, a sentence presumably re- 
lates a given word to other words 
which may be neither conceptually nor 
associatively highly related to it. Thus, 
in the sentence, “The doctor put his 
hat on the sofa while he treated the 
boy,” three nouns are related to doctor
which are probably not associates
of doctor at all, and the three nouns 
do not readily fit into the conceptual 
categories in which doctor belongs. 

Interitem associations or conceptual 
relations seem to facilitate item recall 
and clustering when presentation of the 
items is in a list. If the analysis of 
the preceding paragraph is valid, one 
should anticipate that free recall of 
nouns after their presentation in sen- 
tences will be inferior in both quantity 
and organization as compared to their 
recall after presentation in a list. Such 
a result, if obtained, would lend some 
support to the interpretation of the role 
of syntax just indicated. 

The experiments reported in the 
present paper were designed to test 
the effects on clustering and recall of 
nouns of embedding the nouns in sen- 
tences. 


EXPERIMENT I 
Method 


Materials —Twenty-four nouns were se- 
lected, six from each of four categories— 
occupations, articles of clothing, articles of 
furniture, and four-footed animals. The 
nouns for each category were selected from 
those which, according to the norms of 
Cohen, Bousfield, and Whitmarsh (1957), 
were frequent associates to the category 
names.

Six sentences were made up, all having 
very similar structures, and each containing 
one noun from each of the four categories.
The sentences follow: 


The lawyer took off his hat, turned off 
the lamp, and put the cat out. The 
engineer dressed only in pants lay on 
a sofa near the dog. The carpenter put 
his coat on the stool while he watered 
the horse. The professor wore a sweater 
and sat on a table as he watched the rat. 
The doctor put his shoes by the couch 
as he cared for the cow. The nurse 
wearing a dress stood on a chair in 
order to see the rabbit. 
These sentences were read, in the order 
indicated and at the rate of about one word 
per 2 sec., to Ss of the experimental group


(Group S). In order to provide control 
data for the recall of the nouns from the 
sentences, the 24 nouns alone were formed 
into two lists. In one the sequence of the 
nouns was random, and in the other it 
was the same as it was in the sentence, 
i.e., it read lawyer, hat, lamp, cat, engineer,
etc. The randomized list of nouns
was read to one control group (Group 
RO) and the nouns alone, arranged in sen- 
tence order, to the other control group 
(Group SO). Presentation rate was about 
2 sec/word. 

Subjects.—The Ss, students in undergrad- 
uate psychology classes at the University of 
Maryland, were tested in small groups of 
four to six outside of their class meetings.
There were 30 Ss in Group S, 19 in Group 
RO, and 20 in Group SO. 

Procedure.—The ordinary instructions for 
free recall were given to Groups RO and 
SO, i.e., Ss were told a list of words would 
be read to them and that afterwards they 
were to recall the words in any order in 
which they thought of them. There were 
two 3-min. written recalls. One followed
immediately after list presentation and the 
second 5 min. following the completion of 
the first recall. A second recall was used 
because, typically, clustering has been found 
to increase at second recall (Gonzalez & 
Cofer, 1959). The interval was filled by 
having Ss complete a “Sentence Formation 
Test” (Cofer, 1959). 

The procedure for Group S was in all 
respects identical to that for the control 
groups except that this group heard the list 
of six sentences. After list presentation, Ss 
were asked to recall the nouns by writing 
them in a column on a sheet of paper. 
Prior to list presentation, two other sentences 
had been given, and illustrations were given 
of the partial kinds of recall that were de- 
sired. Examples were the recall of verbs, 
of adjectives, and of nouns. The Ss did 
not know, at presentation of the sentences, 
which part of speech they would be asked 
to recall. Like the control groups, they 
were told to recall the nouns in any order 
in which the words occurred to them. 

Scoring—The recalls were scored twice. 
One scoring was based on clustering in the 
four categories. To score a cluster two or
more words from the same category must
occur in succession. The other was based
on clustering in terms of the six sentences,
i.e., sequences of two or more words from
the same sentence were scored as clusters. 
Words correctly recalled were also counted. 
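The two scorings described above can be sketched as a single routine applied with two different groupings. This is a minimal illustration, not the scoring procedure as actually carried out: it assumes every recalled word belongs to the list (intrusions are not handled), and the short recall protocol is invented. The word-to-category and word-to-sentence assignments follow the first two sentences given in the Materials.

```python
from itertools import groupby


def score_clusters(recall, group_of):
    """Return (number of clusters, number of clustered words).

    A cluster is a run of two or more successive recalled words that
    share the same group (a category or a sentence).
    """
    clusters = 0
    clustered_words = 0
    for _, run in groupby(recall, key=lambda word: group_of[word]):
        length = len(list(run))
        if length >= 2:
            clusters += 1
            clustered_words += length
    return clusters, clustered_words


# Groupings for the nouns of the first two sentences.
CATEGORY = {
    "lawyer": "occupation", "engineer": "occupation",
    "hat": "clothing", "pants": "clothing",
    "lamp": "furniture", "sofa": "furniture",
    "cat": "animal", "dog": "animal",
}
SENTENCE = {
    "lawyer": 1, "hat": 1, "lamp": 1, "cat": 1,
    "engineer": 2, "pants": 2, "sofa": 2, "dog": 2,
}

# Hypothetical recall protocol (invented for illustration).
recall = ["lawyer", "engineer", "cat", "dog", "hat", "lamp", "sofa"]
print(score_clusters(recall, CATEGORY))  # four-category scoring
print(score_clusters(recall, SENTENCE))  # six-category (sentence) scoring
```

Scoring the same protocol both ways makes the competition between the two bases of organization visible: a recall order that yields many category clusters tends to yield few sentence clusters, and vice versa.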
Results


Clustering results will first be con- 
sidered in terms of the ratio of repeti- 
tion (RR). This value is given by 
subtracting the number of clusters from 
the number of clustered words and di- 
viding by one less than the number of 
words recalled. The chance value for 
the four-category case is .2173 and for 
the six-category case is .130 (Cohen, 
Sakoda, & Bousfield, 1954). Raw 
values were used throughout. Mean 
values for RR and number of items 
recalled (N) are presented in Table 1. 
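The ratio of repetition as defined above can be written out as a short computation. The `ratio_of_repetition` function follows the verbal definition in the text. The `chance_rr` function is an assumption: it takes the chance value to be the probability that two adjacent words in a random ordering of the full 24-word list share a group, which gives figures close to those quoted, but whether this is the derivation used by Cohen, Sakoda, and Bousfield (1954) is not stated in the text.

```python
from fractions import Fraction


def ratio_of_repetition(clustered_words, clusters, words_recalled):
    """RR = (clustered words - clusters) / (words recalled - 1)."""
    if words_recalled < 2:
        raise ValueError("RR needs at least two recalled words")
    return (clustered_words - clusters) / (words_recalled - 1)


def chance_rr(items_per_group, list_length):
    """Assumed chance value: probability that two adjacent words in a
    random ordering of the whole list fall in the same group."""
    return Fraction(items_per_group - 1, list_length - 1)


# Hypothetical protocol: 7 words recalled, 3 clusters holding 6 words.
print(ratio_of_repetition(clustered_words=6, clusters=3, words_recalled=7))

# Four categories of six nouns, and six sentences of four nouns.
print(float(chance_rr(6, 24)), float(chance_rr(4, 24)))
```

Under this assumption the four-category case gives 5/23 and the six-category case 3/23, in line with the chance values reported above.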

In clustering at immediate recall 
based on four categories, the sentence 
group (S) has the lowest ratio of repe- 
tition and the random-order noun 
group the largest one. Similar rela- 
tions are found for the delayed recall, 
and all values exceed chance expectation.
Analyses of variance for groups
with unequal N (Winer, 1962, pp. 96- 
104) for these two recalls, however, 
yield F values (2, 66) of 1.919 and 
1.930, respectively, whereas the re- 
quired value for significance at the .05 
level is 3.15. 

The means for the six-category case 
show a relatively high value in im- 
mediate recall for the sentence group 
and a value slightly in excess of chance 
expectation for the sentence-order con- 
trol group. The mean for the random- 


order group falls below chance in both 
recalls, as it should since there was no 
basis for the six-category clustering in 
this group. The analysis of variance 
for the immediate recall data in this 
case is complicated by heterogeneity of 
variance (due to the random-order 
group) but is significant, F (2, 66) = 
5.99, p<.01. A chi-square analysis 
of the number of Ss clustering signifi- 
cantly at immediate recall yields a p 
value at the .01 level, with the sentence 
group showing the most and the ran- 
dom-order group the least clustering. 
The values at delayed recall for the 
six-category case are not significantly 
different; clustering in all groups at de- 
layed recall is below or near chance 
values. 

The changes in the ratio of repeti- 
tion between immediate and delayed 
recalls were evaluated by means of 
suitable t tests. The rise in RR for
Group S is not significant, t =
1.474, t.05 = 2.064; that for Group SO
just misses significance at the .05 level,
t = 2.084, t.05 = 2.093; and the one for
Group RO is significant, t = 2.756,
t.05 = 2.101. The only significant
change between immediate and delayed
recalls in RR for six-category scoring
is the decline for the sentence group,
t = 3.196, t.01 = 2.797. The rise in
clustering at second recall for Group
RO is a typical finding; the absence of
a rise, as in Group S, is not.


TABLE 1


MEAN VALUES FOR THE RATIO OF REPETITION SCORES (RR) AND
NUMBER OF ITEMS (N) RECALLED, EXP. I


           Four Category        Six Category
           Immediate            Immediate    Delayed
Group      RR       N           RR           RR          n

S          .366     11.93       .216         .103        30
SO         .458     16.25       .169         .143        20
RO         .485     14.58       ...          ...         19


Note.—n is the number of Ss.

These results, considered altogether, 
suggest the establishment in the sen- 
tence group (and to a lesser extent in 
the sentence-order group) of two clus- 
tering tendencies which conflict one 
with another, i.e., clustering in nouns 
by categories is incompatible with clus- 
tering them by common sentence mem- 
bership. This may account for the re- 
duced means for four-category RR in 
both recalls for these two groups, as 
compared to the random-order group, 
although the differences are not sig- 
nificant. It is supported by the sig- 
nificant six-category clustering shown 
by these groups at immediate recall. 
The six-category clustering is not 
strong, however, as it falls at delayed 
recall; four-category clustering in the 
sentence group is apparently still af- 
fected, however, as the RR value does 
not rise significantly between immedi- 
ate and delayed recalls for this group, 
as it does in Group RO and as it com- 
monly does for lists of nouns (Gon- 
zalez & Cofer, 1959). 

Table 1 also shows mean values for 
the number of words correctly recalled. 
For no group is the change between 
immediate and delayed recall signifi- 
cant, but an analysis of variance for 
unequal Ns for each recall shows highly 
significant differences, F (2, 66) = 
11.41 and 7.39, respectively, p < .01. 
Scheffé’s comparison of the means for 
the recalls indicates that those for SO 
and RO are not significantly different 
but that each is significantly different 
from that for S. 


Discussion 


The general findings may be sum- 
marized as follows. Recall of words is 
significantly impaired in the sentence 
condition at both recalls, as compared 
to the other two groups. While the 
groups at either recall do not differ significantly
in four-category clustering, the
failure of the sentence group to show a 
significant rise in RR between immediate 
and delayed recalls suggests some impair- 
ment, a finding supported by its lower 
four-category RR values at both recalls.
It clusters significantly on the six-cate- 
gory basis at immediate recall but this de- 
clines significantly at delayed recall. 

It is possible that embeddedness in sen- 
tences has to some extent altered the 
recall and the degree of clustering of 
these nouns. At least in part this may 
be due to the establishment of a compet- 
ing tendency to cluster on the basis of 
sentences (six category). Another fac- 
tor, involved both in absolute recall and 
in clustering, which cannot be evaluated 
on the basis of this experiment, is the 
fact that Ss in Group S heard many more 
words than Ss in the other groups. A 
control for this factor is included in the 
next experiment. 


EXPERIMENT II


In addition to a control for number 
of words presented to the sentence 
group, separate groups were run in 
which low-frequency associates of the 
category names were also embedded in 
sentences. It is surmised that category
clustering of low-frequency associates, 
which do not cluster as strongly as 
high-frequency associates (Bousfield, 
Cohen, & Whitmarsh, 1958), should be 
more affected by the sentence context 
than the high-frequency nouns of Exp.
I.


Method 


Experiment I was partially replicated by 
the use of the same sentences and nouns, 
random order of the nouns alone, and sen- 
tence order of the nouns alone. A fourth 
group was added, however, to give a con- 
trol on the sheer number of words with 
which Ss in the sentence condition were 
confronted. This was accomplished by 
taking all the words in the six sentences, 
numbering them, and randomizing them by 
means of a table of random numbers. This 
fourth group then heard all the words of
the sentences, of which they were to recall
only the nouns. This condition is referred
to as the random words (RW) condition.
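The construction of the random words list can be sketched as follows. This is a rough illustration only: a seeded pseudo-random shuffle stands in for the table of random numbers used in the study, and the function name, the punctuation stripping, and the seed value are assumptions introduced here.

```python
import random


def randomize_words(sentences, seed=None):
    """Pool every word of the sentences and return them in random order.

    A pseudo-random shuffle is used as a stand-in for numbering the
    words and ordering them from a table of random numbers.
    """
    words = [word.strip(".,") for s in sentences for word in s.split()]
    rng = random.Random(seed)
    rng.shuffle(words)
    return words


# First sentence of Exp. I, used here as a one-sentence example.
sentences = [
    "The lawyer took off his hat, turned off the lamp, and put the cat out.",
]
rw_list = randomize_words(sentences, seed=1)
print(len(rw_list), rw_list)
```

Every word of the sentence context is retained, so the RW list matches the sentence condition in sheer number of words while removing the syntactic organization.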

A further variation in the procedure was 
introduced. In the sentence presentation of 
Exp. I, S, during the presentation of the 
sentences, did not know which type of word 
he would be asked to recall. The variation 
consisted of telling S ahead of time that he 
would be asked to recall the nouns, and 
this change was applied to the sentence con- 
dition as well as the random word condition. 
For both of these conditions a practice 
illustration was given. For the sentence 
condition this consisted of the presentation 
of two sentences and a listing of the nouns 
they contained to illustrate what S was being 
asked to do. For the random word con- 
dition, the words of the two practice sen- 
tences were randomized, read, and the nouns 
were similarly listed. 

In summary, there were four groups (S, 
SO, RO, RW) in each part of this experi- 
ment, the parts involving either the high- 
frequency noun associates or the low-fre- 
quency noun associates. 

Materials—The nouns used for the high- 
frequency associates list were identical to 
those used in Exp. I. Low-frequency asso- 
ciates were taken from the Connecticut 
norms (Cohen, Bousfield, & Whitmarsh, 
1957) and were substituted for the high- 
frequency nouns in the other part of the 
experiment. 

Instructions—Except for the modification
involved in telling S he would be asked to
recall nouns only, the same instructions were
used as in Exp. I, the same practice sentences
were used, and the same procedure
for presentation was used. 

Subjects—The Ss were students in psy- 
chology classes in the College of Commerce 
at New York University and in the Baruch 
School of the City College of New York 
and were run in their classes. Each experi- 
mental group, however, was made up of more 
than one class. These Ss seemed generally 
less motivated and perhaps less able than 
those of Exp. I and III. 

Second recall—Following the first 3-min. 
recall period, a 5-min. interval filled by 
an association test was introduced. After 
this interval, a second 3-min. recall period 
was provided. The Ss wrote their responses.

Scoring—All recalls were scored for four- 
category clustering and for six-category clus- 
tering, as in Exp. I. Raw scores were used 
throughout. 


Results 


Table 2 presents the results. The 
two parts of the experiment will be 
analyzed separately. 

The differences in mean four-category
RR at immediate recall for the
high-frequency associates are significant,
F (3, 163) = 7.92, p < .01, and
the Scheffé test indicates that Groups
RO and RW do not differ but that
both differ from Group S (p < .05).
Group RW differs from Group SO
(p < .05) but Group SO differs significantly
from neither Group RO nor
S. At immediate recall, there is also
a significant result, F (3, 163) = 8.38,
p < .01, for number of items correctly
recalled. Again Groups RO and RW
do not differ significantly by Scheffé’s
test but both differ significantly (p <
.05) from Group S. Group SO differs
significantly from no other group.


TABLE 2


MEAN VALUES FOR THE RATIO OF REPETITION SCORES (RR) AND
NUMBER OF ITEMS RECALLED (N), EXP. II


           Immediate Recall          Delayed Recall
Group      N       RR/4    RR/6      N       RR/4    RR/6      n

High-Frequency Associates
RO         ...     .416    ...       12.22   .505    .059      40
SO         ...     .303    ...       10.09   .403    .117      42
RW         12.56   .456    .053      12.26   .465    ...       46
S          9.79    .243    .147      9.08    .315    .069      ...

Low-Frequency Associates
RO         10.14   .284    .110      ...     .416    .049      ...
SO         ...     .219    .188      ...     ...     ...       ...
RW         ...     .319    .064      10.12   .342    .082      17
S          8.53    .275    .143      7.82    .280    .102      ...


Note.—n is the number of Ss. RR/4 refers to four-category clustering and RR/6 to sentence clustering.

As Groups RO and RW differ from 
one another only slightly (true also at 
delayed recall), it appears that their 
differences from Group S cannot be 
due to sheer number of words, at least 
when at list presentation S knows what 
he is to recall. Neither RW nor RO 
shows any six-category clustering at 
either recall but, of course, this basis 
of organization was not available to 
them in their lists. 

The inferiority of Group S to the 
control groups in RR and N at both 
recalls replicates the findings of Exp. 
I, as does the intermediate status of 
Group SO. Again, Groups S and SO 
show marginal six-category clustering 
at immediate recall which disappears at 
second recall. All groups show a sig- 
nificant rise in four-category RR be- 
tween immediate and delayed recall ex- 
cept Group RW; changes between re- 
calls in N are not significant. Even 
though Groups S and SO show aug- 
mented RR at second recall, their mean 
values remain inferior to those of the 
control groups. This may represent a 
continuing effect of the sentence con- 
text and the sentence order. 

Data for the low-frequency associ- 
ates are presented in the lower part of 
Table 2. For four-category RR at im- 
mediate recall, the differences are not 
significant, F (3, 66) = 1.11, p > .05, 
and the means for Groups S, RO, and 
RW differ little. Group RO shows a 
significant rise (by t test) at second
recall, the only one to do so. For N 
at immediate recall, F (3, 66) = 2.87, 
.05 > p > .01. Scheffé’s test again
indicates that Groups RO and RW differ
significantly from Group S (p =
.05) as well as from Group SO but
do not differ from each other. Groups 
SO and S do not differ. 

The similarity of Groups RW and 
RO confirms the finding with the high- 
frequency associates, and the results 
for N are similar also. For four- 
category RR, however, the greater ef- 
fects of sentence context on low-fre- 
quency associates did not materialize, 
despite the presence of six-category 
clustering at a slightly better than 
chance level in both Groups S and SO. 


EXPERIMENT III


In this experiment the same nouns 
were used as in Exp. I and Groups S 
and RO were replicated. The nouns 
were placed in new sentences, however, 
to determine whether the effects of sentence
modification seen in Exp. I and
for the high-frequency associates of 
Exp. II were specific to the kinds of 
sentences employed there. The group 
presented the new sentences is desig- 
nated S’. 


Method 


Materials—The new sentences, pre- 
sented by means of a tape recorder, were as 
follows : 


The hat that was tossed at the cat by 
the lawyer struck the lamp. 

The dogs tugged at the pants that the 
engineer had left on the sofa. 

The stool on which the doctor had put 
his coat was overturned by the horse. 
Under the sweater on the table the rat 
was hiding from the professor. 

The carpenter took the shoes he had 
put under the chair and went to feed the 
cow. 

Sitting on the couch, the nurse petted 
the rabbit that was nibbling at her 
dress. 


Subjects.—Fifteen volunteers from introductory
psychology at the University of
California, Berkeley, were run individually
in each of the three groups.


TABLE 3


MEAN VALUES FOR FOUR-CATEGORY RATIO
OF REPETITION SCORES (RR) AND
NUMBER OF ITEMS RECALLED
(N), EXP. III


             Immediate Recall     Delayed Recall
Condition    RR       N           RR       N

S            .378     13.73       .431     14.00
S’           .325     11.07       .476     10.53
RO           .451     15.67       .527     15.60


Note.—Number of Ss in each group is 15.

Procedure.—The procedure was identical
to that of Exp. I, as was the scoring. The 
words were recorded at about a 2-sec. rate. 
Six-category clustering is not reported, as 
the prior experiments showed that it occurred 
only slightly. 


Results 


Table 3 shows the results. Consis- 
tent with the results of the first two 
experiments, the sentence conditions 
show less four-category clustering and 
poorer recall of items than the RO 
group at both recalls. The difference 
between the two sentence conditions 
is small, and the overall analysis of the 
differences in four-category RR falls 
short of significance (F < 1.0). How- 
ever, the differences in N are signifi- 
cant, F (2, 42) = 9.98, p < .01, and 
Scheffé’s test indicates that Groups 
RO and S differ from Group S’ but 
not from each other. 


Discussion 


The major purpose of these experi- 
ments was to determine whether nouns 
would behave in free recall after presenta- 
tion in a sentence context differently from 
the way they behave after presentation in
lists. For the set of high-frequency
associates to category names used here,
in all three experiments the four-category
RR for the S and S’ groups was inferior
to that of control groups (RO, RW, and
SO), though the differences reached significance
only once. The consistency of
the differences obtained, however, points 
to a real effect, supported by the emer- 
gence of some six-category (sentence) 
clustering in S and SO groups. The re- 
sults for four-category RR for the low- 
frequency associates are less clear-cut, 
though it can be remarked that Cofer 
and Segal (1959a) have reported that 
low-frequency associates (not the ones 
used here) are not as much affected in 
RR scores by adjectival modification as 
are high-frequency associates. In all 
cases, the S groups reproduce fewer list 
items than the control groups, and if the 
findings of Exp. II have any generality, 
this decrement cannot be due to the sheer 
number of words presented the S (and 
S’) groups, since the two RW groups and 
two RO groups of that experiment did 
not differ on this measure. 

Some data reported by Rosenberg 
(1965a, 1965b) suggest an interpretation 
of the findings concerning sentence con- 
text reported here and already suggested 
in the introduction. Rosenberg (1965a) 
placed a word, e.g., a noun, in a sentence 
and asked Ss to produce other words, 
including nouns, which were to be in- 
serted in empty spaces in the sentences. 
These empty spaces corresponded to verb 
and direct-object “slots” in the sentences. 
He also reported (Rosenberg, 1965b) 
free-association data for these noun- 
stimulus items. The distribution of nouns 
produced for the direct-object slot in a 
sentence when a noun is presented in the 
subject slot differs considerably from the 
distribution for the subject noun when it 
is presented alone as a stimulus in free 
association. It may be that the shift in 
the associative distributions of nouns 
arising from sentence embeddedness can 
explain the results obtained here for four- 
category clustering and for item recall. 
Why this effect should be more pro- 
nounced for item recall than for cluster- 
ing is, of course, problematical. It may 
be that with low-frequency associates of 
category names their tendencies to elicit 
as associates other members of the cate- 
gory are so low that the sentence context 
does not influence them. 
VOLUNTARY CONTROL OF


FRAME OF REFERENCE AND 


SLOPE EQUIVALENCE UNDER HEAD ROTATION* 


FRED ATTNEAVE AND KATHLEEN W. REID


University of Oregon 


Transfer studies show that people normally associate responses with 
physical rather than retinal stimulus orientations. In this study experimental
Ss were instructed to adopt a head-anchored reference system
(“Think of the top of your head as ‘up’.”) with heads tilted, during
either initial learning or transfer. These instructions strongly
facilitated transfer based on retinal invariance with head position 
changed. Moreover, faster response to retinal verticals and hori- 
zontals than to retinal diagonals with head tilted, prior to transfer, 
was significantly predictive of superior performance on the transfer 
task, which required same response to same retinal stimulus with head 
upright. Conclusions: (a) Invariance of perceived or phenomenal 
slant (rather than either physical or retinal slant) is the critical de- 
terminant of transfer. (b) Likewise, lines perceived as vertical and 
horizontal tend to evoke faster responses than those perceived as 
obliques. (c) Phenomenal slant depends on the orientation of a frame 
of reference, which is subject to voluntary as well as proprioceptive
control.


In discrimination reaction time 
(DRT) experiments recently reported 
by Attneave and Olson (1967), phy- 
sically vertical and horizontal lines 
evoked faster responses than physically 
diagonal lines, whether S viewed the 
stimuli with head upright or with head 
tilted 45° so that physical and retinal 
orientation were in opposition. A 
transfer study in the same context 
showed further that identifying re- 
sponses were associated with physical 
rather than retinal orientation: When 
head tilt was changed by 45° after a 
learning period, Ss had no difficulty 
giving the same responses to the same 
physical stimuli, but typically showed
marked disruption when required to
give the same responses to the same
retinal stimuli.
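
The opposition between physical and retinal orientation under a 45° head tilt can be made concrete with a small sketch. It rests on assumptions not stated in the text: angles are in degrees measured clockwise from upright, a line's orientation is taken modulo 180°, and ocular counter-rolling is ignored. The function names are illustrative only.

```python
def retinal_orientation(physical_deg, head_tilt_deg):
    """Orientation of a line relative to the head's vertical axis.

    Assumed convention: degrees, clockwise from upright, modulo 180.
    Ocular counter-rolling is ignored.
    """
    return (physical_deg - head_tilt_deg) % 180


def slant_class(orientation_deg):
    """Label an orientation as vertical, horizontal, or diagonal."""
    o = orientation_deg % 180
    if o == 0:
        return "vertical"
    if o == 90:
        return "horizontal"
    return "diagonal"


# Head tilted 45 degrees clockwise, as in the experiments described here.
for physical in (0, 45, 90, 135):
    retinal = retinal_orientation(physical, 45)
    print(slant_class(physical), "->", slant_class(retinal))
```

Under this convention the physical verticals and horizontals fall on retinal diagonals, and the physical diagonals fall on retinal verticals and horizontals.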

In the case of three atypical Ss who 
showed good transfer on the basis of 
retinal invariance when shifted from 


* This research was supported by the Air
Force Office of Scientific Research, Grant
No. 973-66.


head upright to head tilted, there was 
some suggestion that they did so not by 
virtue of any simple, static connections, 
but rather by rotating the whole orien- 
tational reference system into corre- 
spondence with the axes of the head. 

The set of hypotheses that the au- 
thors now propose to investigate may 
be stated briefly as follows: (a) the 
orientation of a stimulus is perceived 
(described, categorized) in terms of a 
subjective frame of reference; (b) this 
reference system is labile; (c) normally 
it is kept in an invariant relationship 
to the physical vertical and horizontal;
(d) however, it is subject to some 
degree of voluntary control such that 
under special instructions it can be 
kept in invariant relationship to the 
axes of the head; and (e) it is the 
stimulus as perceived (i.e., the descrip- 
tion of the object within the prevailing 
reference system) that constitutes the 
antecedent term when S-R associations 
are formed or exercised. 

If these hypotheses are correct, instructing
S to think of the top of his
head as ‘up’, when his head is tilted, 
might be expected to alter radically the 
associations between physical orienta- 
tions and identifying responses, either 
at the time of learning or at the time 
of recall. More specifically, it was pre- 
dicted that this set on the part of S 
would remove or diminish the difficulty 
of maintaining a constant response to 
a retinally invariant stimulus under 
head rotation. The effects of such in- 
structions on performance under two 
different conditions of transfer are in- 
vestigated in the present study. 
Somewhat incidentally, the authors 
were also interested in determining 
whether the attempted voluntary shift 
in frame of reference would alter the 
relative difficulty of identifying di- 
agonals vs. horizontals and verticals. 


METHOD 


Subjects.—The Ss were 64 (4 groups of
16) University of Oregon undergraduate
volunteers, who were paid for participation.
All met a criterion of 2/2 acuity with no de- 
tectable astigmatism in the right eye. 

Materials.—For the major portion of the
study, the 32 stimulus cards described by 
Attneave and Olson (1967) were used again. 
Each contained a single line segment .5° long, 
positioned 2° from a central fixation point in 
a circular field. The lines varied over four 
orientations (horizontal, vertical, and the two 
diagonals) and eight positions relative to the 
fixation point (above, upper right, etc.). 

Instructional or practice materials in- 
cluded the following: (a) four cards illus- 
trating the four slants with segments cen- 
tered in the field; under some conditions these 
cards contained the words TOP and BOT- 
TOM positioned either at the physical top 
and bottom of the field or, alternatively, 
rotated 45° clockwise; (b) four cards illus- 
trating the four slants with segments lo- 
cated 2° from a central fixation point in 
randomly chosen directions; and (c) four 
cards each containing eight segments (two 
examples of each slant) randomly ordered 
in a circular array 2° from the center of 
the field. 

Apparatus.—A two-field tachistoscope
(Gerbrands) was used to present S’s right
eye with a circular field 7 in. in diameter
2 ft. away. An adjustable headpiece held 
S's head either upright or tilted 45° clock- 
wise. The pre- and postexposure field, which 
contained a central fixation point, and the 
stimulus field containing a line segment were 
matched in luminance at 74 mL. 

When S pressed a hand switch, the stimulus
field was exposed for 35 sec. and a clock
was started. The S’s vocal response turned
off the clock by means of a voice relay. 

Procedure.—The S was told, without
elaboration, that the purpose of the experi- 
ment was to study his reaction time under 
several different conditions. He was told 
that on each trial he would look into the 
tachistoscope, keeping his head fitted snugly 
into the headpiece, and fixate the dot in the 
preexposure field. When he pressed the 
hand switch a short line would flash on the 
screen. The line would have one of four 
slants, each with a different name. The S
was to say the name of the line as quickly 
as possible after its exposure. He was then 
shown each line, centered in the field, and 
told its name. The names (which were 
counterbalanced over slants within each 
group) were Adam, Abner, Albert, and 
Andrew. Each slant was then shown off- 
center, as in the main body of the experi- 
ment; S was told that these were samples 
of the actual stimuli and asked for the name 
of each. Finally, before any reaction times 
were taken, he was shown the cards with 
eight lines in a circular array, and asked to 
go around the circle giving the name of 
each. This process was continued with suc- 
cessive cards to a criterion of one perfect 
circuit (two correct responses to each 
slant). 

Reaction times were then taken on 64 
trials involving two successive random per- 
mutations of the 32 stimulus cards. These 
were followed, after further instructions and 
alteration of head position, with 32 transfer 
trials employing another random permuta- 
tion of the cards. Details of general pro- 
cedure not specified here were as in the 
Attneave and Olson (1967, Exp. I and II)
study. 

The Ss were assigned to four groups de- 
fined by two conditions of transfer, each 
with an experimental (E) and a control (C) 
group. The difference between E and C
groups was simply that the former were instructed
to adopt a particular (head-anchored)
frame of reference in viewing the
stimuli, either initially or during the transfer
trials. The present control groups, 1C
and 2C, were very nearly replications of the
groups called 5 and 2, respectively, in the 
Attneave and Olson (1967) study. 

Under Cond. 1, Ss viewed the stimuli 
initially (i.e. during instructions, preliminary 
practice, and the first 64 DRT trials) with 
their heads tilted 45° clockwise. They were 
all then shifted to a head-upright position 
for the 32 transfer trials. After this shift, 
they were required to give the same responses 
to the same retinal stimuli. In the case of 
Group 1E, the critical part of the initial in- 
structions was as follows: 


When your head is tilted, there are two 
ways you can think of “up” and “down”: 
you can think of the ceiling as “up” and 
the floor as “down,” or you can think of
the top of your head as “up” and your 
chin as “down.” Now, as you look at 
these lines, I want you to think of “up” 
and “down” the second way: always 
think of the top of your head as “up.” 


This idea was repeated in several different 
paraphrasings, and its importance to the 
experiment was emphasized. When S was 
first told the names of the slants, the four 
cards on which the lines appeared contained 
the words TOP and BOTTOM rotated 45°, 
i.e., at the retinal top and bottom of the
field. No subsequent cards contained these
words. No attempt was made to influence
the frame of reference of the 1C group, with 
the exception that the words TOP and 
BOTTOM were shown, without comment, 
at the gravitational top and bottom of the 
initial four cards. Thus the TOP-BOT- 
TOM labels merely reinforced the natural 
tendencies of control Ss (cf. Group 5 of the 
Attneave and Olson study), whereas they 
constituted a part of the general effort to 
induce a head-oriented frame of reference in 
the experimental Ss. 

At the time of transfer, the new assign- 
ment of names to slants was illustrated by 
saying “It’s just as if you were reading and 
rotated your head and rotated the book at 
the same time.” Samples were shown, and 
care was taken to insure that every S under- 
stood the principle involved. 

It should now be evident to the reader 
that if people are able voluntarily to rotate 
the frame of reference for orientation in a 
functionally meaningful way, the experimental
instructions should have the effect
of improving transfer performance. 

Condition 2 consisted of training with head 
upright and transfer with head tilted 45°; 
during the latter S was again required to
give the same names to the same retinal 
slants. During the preliminary and train- 
ing trials Ss in Group 2C and Group 2E 
were treated identically, and like those in 
Group 1C except that the words TOP and 
BOTTOM did not appear at all on the initial 
instructional cards. In the case of Group 2E 
only, instructions were designed to influence 
S’s frame of reference at the time of trans- 
fer. After being given the control instruc- 
tions, S was urged to think of the top of 
his head as up, etc., and assured that if
he did so his identifications would be cor- 
rect. The four cards bearing the words 
TOP and BOTTOM rotated 45° were used 
to illustrate assignment of names to slants 
under the transfer task. The transfer in- 
structions for Group 2C were like those used 
under Cond. 1 except for minor changes 
relating to direction of shift. With this 
group the TOP-BOTTOM labels were 
never used, since presenting them in the 
gravitational orientation at the time of trans- 
fer would have been in rather obvious con- 
tradiction to the transfer instructions. The 
aim was essentially to let control Ss handle 
the transfer task however they would, with 
no special help or hindrance. Control Ss, 
it should be emphasized, understood the re- 
quirements of the transfer task (and made 
practically no errors on it); they under- 
stood the principle that the lines were being 
rotated along with the head, but any direct 
suggestion that they should think of up, 
down, etc., in any particular way was
avoided. 


RESULTS 


Under Cond. 1 (see the left graph
in Fig. 1), the effect of the experimental
instructions was strikingly
positive. Whereas control Ss were
markedly slowed down (as in the Attneave
and Olson, 1967, study) when
shifted from head tilted to head upright 
and required to give the same names 
for the same retinal slants, experi- 
mental Ss, who had presumably viewed 
the lines in terms of a head-anchored 
frame of reference from the beginning, 
showed no mean decrement but rather 
a continuing improvement. This dif- 
ference between Groups 1C and 1E is 
highly reliable, t (30) = .448/.094 =
4.76, p < .001. Here and in the subsequent
test, the difference referred to
is between two distributions of dif- 
ferences, the latter being between 
DRT, averaged within each S, on the 
second 32 training trials (i.e., Blocks 
5-8 of the graph) and corresponding 
DRT on the 32 transfer trials. 
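
As a rough illustration of the test just described, the sketch below computes each S's change score and compares two groups with a pooled-variance t. The pooled-variance form is an assumption: the text reports only the ratio of the mean difference to its standard error, with 30 df. The function names are hypothetical and no data from the study are reproduced.

```python
from math import sqrt
from statistics import mean, variance


def transfer_change(pre_rts, post_rts):
    """Per-S change in mean DRT: transfer trials minus last 32 training trials."""
    return mean(post_rts) - mean(pre_rts)


def pooled_t(group_a, group_b):
    """Student's t for two independent groups of change scores.

    Uses the pooled variance estimate (an assumption about the
    original analysis). Returns (t, df).
    """
    na, nb = len(group_a), len(group_b)
    df = na + nb - 2
    pooled_var = ((na - 1) * variance(group_a)
                  + (nb - 1) * variance(group_b)) / df
    se = sqrt(pooled_var * (1 / na + 1 / nb))
    return (mean(group_a) - mean(group_b)) / se, df


# With 16 Ss per group, df = 16 + 16 - 2 = 30, as reported.
# The rounded values in the text give a ratio within .01 of 4.76.
reported_ratio = 0.448 / 0.094
```

Each S contributes one change score, so the comparison is between two distributions of 16 differences rather than between raw reaction times.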

Results under Cond. 2 (Fig. 1, right 
graph) show a facilitating effect of ro- 
tating the reference system with the 
head at the time of the retrieval of 
associates. This effect is quite sub- 
stantial, though smaller than that of 
Cond. 1, t (30) = .254/.081 = 3.14, p
< .01.

Reaction times from the 64 pretrans- 
fer trials of Cond. 1 were separately 
averaged for diagonal stimuli and for 
horizontal and vertical stimuli (gravi- 
tationally defined) within each S. The 
differences between these means (diagonals
minus H-V) are plotted on the
abscissa of Fig. 2 for all Ss, with dif- 
ferent symbols for experimentals and 
controls. On the ordinate is plotted 
each S’s “transfer loss,”² i.e., the increase
in his DRT (averaged over all
slants) from the last 32 pretransfer 
trials to the 32 posttransfer trials. 
The expectations with which we ex- 
amine this graph are somewhat as fol- 
lows: (a) that control Ss will respond 
faster to (physical) horizontals and 
verticals than to diagonals despite their 
head tilt (as in the Attneave and Olson 
study) and show high transfer losses; 
(b) that experimental Ss will respond 


2 This term is not to be taken too literally. 
Negative values of “transfer loss” do not 
mean that altering the head position im- 
proved performance; a more likely interpre- 
tation is that performance continued to im- 
prove with practice despite the change. 
Fig. 2. Relationship between two response variables under Cond. 1. (Each point represents
an S. It is hypothesized that both variables are dependent on S’s frame of reference.) 


faster to diagonals which are vertical 
and horizontal relative to their heads 
and assumed frames of reference and 
show low transfer losses; (c) that if 
some control Ss happen to adopt a 
head-oriented frame of reference on 
their own initiative, this deviation from 
the group will be reflected in decreased 
scores on both variables; (d) likewise, 
that if some experimental Ss are un- 
willing or unable to adopt a head- 
anchored frame of reference, their 
scores will be relatively high on both
variables; and (e) that, as a result of 
all the preceding factors, the two vari- 
ables will be positively correlated. 

These expectations are partially confirmed.
We have already seen that
transfer loss is significantly greater for
Group 1C than for Group 1E; on the
other variable (diagonal DRT minus
H-V DRT) the difference is not significant
at a convincing level, t (30) =
1.87, .05 < p < .1. However, the correlation
between the two variables is
quite significant, albeit modest. Pool- 
ing groups, one finds r = .53, p < .001. 
For Group 1E alone, r = .51, p < .05;
for Group 1C alone, r = .31 which is 
not significant. 
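
For readers who wish to relate the correlations above to the two per-S variables, a minimal sketch follows. It uses the standard conversion of r to a t with n - 2 df; whether the original analysis used this particular test is not stated in the text, and the function names are hypothetical.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def t_from_r(r, n):
    """t statistic (n - 2 df) for testing a correlation against zero."""
    return r * sqrt(n - 2) / sqrt(1 - r * r)


# xs would hold each S's slant difference (diagonal DRT minus H-V DRT),
# ys each S's transfer loss; one pair per S.
t_group_1e = t_from_r(0.51, 16)  # r reported for Group 1E, 16 Ss
```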

It is somewhat surprising that Group
1C shows a 50-50 split on the slant 
variable, half of the Ss being faster on 
diagonals, the other half on horizontals 
and verticals. The authors have no 
good reason to suppose that this repre- 
sents anything but a sampling differ- 
ence from the more extensive results 
of Attneave and Olson (1967), who 
found lower DRTs to (physical) ver- 
ticals and horizontals under similar 
conditions. Group 1E, on the other 
hand, is faster on diagonals (retinal 
horizontals and verticals), but not quite 
significantly so, t (15) = 1.92, .05 < p 
< .1.


Discussion


The results obtained strongly support 
the system of hypotheses set forth in the 
introduction. Optimal transfer occurs 
when there is an invariant relationship 
between naming response and perceived 
slant, from one situation to another. Per- 
ceived (phenomenal, subjective) slant is 
slant relative to a prevailing descriptive 
or reference system. The mapping of 
receptor inputs on to this system is vari- 
able; the system may be “rotated” relative 
to the input. Under normal conditions, 
as in the Attneave and Olson study or in 
the present control groups, the rotation is 
such as to achieve “constancy,” i.e., per- 
ceived slant equivalent to physical slant, 
and it occurs without conscious interven- 
tion on the part of the observer. The ref- 
erence system is subject to voluntary 
control, however, at least to the extent 
that it can be maintained in constant rela- 
tionship to the axes of the head. (This 
is not to say that more drastic forms of 
control are not possible.) 

The authors are well aware that the 
foregoing discussion is composed mainly 
of assertions about unobservable mediat- 
ing variables (orientation of reference 
system and perceived or phenomenal 
slant). If there is any possibility of 
understanding or describing economically 
the present results without recourse to 
such variables, we are entirely unable to 
see it. The conclusion that phenomenal 
orientation (rather than either retinal or 
physical orientation) is the critical ante- 
cedent of identifying behavior is entirely 
in agreement with the conclusion of Rock 
(1956) and Rock and Heimer (1957) 
from related studies. Moreover, Rock 
and Leaman (1963) have shown quite 
convincingly that a voluntarily assumed 
vertical, which need not be coincident with 
either the retinal or the gravitational 
vertical, may serve as the crucial determi- 
nant of perceived symmetry and the conse- 
quent effect of symmetry on similarity 
judgments. 

One can infer, with a little more cau- 
tion, that lines perceived as horizontal 
and vertical (for whatever reason) are 
identified faster than lines perceived as
diagonals. If this effect were not located 
at the phenomenal level, at least in part, 
there would be no basis for the signifi- 
cant correlation shown in Fig. 2, i.e., for 
the limited predictability of transfer loss 
from differential performance on diag- 
onals vs. horizontals and verticals before 
transfer. Some features of Fig. 2 are 
mildly puzzling, however. It is not ob- 
vious why, in the region between —200 
and 0 on the abscissa, there is a perfect 
vertical separation between the five ex- 
perimental points and the eight control 
points. An effect somewhat like this is 
to be expected if abscissa values contain 
more random error than ordinate values, 
i.e. if the transfer-loss variable is more 
closely related to an underlying frame 
of reference than is the slant-difference 
variable. As a limiting case, imagine all 
the points representing Ss with one frame 
of reference in one tight cluster, and all 
the points representing Ss with the other 
frame of reference in another tight cluster 
to the upper right, then introduce a great 
deal of random variation on X and some- 
what less on Y. The best guess, in 
accord with this interpretation, is that 
the five Ss of Group 1E who show a 
positive transfer loss employed a gravity- 
oriented frame of reference despite in- 
structions to the contrary, and differ only 
randomly from controls. 

The results of Attneave and Olson 
(1967) did not rule out the possibility 
that horizontals and verticals are proc- 
essed more quickly than diagonals at the 
projection level as well as at the phe- 
nomenal level, in which case the two 
effects might combine either additively 
(with head upright) or subtractively 
(with head tilted). The likelihood of this 
possibility is increased by the fact that 
our present Group 1C responded no faster 
to one slant than to another. Some 
interesting alternative hypotheses remain 
tenable, however. Suppose, e.g., that a
control S employed a dual frame of ref- 
erence and classified the four lines as (a) 
physically vertical, (b) physically horizontal,
(c) retinally vertical, or parallel
to the major axis of the head, and (d)
retinally horizontal, or parallel to a line
joining the eyes. By this system he might
well identify the four slants with equal
speed before transfer, but perform poorly
on the transfer task with head upright.


REFERENCES


ATTNEAVE, F., & OLSON, R. K. Discriminability of stimuli varying in physical and retinal orientation. Journal of Experimental Psychology, 1967, 74, 149-157.

ROCK, I. The orientation of forms on the retina and in the environment. American Journal of Psychology, 1956, 69, 513-528.

ROCK, I., & HEIMER, W. The effect of retinal and phenomenal orientation on the perception of form. American Journal of Psychology, 1957, 70, 493-511.

ROCK, I., & LEAMAN, R. An experimental analysis of visual symmetry. Acta Psychologica, 1963, 21, 171-183.


(Received August 25, 1967)
CONDITIONING AS GENERALIZATION PROCEDURE


IN-MAO LIU 


National Taiwan University 


If the stimulus complex consisting of the CS and UCS together with 
their temporal relationship is regarded as a training stimulus of a gen- 
eralization experiment, the CS plus a fraction of the UCS (retaining 
the temporal relationship) as well as the CS alone is a test stimulus. 
Using 12 experimental groups of 10 Ss each, S was presented a light 
(300 msec.) and a tone (15 or 75 db., 200 msec.), and was instructed 
to press a key upon hearing the tone. After a certain number of trials 
(1 or 9), S was presented the light plus a fraction of the tone (0, 10, 
or 200 msec.) on a test trial. The results together with those obtained 
from control Ss not only support the temporal generalization inter- 
pretation of classical conditioning but elucidate the process of how a 
voluntary response becomes involuntary. 


It may be plausible to regard classical
conditioning as a special case of
generalization. The reason is as fol- 
lows: The stimulus complex consisting 
of the CS and UCS together with their 
temporal relationship can be viewed as 
a training stimulus in a typical gen- 
eralization experiment. The CS pre- 
sented alone on a test trial in a condi- 
tioning experiment corresponds, then, 
to a test stimulus in a generalization 
experiment. The stimulus dimension
along which the training and test 
stimuli are differentiated is now a tem- 
poral one. In this sense, classical con- 
ditioning becomes a case of temporal 
generalization. 

The above interpretation of classical 
conditioning naturally suggests the fol- 
lowing experiment. Since the CS plus 
UCS constitutes the training stimulus, 
the CS plus a fraction of the UCS as 
well as the CS alone serves as a test 
stimulus, because it is still a portion of 
the training stimulus when considered 
along the temporal dimension. Second, 
conditioning being dependent on train- 
ing, its extension of the meaning (in- 
cluding response tendency to the CS 
plus a fraction of the UCS) should 
also depend on training. Third, if 
classical conditioning is a sort of temporal
generalization, other things being equal,
a weak UCS may facilitate temporal 
generalization. This deduction is 
based on the fact that, when a weak 
UCS is used, more weight will go to 
the CS part. In other words, the pro- 
portion of stimulus elements contained 
in the CS part becomes larger or more 
dominant. 

Because of the very nature of tem- 
poral generalization, the results of this 
experiment have bearing on how a vol- 
untary response becomes involuntary 
(Liu, 1968). If S starts to respond 
earlier than the time he has been in- 
structed to respond, one can say that 
S’s response becomes involuntary. 


METHOD 


Subjects.—The Ss were 160 junior high
school freshmen (80 boys and 80 girls). 
They had had no experience in participating 
in psychological experiments. They were 
not informed of the purpose of the experi- 
ment, and were asked not to discuss their 
experience with others after the experiment. 

Apparatus and procedure.—A typical conditioning
procedure involving a voluntary
response of key pressing was employed. The 
UCS was a 200-msec. tone of 1,000 cps produced
by a Maico audiometer. The CS was
a 300-msec. light produced by a 45-w. lamp 
through a red glass 6 cm. in diameter. 
The light source was at S’s eye level 50 cm. 
away. The interstimulus interval during
conditioning trials was 300 msec. 

The S was seated in a soundproof room. 
An air-conditioner in the room produced a 
constant level of noise. The S was in- 
structed as follows (in Mandarin). 


During the experiment, you will hear 
tones several times. Each time you hear 
a tone, please press this key [pointing 
to S] as soon as possible. Now try 
just once to press the key. [The S 
tries.] During the experiment, please
do not close your eyes and fall to sleep. 
I am now going to put earphones on 
you. You will hear tones only from 
the right ear. 


No mention of the light was made in the 
instructions. 

The Ss were divided into 12 experimental 
and 4 control groups according to order of 
their appearance. Special attention was paid 
to make each group of 10 Ss consist of five 
males and five females. 

The 12 experimental groups were used to 
form a 2 × 2 × 3 factorial design. The first
factor manipulated the tone intensity, 15 or 
75 db. The second factor differentiated the 
number of paired presentations of the light 
and tone, one or nine trials. The third 
factor referred to the duration with which 
the tone was presented on a test trial fol- 
lowing a given number of training trials. 
The tone duration used was 0, 10, or 200 
msec. A response obtained on a test trial
with the 0-msec. tone is known as a CR.
Since it was suspected that Ss might dis- 
cuss their experience with others after the 
experiment, though they were asked not to, 
a test trial with the 0-msec. tone was ad- 
ministered so that the 200-msec. tone fol- 
lowed 400 msec. after the light cessation. 
In this way, it is easy to see whether the 
obtained response on the test trial was a 
CR or not, without leaving Ss with some 
feeling of embarrassment. In accordance 
with the 2 × 2 × 3 factorial design, the following
group abbreviations consisting of
three characters will be used. The first, 
either L or H, refers to the low- or high- 
intensity tone. The second indicates 1 or 9 
training trials. The third is one of the 
three: 0, 10, and 200 (msec.). An S in
Group L-9-10, for instance, received nine
training trials, followed by a test trial on
which the 10-msec. tone was presented, all the
tones being of 15 db. It is to be noted that
the light was administered on every trial for
Ss in the experimental groups.
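
The labeling scheme for the 12 experimental groups can be enumerated mechanically. The following is only an illustrative sketch; the constant and function names are hypothetical, and the values are those given in the text (15 or 75 db., 1 or 9 training trials, test tone of 0, 10, or 200 msec., 10 Ss per group).

```python
from itertools import product

INTENSITIES = {"L": 15, "H": 75}   # tone intensity in db.
TRAINING_TRIALS = (1, 9)           # paired light-tone presentations
TEST_TONE_MS = (0, 10, 200)        # tone duration on the test trial
SS_PER_GROUP = 10


def group_labels():
    """Three-character labels such as 'L-9-10' for every cell of the design."""
    return [f"{i}-{n}-{d}"
            for i, n, d in product(INTENSITIES, TRAINING_TRIALS, TEST_TONE_MS)]


labels = group_labels()
# Within-cell (error) degrees of freedom for a between-Ss design.
error_df = len(labels) * SS_PER_GROUP - len(labels)
print(labels)
print(error_df)
```

With 12 cells of 10 Ss each, the within-cell degrees of freedom come to 108, the error df that appears in the F ratios of the Results section.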
No paired presentation of the light and
tone was administered to Ss in the control 
groups. For Ss in two out of the four con- 
trol groups, given tones of 15 or 75 db., the 
200-msec. tone was presented on the first 
trial followed by the administration of the 
10-msec. tone on the second trial. For Ss 
in the other two control groups, each dis- 
tinguished again on the basis of the tone 
intensity, nine trials of the 200-msec. tone 
were followed by a trial of the 10-msec. 
tone. For these two groups, between the 
first and second trials and between the 
ninth and tenth trials, the light was adminis- 
tered, with the latter separated from the pre- 
ceding and following tones by at least 2 sec. 

For all Ss, the average intertrial inter- 
val was 10 sec. On each trial, the reaction 
time measured from the light onset was 
obtained. For those Ss in the control 
groups, the reaction time, measured 300 msec. 
before the tone onset, was obtained. 


RESULTS 


Since the latency of a CR is usually 
measured from the CS onset, the reac- 
tion time of a generalized response in 
the present experiment was measured 
from the light onset. When S did not 
respond within 1,800 msec. after the 
light onset, the reaction time was 
assigned the value of 1,800 msec. On 
test trials of the 0-msec. tone, if the
reaction time exceeded 800 msec., it 
was assigned the value of 1,800 msec. 
This is because there was an empty in- 
terval of 400 msec. between the light 
cessation and the tone onset and be- 
cause reaction time to a tone generally 
takes more than 100 msec. Therefore, 
if the reaction time happened to be 
within 100 msec. after the tone onset, 
it was taken as a generalized response 
to the light, otherwise, it was not. 
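
The scoring rule just described can be sketched as a small function. This is an illustration under stated assumptions, not the scoring procedure actually used: the names are hypothetical, a missing response is represented by `None`, and the 800-msec. cutoff is derived as light duration (300) plus empty interval (400) plus the minimum plausible reaction time to a tone (100).

```python
CEILING_MS = 1800       # value assigned when no usable response occurred
LIGHT_MS = 300          # CS duration
GAP_MS = 400            # empty interval before the tone on 0-msec. test trials
MIN_TONE_RT_MS = 100    # a reaction to the tone is taken to need more than this
ZERO_TONE_CUTOFF_MS = LIGHT_MS + GAP_MS + MIN_TONE_RT_MS


def score_rt(rt_ms, zero_tone_test=False):
    """Scored reaction time in msec., measured from light onset.

    rt_ms is None when S made no response (an assumed representation).
    On 0-msec. test trials, responses later than the cutoff are treated
    as reactions to the delayed tone and scored at the ceiling.
    """
    if rt_ms is None or rt_ms > CEILING_MS:
        return CEILING_MS
    if zero_tone_test and rt_ms > ZERO_TONE_CUTOFF_MS:
        return CEILING_MS
    return rt_ms
```

On a 0-msec. test trial, for example, a response at 650 msec. is kept as a generalized response to the light, whereas one at 850 msec. is scored as 1,800 msec.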

Data from experimental groups.— 
Figure 1 presents the results of the 
experimental groups. As is evident 
from the figure, clear generalization 
gradients were obtained. An analysis 
of variance shows the tendency to be 
very significant, F (2, 108) = 5.93, 
p < .01. Each point on the abscissa
should be understood as the light plus
the given tone duration in question. 
The gradients then indicate that gen- 
eralized responses in terms of the reac- 
tion time vary in a regular fashion with 
increasing difference between test and 
training stimuli. 

According to Fig. 1 the gradient ob- 
tained from the L-1 groups (including 
L-1-0, L-1-10, and L-1-200) parallels 
that of the L-9 groups. Apparently 
the effect of more training shifted the 
gradient just downward. A similar 
tendency is apparent for the H-1 and 
H-9 groups. The analysis of variance 
indicates this tendency to be significant, 
F (1, 108) = 7.03, p < .01. 

The tone intensity effect was not significant,
F (1, 108) = .21. However, the interactions
Tone Intensity × Test Stimulus Duration and
Tone Intensity × Training showed some tendency
toward significance, F (2, 108) = 1.77, and
F (1, 108) = 2.30, respectively. Neither the
three-way interaction nor the interaction of
Test Stimulus Duration × Training was
significant, F (2, 108) = 1.07, and
F (2, 108) = .99, respectively.


[Figure 1 plot not reproduced. Abscissa: tone duration; ordinate scale from 400 to 1,000.]

FIG. 1. Temporal generalization gradients,
the training stimulus being a 300-msec. light
plus a 200-msec. tone. (T₀ = 0 msec.,
T₁ = 10 msec., and T₂ = 200 msec.)

Data from control groups.—It is a
well-known fact that simple reaction
time is long when the stimulus is very
weak and becomes shorter as the stimulus
increases to medium and high
intensities. Besides, simple reaction
time is known to decrease as the number
of trials increases. Therefore, the
results obtained from the experimental
groups might largely reflect these
effects only. In order to segregate
these effects, the data obtained from
the control groups were examined.

To match every experimental condition,
16 control groups would actually have
been required. However, some
conditions introduced at one stage of
training apparently did not affect other
conditions introduced at another stage
of training. Therefore, only the four
control groups described before suffice
for the present purpose.

The control condition for the 0-msec.
tone on test trials in four of the
experimental groups was, of course, to
present the CS (light) alone on test trials
following the corresponding number of
tone-alone trials. No S in the
control groups responded to the light.
Hence, no response (assigned the reaction
time of 1,800 msec.) served as the base
line against which to judge the significance
of the reaction time obtained on test
trials with the 0-msec. tone in those
experimental groups trained with
different tone intensities.

Table 1 includes the reaction time
data obtained under all the control
conditions. With respect to the control
condition of the 10-msec. tone, the
difference in tone intensity produced
a significant difference in reaction
time, F (1, 36) = 46.26, p < .01; the
difference in the amount of training
(one or nine trials) also produced a
significant difference, F (1, 36) =
6.65, p < .01. As to the control
condition of the 200-msec. tone, for Ss
in those two control groups who received
nine trials of the 200-msec. tone, the
data from the second and ninth trials
were used. Since the asymptote was
reached earlier, the data from the ninth
trials served the purpose. By making use
of the fact that the design was partly
within Ss, an analysis of variance shows
that the tone intensity and the amount of
training both produced significant
treatment effects, F (1, 18) = 26.83,
p < .01, and F (1, 18) = 19.17, p < .01,
respectively.

TABLE 1

COMPARISON OF MEAN REACTION TIME DATA (MSEC.)ᵃ OBTAINED UNDER
EACH PAIR OF EXPERIMENTAL AND CONTROL CONDITIONS

                              Training Trial 1                               Training Trials 9
                              Test Tone (msec.)                              Test Tone (msec.)
Condition           0               10              200              0               10              200
                   M     SD        M     SD        M     SD        M     SD        M     SD        M     SD

15 db.
Experimental     824    675      820    576      671    350      759    561      750    477      515    168
Control         1800      0     1710    285      874     60     1800      0     1319    534     747ᵈ     89
Difference     t(9) = 6.39**   t(18) = 4.38**  zᵇ = 2.27*      t(9) = 5.86**   t(18) = 2.52*   t(18) = 3.86

75 db.
Experimental    1270    685      821    568      495    134      688    592      411    108      418    139
Control         1800      0      844    368      698    107     1800      0    55[?]     68     630ᵈ     55
Difference     zᶜ = 1.51       t(18) = .11     t(18) = [?]     t(9) = 5.94**   t(18) = 6.02**  t(18) = 4.49**

ᵃ Reaction time was measured from the light onset. When there was no light, it was measured 300 msec. before the tone onset.
ᵇ The Mann-Whitney test was applied because the variances are not homogeneous.
ᶜ Underlying distribution is bimodal, hence the Mann-Whitney test was used.
ᵈ Data of the ninth trials, since on the next trials the same Ss were tested with the 10-msec. tone.
* p < .05.
** p < .01.

Comparison of data from experimental
and control groups.—In order
to assess the effect of adding the CS 
for the experimental groups, it is na- 
tural to compare the data obtained 
from each experimental group with 
those obtained under the correspond- 
ing control condition. For instance, 
the data from Group H-1-10 should 
be compared with the data from those 
Ss who received a trial of the 75-db. 
UCS alone and a test trial consisting 
of the administration of the 10-msec. 
tone of 75 db. 


In Table 1, the mean reaction time of
each experimental group is compared
with that obtained from the corresponding
control condition. The latter
serves as a base line from which
the magnitude, or at least the presence
or absence, of temporal generalization
under each experimental condition
should be judged. As is apparent from
Table 1, decreases in the mean reaction
time from the control to the experimental
condition are significant for all the L
groups (L-1-0, L-1-10, L-1-200,
L-9-0, L-9-10, and L-9-200).
With respect to the H groups, decreases
in the mean reaction time are
significant for all the H-9 groups.
When there was only one training trial
of the paired presentation of the CS
and UCS, however, decreases in the
mean reaction time are not significant
for Groups H-1-0 and H-1-10, though
significant for Group H-1-200.


DISCUSSION


The procedure of temporal generalization
is somewhat implicit in the experiments
of, e.g., McAllister (1953), Boneau
(1958), Ebel and Prokasy (1963), and
Leonard and Theios (1967). By manipulating
the CS-UCS interval, these experiments
attempted to determine the nature of CR
formation at CS-UCS intervals other than
those used in the original training.

In the present experiment, the validity
of the concept of temporal generalization
is supported by the clear generalization
gradients obtained with the data of the
experimental groups. Just as the CR in
classical conditioning is known to depend
on the number of training trials,
the generalized response, which extends
the meaning of the CR, depends on the
amount of training in a similar way.

The UCS intensity effect was not obtained
with the data from the experimental
groups. This is due to the fact
that the H-1 groups showed poor temporal
generalization. In order to find the
degree of temporal generalization, it was
necessary to compare the results obtained
from the experimental and control groups.
The comparison demonstrated that lack of
the UCS intensity effect was due primarily
to the absence of temporal generalization
in Groups H-1-0 and H-1-10.
Since temporal generalization was obtained
with Group H-1-200, this means
that in order to obtain temporal generalization
under the H-1 condition a tone
duration much longer than 10 msec. has
to be presented with the CS on the test
trials.

On the other hand, under the L-1
condition, the presence of temporal
generalization was clearly demonstrated.
The different results obtained under the
H-1 and L-1 conditions can be understood
if one transforms the events in
question (CS and UCS), which take place in
the temporal continuum, into spatially
occupied events. Then, the degree of
similarity of the CS plus a fraction of the
weak UCS to the CS plus the weak UCS
is intuitively larger than the degree of
similarity of the same CS plus a fraction of
the strong UCS to the same CS plus the
strong UCS. This can be stated in rigorous
mathematical language as follows.
If E₁ is contained in E₂, define the degree
of similarity of E₁ to E₂ by E₁/E₂, and
let H = strong UCS, H/k = a fraction
of the strong UCS; L = weak UCS,
L/k = a fraction of the weak UCS; and
F = CS. Then, by a simple algebraic
manipulation, since k > 1 and H > L, the
following inequality holds:


(F + H/k)/(F + H) < (F + L/k)/(F + L).
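
The algebra behind this inequality can be written out as a short check. Cross-multiplying gives (F + L/k)/(F + L) - (F + H/k)/(F + H) = F(H - L)(1 - 1/k) / [(F + H)(F + L)], which is positive whenever F > 0, H > L > 0, and k > 1. The Python sketch below verifies this with exact fractions; the function names and the numerical magnitudes are hypothetical choices made only for illustration and are not values from the experiment.

```python
from fractions import Fraction
from itertools import product


def similarity(cs, ucs, k):
    """Degree of similarity of (CS + UCS/k) to (CS + UCS): (F + U/k)/(F + U)."""
    cs, ucs, k = Fraction(cs), Fraction(ucs), Fraction(k)
    return (cs + ucs / k) / (cs + ucs)


def inequality_holds(cs, strong, weak, k):
    """True when the strong-UCS similarity is below the weak-UCS similarity."""
    return similarity(cs, strong, k) < similarity(cs, weak, k)


# Illustrative magnitudes only (assumed, not from the report); every H exceeds every L.
grid = product([1, 5, 300], [75, 200], [1, 15], [2, 20])
all_hold = all(inequality_holds(F, H, L, k) for F, H, L, k in grid)
```

The check relies on H > L; with H = L the two ratios are equal, and with k = 1 both equal 1.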


With more training trials, temporal
generalization was obtained with the H-9
groups as well as the L-9 groups.
The present experiment in effect
demonstrated that temporal generalization is
determined not only by the similarity of
test to training stimulus but also
by the amount of training.

In the present experiment, it is to be 
noted that one is dealing exclusively with 
the evocative characteristic of a response 
(Liu, 1957). For this purpose, the con- 
clusion depends only on reaction time 
data. A decrease in reaction time implies 
that S is responding to a temporal frac- 
tion of the original stimulus. To put it 
another way, S is departing from what 
he has been originally instructed to do. 
In this sense, a decrease in reaction time 
means that a voluntary response becomes 
involuntary (Liu, 1968). According to 
the present results, so far as the initia- 
tion component of voluntary response is 
concerned, the process of voluntary re- 
sponse becoming involuntary depends not 
only on the similarity of test to training 
stimulus (regarded as temporal entities) 
but also on the amount of training. 
MEMORY AND PROBLEM SOLVING:
A FAILURE TO REPLICATE *


MORTON W. WEIR
University of Illinois


9-yr.-olds were given a 3-alternative probabilistic task in which 1
group (memory) was allowed to keep an accurate record of past
responses and their outcomes by means of a pegboard. A 2nd group
also placed a peg following each choice trial, but the pegboard was
designed so as to make the peg placements uninformative regarding
past responses and their outcomes. A 3rd (control) group had no
pegboard at all. Results of a previous study (Weir, 1967) which
indicated that Ss in the memory condition were able to make use of
the recorded information by rejecting ineffective response patterns
were not replicated. Additional, more detailed analysis also failed to
confirm the earlier findings.


In an earlier study (Weir, 1967), 9-yr.-olds
were allowed to keep an accurate record of
past responses and their outcomes in a
three-choice probabilistic task in which choices of
one of the three alternatives paid off on a
66% schedule and choices of the other two
alternatives never paid off. It was predicted
that this aid to memory would permit
9-yr.-olds to reject the simple pattern strategies which
ordinarily dominate their responding in this
task (e.g., left, middle, right or right,
middle, left response patterns) and to replace
this pattern responding with more frequent
choice of the payoff alternative. This
prediction was confirmed, and since the
resulting maximizing behavior is known to be
characteristic of adults in this task (Weir,
1964), it was suggested that the 9-yr.-old
Ss were able to make use of the information
provided by the memory aid in a manner
similar to that in which the adult makes use
of his own memory.

This interpretation is clouded somewhat by
the fact that, like adults, Ss younger than
9 yr. also tend to maximize their choice of
the payoff alternative (Weir, 1964). It is
therefore possible that the memory aid
served only to distract the 9-yr.-old Ss, and
their maximizing behavior was actually a
regression to a more primitive form of
behavior rather than behavior which was more
adult-like.

... Sister Mary Ida, OSF, Principal, and the teachers and
pupils of St. Matthew’s School and Katherine Cox,
Principal, and the teachers and pupils of Carrie Busey
School for their kind cooperation in making these
studies possible.

The present experiment was designed to
examine this possibility by requiring
9-yr.-olds to make the same record-keeping
response after each choice trial (i.e., placing
a peg in a pegboard), but in such a way that
they were unable to maintain an accurate
record of past responses and their outcomes.
The results of this study, hereafter re- 
ferred to as Replication 1, were of such a 
nature that a second, identical experiment 
was also performed (Replication 2). 

Method.—Forty-five children with an 
average chronological age of 9.9 yr. served 
as Ss in Replication 1, while 45 children 
from another school (average age 9.8 yr.) 
served as Ss in Replication 2. 

The apparatus has been described in detail
elsewhere (Stevenson & Zigler, 1958).
It consisted of a yellow panel containing a
horizontal row of three black knobs.
Centered above the knobs was a signal light
and centered below the knobs was a delivery
hole for marbles. The marbles fell into an
enclosed, clear plastic container. The device
used by Ss for recording past responses and
their outcomes (the memory board) has
been described in detail previously (Weir,
1967). Essentially, it consisted of a white
Plexiglas pegboard containing 30 rows of
three holes each. The holes in each row
were connected by a straight black line,
making it obvious that they formed a single
row. Four such boards were provided, along
with a supply of both pink and black pegs.

The pegboard which did not allow Ss to
keep a complete record of past responses and
their outcomes contained 30 rows of six
holes each and was 16 in. long and 9 in.
wide. This pegboard was otherwise identical
to those described previously, except that
no black lines connected the holes in each
row. Only pink pegs were supplied with
this pegboard.

Each S was seated in front of the appara- 
tus and was told that when the signal light 
went on, he was to push one of the three 
knobs. He was also told that if he was 
correct, a marble would fall into the en- 
closed plastic container, and that the object 
of the game was to win as many marbles 
as possible. The Ss in the control (C) 
condition were given no further instructions 
and the experimental trials were begun. 

The Ss in the memory (M) condition 
were shown one of the memory boards and 
told that it would help them remember what 
they had done in the task. They were in- 
structed to place a peg in the hole corre- 
sponding to the position of their choice (left, 
middle, or right) ; a black peg when a mar- 
ble was won, and a pink peg when no 
marble was dispensed. On each successive 
trial they advanced to the next row below 
for their peg placement. This procedure 
allowed them to maintain an accurate record 
of all past responses and their outcomes. 
The memory board was placed beside the 
choice apparatus, and after one board was 
filled, another was placed beside it and the 
trials continued without interruption. 

The Ss in the large board (LB) condition 
were told that after each choice response 
they were to place a peg in the pegboard 
to “keep track of the number of times they 
had pushed the knobs.” They were not in- 
structed to try to record the position of 
their responses nor their outcomes, and as 
far as Es could determine, no S attempted 
to keep such a record. 

In all groups, one alternative was reinforced
66% of the times it was chosen,
while the other two never paid off. The
reinforcement schedule was identical to that
used in the previous study (Weir, 1967).
For each S, one of the three alternatives
was designated as correct, and the position
of the reinforcing alternative was equalized
as nearly as possible within each of the
experimental groups. All Ss were given 120
trials. Since the intertrial interval for the
M and LB groups included the time taken
for peg placement, the pace of the C group
was slowed so as to be approximately equal
to that of the other two groups. As in the
previous experiment, two Es were present
during the study. One E instructed S and
operated the choice apparatus. The second
E sat beside and slightly behind S, recorded
his responses, and aided with peg placement
in the M group when necessary.

Results.—A one-way analysis of variance
performed on the number of choices of the 
payoff alternative made by Ss in Replication 
1 indicated no significant difference asso- 
ciated with experimental groups, F (2, 42) 
= 2.92, p>.05, with means of 69.3, 84.3, 
and 80.6 for the LB, M, and C groups, re- 
spectively. The comparison between the M 
and C conditions is of particular interest. 
Although the difference between these two 
groups is in the same direction as that re- 
ported in the previous study (Weir, 1967), 
the magnitude is much less. Means for the 
M and C groups in the earlier study were 
88.5 and 75.1, respectively. 

Since the difference between the M and 
C conditions in the 9-yr.-age group was the 
key finding of the earlier study, a second 
replication appeared warranted. The second 
replication was identical to the first except 
that it was carried out on 45 9-yr.-olds from 
a different school. Results again indicated 
that there were no significant differences 
among the three experimental groups in the 
number of choices of the payoff alternative 
(means of 79.1, 76.3, and 73.5 for the LB, 
M, and C conditions, respectively). When
the two replications were combined into a
single 2 × 3 × 6 analysis of variance
(Replications × Groups × Blocks of 20 Trials),
the only significant effect was associated with
Blocks of Trials, F (5, 420) = 119.5, p < .001.
No other main effects or interactions were
significant.
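
The degrees of freedom reported for the combined analysis follow from the design, and the bookkeeping can be sketched as below. The function `mixed_anova_df` and its argument names are hypothetical, and the sketch assumes a conventional mixed design (between-Ss factors crossed with one repeated factor) with every S contributing all six blocks; the report itself does not spell out the error terms.

```python
def mixed_anova_df(n_subjects, between_levels, within_levels):
    """Degrees of freedom for a mixed design with one repeated factor.

    between_levels: number of levels of each between-Ss factor.
    within_levels: number of levels of the repeated factor.
    """
    cells = 1
    for levels in between_levels:
        cells *= levels
    between_error = n_subjects - cells       # Ss within groups
    within_effect = within_levels - 1        # repeated factor
    return {
        "between_error": between_error,
        "within_effect": within_effect,
        "within_error": between_error * within_effect,
    }


# 90 Ss, 2 replications x 3 groups between Ss, 6 blocks of 20 trials within Ss.
df = mixed_anova_df(90, (2, 3), 6)
```

With these inputs the repeated factor has 5 df and its error term 420 df, matching F (5, 420); the between-Ss error term has 84 df.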

As in the prior study (Weir, 1967), the
number of left, middle, right (LMR) and
right, middle, left (RML) response patterns
made by each of the three experimental
groups was also examined by entering the
total number of such patterns for each S
into a 3 (Conditions) × 2 (Replications)
analysis of variance. The outcome of this
analysis indicated that neither main effect
approached significance (F < 1 in both
cases), but the Replications × Conditions
interaction was significant, F (2, 84) = 3.66,
p < .05. This interaction was clearly the
result of a considerable difference in the
performance of the LB groups in the two
replications, relative to the performance of the
other two groups. In Replication 1, the
LB group made fewer LMR and RML
response patterns (7.3) than did the same
group in Replication 2 (11.9). This trend
was reversed for the other two conditions,
however, with the M groups making an
average of 9.1 and 6.6 response patterns in
Replications 1 and 2, respectively, while the
C groups made an average of 11.9 and 7.0
patterns in the two replications. This pattern
of scores lends itself to no ready
explanation, but it should be noted that in neither
replication were the two groups of major
interest, the M and C conditions,
significantly different from one another in the
number of response patterns made (t < 1
in both cases).
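
One way such pattern counts might be obtained from a record of choices is sketched below. The report does not state how runs were scored, so the rule used here (every window of three consecutive responses is examined, and windows may overlap) is an assumption, as are the function name `count_patterns` and the single-letter coding of responses.

```python
def count_patterns(responses, patterns=("LMR", "RML")):
    """Count LMR and RML runs in a sequence of 'L', 'M', 'R' choices.

    Assumed scoring rule: every window of three consecutive responses is
    checked, so windows may overlap (e.g. L M R M L contains both patterns).
    """
    seq = "".join(responses)
    return sum(seq[i:i + 3] in patterns for i in range(len(seq) - 2))


example = count_patterns("LMRLMR")  # two LMR runs
```

A stricter rule that forbids overlapping windows would give lower counts for sequences such as L M R M L, so the choice of rule matters when totals are compared across studies.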

Thus, in neither the first nor the second
replication were the previously observed
differences between the M and C conditions
repeated. In an attempt to determine if
other, more detailed aspects of the
performance of the M and C conditions in the two
replications were similar to those in the
previous study, several sequential dependency
analyses were performed. The probability of
a response to the payoff alternative was
calculated as a function of the number of prior
responses to that alternative or either of the
other two alternatives, and their outcomes
(reinforced or nonreinforced). No
systematic differences in these measures were
found among the M, C, and LB conditions
in Replications 1 and 2. Further analysis of
the data from the original study (Weir,
1967), however, indicated that there were
consistent differences in these sequential
dependencies for the 9-yr.-olds in the M and
C conditions in that experiment. The
probability of repeating a high side response was
higher for Ss in the M condition following
both reinforced trials and nonreinforced
trials than it was for control Ss. Following
a response to either of the two
nonreinforcing alternatives, however, Ss in the C
condition were more likely to choose the
payoff alternative than were M-condition Ss.
These results are consistent with the
findings reported in the earlier study that Ss
in the M condition chose the payoff
alternative more frequently and made fewer LMR
and RML patterns than did Ss in the C
condition.

Thus, for the sequential dependency analy- 
ses as well as for the analysis of the more 
gross aspects of Ss’ behavior, the two repli- 
cations showed no significant differences be- 
tween the M and C conditions. When small 
differences did occur between these two 
groups they were in the same direction as 
the differences reported in the previous 
study, but in no case did the difference reach 
statistical significance in either replication. 
The reason for this failure to replicate is 
not clear. One possibility is that the three 
different schools from which Ss were drawn 
in the original study and the two replications 
represent different ability levels. A large 
ability difference is unlikely, however, as 
all three schools are in the same general loca- 
tion in the city, and all are located in middle- 
and upper-middle-class neighborhoods. If 
any ability difference exists among the pop- 
ulations in these three schools, it would 
probably be slightly in favor of the two 
schools represented in the replications. Fur- 
ther research is necessary to explore this 
possibility. At present, however, it appears 
that the most reasonable explanation for the 
earlier finding regarding the effect of a 
memory aid on the problem-solving behavior 
of 9-yr.-olds is that it was a chance occur- 
rence and not the result of the independent 
variables under manipulation. 
EFFECTS OF NUMBER OF RELEVANT DIMENSIONS IN
DISJUNCTIVE CONCEPT LEARNING


NANCY JANE LOONEY AND ROBERT C. HAYGOOD


Kansas State University


The purpose of this study was to determine the relative effects of
increasing the number of relevant stimulus dimensions in a concept for
different conceptual rules and instructional conditions. The results
indicate that the effect with inclusive disjunction is similar to that
previously found for conjunction, a linear decrement in performance.
However, the rate of decrement is greater for disjunctive than for
conjunctive problems. Similar results were obtained in a comparison
of instructional conditions. Although the linear decrement with
increases in number of relevant dimensions appears to be quite general,
the rate of decrement depends highly on the exact conditions of the
conceptual problem.


The increase in difficulty of learning a 
concept as the concept becomes more com- 
plex has been well documented. One im- 
portant variable governing concept com- 
plexity is the number of relevant dimensions 
(NR) in the concept. Walker and Bourne 
(1961), using visually presented geometric 
designs, found an accelerating increase in 
problem difficulty with increases in NR. 
Similarly, Bulgarella and Archer (1962), 
using abstract auditory signals as stimuli, 
found an increase in problem difficulty with 
increases in NR, though in this case the 
increase was linear. NR has also been used 
as a control variable in a variety of other 
studies (e.g., Bourne & Haygood, 1959) with
essentially the same results. 

The above studies have all used simple
affirmation (e.g., red) for problems with
one relevant dimension and conjunction (e.g.,
red and square) for problems with two
or more relevant dimensions. The only study
of the effects of NR with other than a
conjunctive rule was done by Kepros and
Bourne (1966), using the biconditional rule
(e.g., red if and only if square) with two,
three, and four relevant dimensions. Kepros
and Bourne found a linear increase in
biconditional problem difficulty with increased
NR. However, since they used only the
biconditional rule, the relative effects of
NR in biconditional and conjunctive
problems could not be assessed. The effects of
NR with other nonconjunctive rules, such as
inclusive disjunction (e.g., either red or
square or both), have not been studied,
although it is known that these rules are more
difficult than conjunction (Haygood &
Bourne, 1965; Neisser & Weene, 1962).
Thus, for practical purposes, the relative
effects of increasing NR with nonconjunctive
rules are unknown.
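
The rules named above can be stated as simple predicates over a stimulus. The Python sketch below is offered only as an illustration: the function names, the dictionary representation of a stimulus, and the example attribute values are assumptions and do not describe the materials of any of the cited studies. The biconditional is shown for two attributes only, since its extension to more attributes is not defined in the text.

```python
def conjunction(stimulus, relevant):
    """Positive instance only if every relevant attribute is present."""
    return all(stimulus[dim] == value for dim, value in relevant.items())


def inclusive_disjunction(stimulus, relevant):
    """Positive instance if at least one relevant attribute is present."""
    return any(stimulus[dim] == value for dim, value in relevant.items())


def biconditional(stimulus, first, second):
    """Two-attribute biconditional: first is present if and only if second is."""
    (dim1, value1), (dim2, value2) = first, second
    return (stimulus[dim1] == value1) == (stimulus[dim2] == value2)


# Hypothetical stimulus: a red circle on a plain background.
red_circle = {"color": "red", "shape": "circle", "texture": "plain"}
red_and_square = {"color": "red", "shape": "square"}
```

For this red circle, "red and square" is false, "red or square or both" is true, and "red if and only if square" is false; adding further entries to `relevant` raises NR without changing the rule.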

All previous studies of NR seem to have
used instructions of the type characterized
by Haygood and Bourne (1965) as
attribute identification (AI). In AI, S
receives thorough instruction on the correct
rule for the problem, and his task is to
learn the correct attributes. Two other
instructional conditions have been studied
experimentally. In rule learning (RL), S is
told the relevant attributes and must discover
the correct rule of combination. In
complete learning (CL), S is told neither the
rule nor the attributes and must learn both.
Haygood and Bourne found CL to be the
most difficult, AI intermediate, and RL the
easiest in a typical concept-identification
problem. This order of difficulty was
confirmed in a subsequent study (Haygood &
Stevenson, 1967). The effects of NR under
RL and CL instructional conditions have
not been studied experimentally.

The overall effects of NR, type of
conceptual rule, and instructional condition are,
in themselves, predictable from previous
studies. The purpose of the present study
was to compare the effects of NR under
different types of rules and instructional
conditions. Conjunction and inclusive
disjunction were chosen for study. The
biconditional rule was eliminated because of its
extreme difficulty for naive Ss under CL
instructions. One other basic rule, the
conditional (e.g., if red then square), was
eliminated because it does not generalize
unambiguously to three and four relevant
dimensions.

Method.—The Ss were 108 students from
introductory psychology classes at Kansas
State University who received class credit
for participation. Three Ss were assigned
to each of 36 treatment combinations in order
of appearance. Three Ss were unable to
solve the problem during the allotted hour
and were replaced. Three additional Ss
were replaced for reasons unrelated to the
experimental treatments, such as failure to
follow instructions.

The design of the experiment was a 3 × 3 ×
2 × 2 factorial with three levels of NR
(two, three, and four relevant dimensions),
three instructional conditions (RL, AI, and
CL), two conceptual rules (conjunction and
inclusive disjunction), and two different sets
of relevant attributes at each level of NR.
In all cases there was one irrelevant
dimension (background texture).
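
The layout of the design can be enumerated directly, which also shows where the 72 error degrees of freedom in the later F ratios come from. The sketch below is illustrative only: the variable names and the labels "set 1" and "set 2" for the attribute sets are hypothetical, and the error term is assumed to be the usual within-cell term of a completely randomized factorial.

```python
from itertools import product

NR_LEVELS = (2, 3, 4)                        # number of relevant dimensions
INSTRUCTIONS = ("RL", "AI", "CL")
RULES = ("conjunction", "inclusive disjunction")
ATTRIBUTE_SETS = ("set 1", "set 2")          # hypothetical labels
SS_PER_CELL = 3

# Every treatment combination of the 3 x 3 x 2 x 2 factorial.
cells = list(product(NR_LEVELS, INSTRUCTIONS, RULES, ATTRIBUTE_SETS))
n_cells = len(cells)
n_subjects = n_cells * SS_PER_CELL
error_df = n_subjects - n_cells              # Ss within cells
```

This gives 36 cells, 108 Ss, and 72 error df, in agreement with the figures reported in the text.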

The experimental task and apparatus were 
essentially the same as those described by 
Haygood and Bourne (1965). The Ss were 
required to learn the correct method of sort- 
ing stimulus patterns (geometric designs) 
to a criterion of 25 consecutive correct re- 
sponses. After S was given detailed instruc- 
tions appropriate to his instructional and rule 
condition, the stimulus cards were presented 
manually one at a time. To each card, S 
was required to say “X” or “not-X” to 
indicate example or nonexample of the con- 
cept. Following each response, E gave 
feedback by placing the stimulus card in one 
of two labeled wooden boxes, and saying 
“wrong” if S’s response was incorrect. 
Thus S was able to see the last card in each
category. The S was given as much time 
as needed to respond and instructions 
stressed accuracy rather than speed. 

Results and discussion.—In the analysis of
variance of trials to last error, three of the
main effects were significant. Problem
difficulty increased as NR increased, F (2,
72) = 18.49, p < .01. This function was
essentially linear, F_lin = 36.95, p < .01, with
the linear component accounting for 99.9%
of the variance. The two conceptual rules
differed in difficulty with inclusive
disjunction the hardest, F (1, 72) = 24.69, p < .01.
Instructional condition affected performance
with CL the hardest and RL the easiest, F
(2, 72) = 92.58, p < .01. These results are
consistent with those of previous studies
(Haygood & Bourne, 1965; Neisser &
Weene, 1962; Walker & Bourne, 1961).
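
The 99.9% figure can be recovered from the two F ratios alone. If the linear trend (1 df) and the overall NR effect (2 df) are tested against the same error term, then the share of the NR sum of squares carried by the linear component is F_lin / (2 × F). The sketch below works this out; the function name `trend_share` is hypothetical, and the common error term is an assumption, since the report does not state it.

```python
def trend_share(f_trend, f_effect, df_effect):
    """Share of an effect's sum of squares carried by a 1-df trend component.

    Assumes both F ratios use the same error mean square, so that
    SS_trend / SS_effect = F_trend / (df_effect * F_effect).
    """
    return f_trend / (df_effect * f_effect)


# Values reported for the NR main effect and its linear component.
linear_share = trend_share(36.95, 18.49, 2)
```

The result is approximately .999, which agrees with the reported 99.9%.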

As noted above, two of the interactions
were of primary importance in this study.
The effect of increasing NR was greater
with inclusive disjunction than with
conjunction, F (2, 72) = 16.50, p < .01. This
interaction is shown in Fig. 1. Similarly, the
NR × Instructions interaction, shown in Fig.
2, was significant, F (4, 72) = 3.71, p < .01.
The effect, linear in all cases, was greatest in
CL and least in RL. The dip in the curve
of NR for conjunction was caused by
unexpectedly superior performance of the
AI and CL groups at the level of three
relevant dimensions, and was partially
responsible for a significant NR × Rules ×
Instructions interaction, F (4, 72) = 4.47, p < .01.
This result probably represents sampling
error. A new set of 12 Ss was run for
these conditions and for these Ss the dip in
the curve vanishes. When performance on
conjunctive problems was analyzed
separately, no significant effect of NR was found
using either set of Ss. Thus the present
data are not strictly comparable to previous
results; this finding is in part attributable to
the unreliability from smaller numbers of
Ss, and may also be related to inclusion of
instructional conditions not previously
studied. It should be noted that only 18 Ss
of the present experiment were run under the
AI-conjunction conditions of previous
studies.

[Fig. 2 caption fragment: ... NR for three instructional conditions.]

These results indicate that the effect of
NR with inclusive disjunction is much the
same as that found with conjunction and
biconditional rules, a linear decrement in
performance. The rate of decrement,
however, clearly depends on the difficulty of the
rule, increasing as rule difficulty increases.
In the same way, the more difficult the
instructional condition, the greater the
decrement from increased NR.

Thus, it appears that effects of NR,
previously established under limited conditions,
are quite general. However, the exact
increase in problem difficulty with increases in
NR is highly dependent on the exact nature
of the conceptual problem; the more difficult
the problem is in other respects, the greater
the effect of NR.
GENERALIZATION GRADIENTS OBTAINED FROM INDIVIDUAL
SUBJECTS FOLLOWING CLASSICAL CONDITIONING ¹


SHEPARD SIEGEL, ELIOT HEARST, NANCY GEORGE, AND EDGAR O'NEAL


University of Missouri, Columbia


After classical eyelid conditioning to an auditory CS, rabbits were
given nonreinforced trials with several tonal frequencies. Decremental
generalization gradients were obtained, with Ss responding most often
to frequencies at or near the CS and less often to values farther from
the CS. These gradients were reliably obtained from individual Ss,
as has previously been shown for operant conditioning.


Generalization gradients obtained in the
classical conditioning situation have typically
been presented as group averages,
decremental functions having been reported, with
response strength to the generalization test
stimuli being proportional to their physical
distance from the CS used during training
(e.g., Hovland 1937).

¹ ... -13479, and by research ...
of Missouri. The data contained in this study were
presented at the May, 1967 meeting of the Midwestern
Psychological Association, Chicago, Illinois.

Loucks (1933) and Razran (1949) sug- 
gested that such gradients may be an artifact 
of the averaging of data from many Ss, and
do not represent a real process within the 
individual S. Furthermore, recent studies 
using human Ss and GSR conditioning (e.g., 
Epstein & Burstein, 1966; van Olst & 
Orlebeke, 1965) have obtained generalization 
“gradients” other than the typical decre- 
mental functions. 

Considering the ubiquitous use of the
concept of the generalization of classically
conditioned responses (CRs) in theoretical
interpretations of complex learning situations (Hilgard & Bower,
1966; Mostofsky, 1965), it would appear 
that the phenomenon should have a firmer 
empirical basis. The purpose of the present 
investigation was to determine if reliable 
generalization gradients could be obtained 
in individual infrahuman Ss after eyelid con- 
ditioning. 

Method.—The Ss were 20 male New Zea- 
land white rabbits, weighing 2-3 kg. Condi- 
tioning was carried out in a sound-attenuated 
chamber. The outer eyelid response was
graphically recorded by a modification of
the technique described by Gormezano
(1966). The UCS was a 4-ma. shock to 
S’s left cheek, lasting .1 sec. The shock was 
delivered through a pair of chronically im- 
planted wire electrodes, mounted approxi- 
mately 1 cm. apart and 1 cm. below the left 
eye. 

The CS was a pure tone at 75 db. above
.0002 dynes/cm². Five independent groups
of four Ss each were trained with one of 
five different CS frequencies, either 500, 
1,000, 2,000, 3,000, or 4,000 cps. The CS 
duration was 1.1 sec., the last .1 sec. of 
which overlapped the UCS, the result being a 
CS-UCS interval of 1 sec. A CR was de- 
fined as an eyelid closure occurring during 
the CS-UCS interval and sufficient to cause 
a recording pen deflection of at least 1 mm. 
The intertrial interval was either .5, 1, or 1.5 
min. (with an average of 1 min.) and the 
different intervals were presented in a pre- 
determined random order. 

All Ss received 100 daily trials of acquisi- 
tion with continuous reinforcement for four 
days. On the fifth day, Ss received an ad- 
ditional 25 acquisition trials, immediately 
followed by 75 extinction trials. Generaliza- 
tion testing was carried out with a modi- 
fication of Guttman and Kalish’s (1956) op- 
erant generalization testing technique, in 
which all Ss are presented with all test stim- 
uli during an extinction session. Each of the 
five CS values was randomly presented to 
each S within each block of five extinction
trials. Fifteen such blocks of extinction
trials were given to each S.
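
The block-randomized test order described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function name `make_test_schedule` and the use of a seeded pseudo-random generator are hypothetical, and the report does not say how the random orders were actually produced.

```python
import random

# The five test tones (cps) and the number of blocks are taken from the text.
CS_FREQUENCIES_CPS = [500, 1000, 2000, 3000, 4000]
N_BLOCKS = 15


def make_test_schedule(rng, frequencies=CS_FREQUENCIES_CPS, n_blocks=N_BLOCKS):
    """Return a flat trial list in which every block of five trials
    contains each test frequency exactly once, in shuffled order."""
    schedule = []
    for _ in range(n_blocks):
        block = list(frequencies)
        rng.shuffle(block)  # hypothetical randomization method
        schedule.extend(block)
    return schedule


schedule = make_test_schedule(random.Random(0))
print(len(schedule))  # 15 blocks x 5 tones = 75 extinction trials
```

Fifteen blocks of five trials give the 75 extinction trials mentioned for the fifth day, and each tone is tested 15 times per S.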

Results and discussion.—During the last
25 conditioning trials, administered on the
final day of training prior to the generalization
test trials, the mean percentage of
CRs for groups trained at 500, 1,000, 2,000,
3,000, and 4,000 cps was 85%, 80%, 100%,
69%, and 66%, respectively. An analysis of
variance indicated no significant differences
in this terminal acquisition performance
(F < 1).

The groups also did not differ markedly
in level of responding during the extinction
test trials. The mean percentage of CRs
during the generalization test for groups
trained at 500, 1,000, 2,000, 3,000, and 4,000
cps was 26%, 23%, 30%, 25%, and 20%,
respectively. These group differences did
not approach statistical significance (F < 1).

Figure 1 depicts the mean generalization
gradients obtained from groups trained at
each CS value. Each of the five graphs represents
the mean generalization gradient
obtained from the four Ss trained at one of
the CS values. The ordinate is the percentage
of total generalization test responses
occurring to each of the five generalization
test stimuli.2 As may be seen
from examination of Fig. 1, all groups
yielded decremental gradients, peaking at
the CS used during training, and falling off
at frequencies increasingly more distant
from the value of the acquisition CS.

Individual generalization gradients obtained
from each of the 20 Ss are presented
in Fig. 2. Sixteen of the 20 individual generalization
gradients display maximal responding
to the appropriate CS.

The results of this investigation indicate
that decremental generalization gradients can
be reliably obtained with infrahuman Ss
after eyelid conditioning, in contrast to several
irregular functions obtained from human
Ss and GSR conditioning (e.g., Epstein
& Burstein, 1966; van Olst & Orlebeke,
1965). Furthermore, gradients clearly appear
in individual Ss, which argues against
suggestions (Loucks, 1933; Razran, 1949)
that decremental functions result only from
averaging groups of Ss.


... responses occurring to each of the test stimuli for each S.
(Each of the five columns contains the generalization test data
for the four Ss trained with one value of the CS. The arrow
indicates the training stimulus for each S.)


2 Other measures of response strength, such as absolute
frequency of CR or CR magnitude (Pennypacker,
1964), yielded similar results, although these indexes
were more variable than the percentage measure
reported.
The general method used by Guttman and
Kalish (1956) for obtaining individual
gradients after operant conditioning proved
practical for studying classical conditioning,
even though a continuous reinforcement
schedule was used here in contrast to the intermittent
schedules employed by Guttman
and Kalish.
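
The percentage measure plotted on the ordinate of the gradients (each S's responses to a test tone as a share of that S's total test responses) can be sketched as follows. The CR counts below are invented purely for illustration and are not data from the study; the function names are hypothetical, and treating a tie for the maximum as "peaking at the CS" is an assumption.

```python
def relative_gradient(cr_counts):
    """Convert one S's CR counts per test frequency into percentages
    of that S's total generalization-test responses."""
    total = sum(cr_counts.values())
    if total == 0:
        raise ValueError("S made no CRs during the generalization test")
    return {freq: 100.0 * n / total for freq, n in cr_counts.items()}


def peaks_at_training_cs(cr_counts, training_cs):
    """True if responding to the training CS is maximal (ties count)."""
    return cr_counts[training_cs] == max(cr_counts.values())


# Hypothetical counts for one S trained at 2,000 cps (15 trials per tone).
example_counts = {500: 1, 1000: 3, 2000: 8, 3000: 4, 4000: 2}
gradient = relative_gradient(example_counts)
print(gradient)
print(peaks_at_training_cs(example_counts, training_cs=2000))
```

Because each S's gradient is normalized by that S's own total, Ss with different overall response levels can be compared on the same scale.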
SHORT-TERM RETENTION OF SEQUENTIALLY PRESENTED
DIGITS AS A FUNCTION OF INTERDIGIT INTERVAL,
DIGIT DURATION, AND SERIES LENGTH


THOMAS E. SITTERLEY
The Boeing Company, Seattle


The purpose of this experiment was to delineate more fully the
effects of digit duration, interdigit interval, and series length upon
retention. 10 Ss were presented series of 4, 6, 8, 10, 12, and 14 digits
in ascending (4 to 14) and descending (14 to 4) order, at durations and
intervals of 100, 200, 500, 1,000 msec. Percentage of digits recalled
increased as a function of increased interval or duration. The data
indicated that Ss perceived and encoded new information while they
simultaneously organized and rehearsed previously presented information.
In addition, short-term memory has a limited capacity, influenced
by presentation rate, and exceeded whenever stimuli were received at a
rate higher than about 2/sec.


A factor which affects immediate memory
capacity in sequential tasks is the rate of
stimulus presentation. The presentation rate
may be changed by varying the duration of
the stimulus, the interstimulus interval, or
both. While reducing the presentation rate
generally leads to improved retention under
free recall conditions (Posner, 1963), previous
investigations have predominantly
varied the rate-changing parameters singly.
When both stimulus duration and interstimulus
interval have been varied, their relationship
with presentation rate has not been completely
predictable (Corballis, 1966).

A more complete understanding of the
effect of presentation rate on retention may
be obtained by examining the relationship
of stimulus duration and interstimulus interval
to presentation rate and retention. This
study, by systematically varying both stimulus
duration and interstimulus interval of
visually presented digits over a range of
series lengths, seeks to determine the relative
effect of the presentation rate controlling
parameters on retention. Additionally, it is
intended that the present study will provide
further insights into the operation and capacity
limits of the short-term memory
process.


1 This research is based on a PhD dissertation submitted
to the University of Arizona (1966) and supported
in part by National Science Foundation Grant
GB-3955 to the University of Arizona. The author
acknowledges the kind assistance of James E. King
and Neil R. Bartlett.
Method.—An electromechanical system allowed
a preprogrammed, sequential digit
series to be presented to 10 paid undergraduates
for a specified digit duration (D)
and interdigit interval (I). Two timers
stepped a paper tape reader, the duration between
steps setting D, and the number of
blank steps between stimulus codes on the
tape specifying I. Other codes in the tape
stopped the stimulus presentation at the end
of each series of digits and produced a 2-sec.
delay between the activation of the
start switch and the onset of the first digit in
the succeeding series presentation.

A relay system translated the codes into
electrical outputs which operated an Industrial
Electronics Engineers In-Line
Readout (Model 120). The readout was
mounted on the back of an aluminum display
panel, 103 in. wide × 17 in. high, and
projected }-in. high digits (visual angle subtended:
38 min.-59 min.). The projection
screen was exposed through a centered ł-in.
diameter opening in the panel 5 in. from the
top. The screen was the fixation point for
the presentations, and subtended between 57
min. and 1° 29 min. of visual angle (depending
on Ss’ head movements). The programmed
digit order was taken from a table
of random numbers with the restrictions that
no digit would appear twice in a row, and
that there would be no ascending or descending
sequences of three or more digits. A
floor-mounted switch in S’s room was used to
initiate the tape reading sequence.
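
The two restrictions on the digit order can be illustrated with a small rejection-sampling sketch. Several points here are assumptions rather than the author's procedure: a pseudo-random generator stands in for the table of random numbers, the digits are taken to be 0 through 9, and an "ascending or descending sequence" is read as a run of consecutive digits in steps of one (e.g., 3-4-5 or 7-6-5). The function names are hypothetical.

```python
import random


def violates_restrictions(series, digit):
    """Check whether appending `digit` would break either restriction."""
    # Restriction 1: no digit twice in a row.
    if series and digit == series[-1]:
        return True
    # Restriction 2 (assumed reading): no run of three consecutive
    # digits counting up or down by one.
    if len(series) >= 2:
        previous_step = series[-1] - series[-2]
        new_step = digit - series[-1]
        if previous_step == new_step and abs(new_step) == 1:
            return True
    return False


def make_series(length, rng):
    """Draw digits one at a time, redrawing any digit that is not allowed."""
    series = []
    while len(series) < length:
        digit = rng.randrange(10)
        if not violates_restrictions(series, digit):
            series.append(digit)
    return series


rng = random.Random(1)
for length in (4, 6, 8, 10, 12, 14):  # the six series lengths in the text
    print(length, make_series(length, rng))
```

At most two of the ten digits are ever excluded at a given position, so the redraw loop always terminates.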

The temporal qualities of the stimulus 
series were calibrated using a log-linear 
photometer with its output fed into an oscil- 
loscope. When the projection lamps were 
operated, the oscilloscope displayed the rise 
and decay characteristics of the lamps and 
the timers were adjusted to obtain the 
selected stimulus durations and intervals. 

A preprinted data sheet that contained six 
groups of blocked spaces in which each 
group corresponded to a particular series 
length was given to S for each D X I condi- 
tion. The data sheet was designed so that 
S would always know the length of the series 
to be presented and, when recording, into 
which position the digits belonged. In this 
way, both the correct digit and digit order 
were recorded along with any omissions. 
For each unique combination of I and D, 
there were separate data sheets for the 
ascending and descending orders of presenta- 
tion. The ascending data sheets started with 
the shortest series length at the top and 
concluded with the longest series at the bottom,
while the data sheets for the descending
order were reversed.

An air-conditioned, soundproof room with 
a triple-plate, one-way window and inter- 
communication system isolated S from E, 
while allowing full monitoring capability 
throughout the sessions. 

Series of digits in a factorial combination
of I and D, each having lengths of 100,
200, 500, and 1,000 msec., were visually presented
to each S. There were six series
lengths (L) presented in both an ascending
(4 to 14) and descending (14 to 4) order
(O). The order of the presentation of conditions
was randomized for each S in D × I
× O blocks. Each block contained four different
presentations of each of the six L.

At the outset of the experiment, S was
read a standard set of instructions that in- 
formed him of the nature of the task and 
pointed out that the order in which digits 
were recorded in the spaces was unim- 
portant, as long as the series was repro- 
duced in the correct sequence. A short 
practice session was then conducted to 
familiarize S with the mechanics of the 
experiment. 

After the preexperimental training, S initiated
the first series by a momentary push
of the floor switch producing a 2-sec. fore- 
period followed by the digit series. At the 
end of this cycle, the system stopped and S 
immediately recorded the digits that he 
could recall. When S was ready for an- 
other series, he activated the floor switch 
again, and the cycle was repeated. After 
each data sheet was completed, S received 
a short rest period while E changed the pro- 
gram tape. Since some data sheets took 
less time than others, Ss worked for 1 hr. 
per day rather than for a specified number of 
data sheets per day; however, no data sheet 
was split between days.

Results.—The number of digits correctly
recalled in the proper position for each series 
was the dependent variable in a repeated 
measures analysis of variance in which each 
S received every condition, thereby serving 
as his own control. Neither the main effect 
of O nor the interactions involving O were 
significant. 

Digit recall was very high at the 4-digit 
series length, and then fell to 59.3% correct 
when there were 14 digits in the series. 
However, a decrease in the percentage recalled
does not necessarily imply a decrease
in the memory span. The digit span averaged
over all conditions significantly increased
as the series length increased, F
(5, 45) = 85.21, p < .001. Figure 1 shows
the average number as well as the range of 
digits recalled for each series length. The 
digit span was only about 4.5 digits at the 
fastest presentation rate, but reached well 
over 9 digits at the slowest presentation rate. 

The significant effects of I, F (3, 27) = 
69.61, p<.001, and D, F (3, 27) =47.41, 
p <.001, were quite similar. As either D or 
I increased from 100 to 1,000 msec., the per- 
centage of digits correctly recalled increased 
linearly up to 500 msec., after which the 
percentage recalled was asymptotic. The 
significant interactions L × D, F (15, 135) =
9.20, p < .001, and L × I, F (15, 135) = 12.11,
p < .001, were likewise very similar. While
the percentage of digits correctly recalled de- 
creased with increases in L, the percentage of 
digits recalled increased with increases in 
either D or I. After 500 msec., increases in 
time failed to show further increased recall. 

Figure 2 shows the significant effect of 
interdigit interval plotted against digit dura- 
tion, F (9, 81) =18.86, p<.001. Retention 
increased with interdigit interval and stimu- 
lus duration up to 500 msec., but remained 
relatively constant for those presentations 
longer than 500 msec. 

The only significant three-way interaction 
was L × D × I, F (45, 405) = 5.01, p < .001.
This interaction indicated that: (a) as L in- 
creased, the percentage of the digits retained 
decreased, (b) increases in digit duration 
yielded increased retention for each level of 
interdigit interval, and (c) retention im- 
proved as the interval became longer. The 
data showed once again that the increases in 
retention leveled off after the duration or
interval reached 500 msec. or longer.
Fig. 1. Mean and range of digits recalled as a
function of series length.

Fig. 2. Interaction between digit duration and
interdigit interval.


The data concerning digit duration and 
interdigit interval indicated that these two 
variables have very similar effects upon re- 
tention and that retention is dependent upon 
the total length of D and I, and not upon any 
particular ratio of duration to interval. For 
example, a single digit display time of 600 
msec. might be made up of duration and in- 
terval components of 100 and 500 msec., re- 
spectively, or of 500 and 100 msec., respec- 
tively. If the effects of interval and duration 
are equal, retention should be approximately 
the same whether the interval or the duration 
is the predominant component. This is shown 
graphically in Fig. 3, which plots the percent- 
age (averaged over series lengths) of digits 
correctly recalled as a function of the larger 
or predominant component of the total time. 

As has been noted, the data suggest a 
leveling off of retention as the total time 
between the onset of successive digits 
equalled approximately 500 msec. In order 
to examine further this apparent asymptote 
of retention, the time between the onset of 
successive digits was examined for all the 
data. In Fig. 4 both time between succes- 
sive digits and presentation rate in digits/ 
second are plotted against the percentage 
of digits correctly recalled. The graph
clearly shows that recall performance increased
until the time between the onset of
the successive digits reached 600 msec. (rate 
of 1.67 digits/sec). Beyond this point, per- 
formance is essentially asymptotic. 
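
The relation between the two timing parameters and the presentation rate used in this analysis can be made concrete with a short sketch. It assumes, as the text states, that the time between successive digit onsets is simply D plus I; the function and variable names are hypothetical.

```python
from itertools import product

# The four levels (msec.) used for both digit duration (D) and
# interdigit interval (I), as given in the Method.
LEVELS_MSEC = [100, 200, 500, 1000]


def onset_to_onset_msec(duration, interval):
    """Time between successive digit onsets: D + I."""
    return duration + interval


def rate_digits_per_sec(duration, interval):
    """Presentation rate implied by a D x I combination."""
    return 1000.0 / onset_to_onset_msec(duration, interval)


conditions = [
    (d, i, onset_to_onset_msec(d, i), round(rate_digits_per_sec(d, i), 2))
    for d, i in product(LEVELS_MSEC, LEVELS_MSEC)
]

for d, i, total, rate in conditions:
    print(f"D={d:>4} I={i:>4} onset-to-onset={total:>4} msec. rate={rate} digits/sec.")
```

The 16 combinations span rates from 5 digits/sec. (D = I = 100 msec.) down to 0.5 digits/sec. (D = I = 1,000 msec.), and the 100/500 and 500/100 combinations both give the 600-msec. onset-to-onset time, or about 1.67 digits/sec., at which recall levels off.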

Discussion.—The results of the present
study indicate that recall performance tended
to decrease whenever either the digit dura- 
tion or the interdigit interval was shortened. 
As long as the total time of D plus I was 
held constant, performance was essentially 
constant (Fig. 3). This finding would lead 
to the conclusion that, as long as the digits 
are perceived, it is the overall presentation 
rate that determines recall performance, and 
not any separate effects of interdigit interval 
or duration.

Fig. 4. Percentage of digits recalled as a function
of both time between successive digit onsets and
presentation rate in digits per second.

An important implication of the results
was that while new information was being
introduced into S’s perceptual system, old
information was simultaneously being pro- 
cessed, categorized, and stored. An increase 
in retention performance, when the stimulus 
duration is constant and the presentation rate 
is decreased through longer interstimulus in- 
tervals, is often attributed to the fact that 
more time is available to S for organization 
and rehearsal of the material (Posner, 
1963). While the present data indicate that 
retention increased as a function of increased 
interdigit intervals with digit duration con- 
stant, retention likewise increased when the 
interdigit interval was held constant and the 
presentation rate was decreased through 
increasing the stimulus durations. In the 
second situation, where the interdigit interval
remained constant, no additional free
time was made available through longer in- 
tervals for the organization and rehearsal 
process. Since there was no increase in the 
amount of free time, it might be assumed 
that whatever the process was that caused 
the increase in retention, it must have been 
operating while the new stimulus information 
was being received by S. 

The second implication of the present re- 
sults is that the short-term memory system 
is a limited capacity system which can 
remember only so many “things” introduced 
per unit time. When the system was filled 
over its capacity through high presentation 
rates, recall performance reached an asymptote
with those digits exceeding the per-unit-time
capacity being lost. This effect can
be seen in Fig. 1 where the highest rate of
presentation (5 digits/sec.) limits retention
to approximately 4.5 digits, regardless of how
many digits were presented. It is interesting
that this per-unit-time capacity appears to
be in the vicinity of 2 stimuli/sec. (Figs.
2, 3, and 4), which is within the range found
by others who have investigated internal processing
limits, e.g., Telford (1931) measuring
internal processing capacity during
continuous stimulus presentation, Vince
(1948) for the separation of events in time,
and Noble, Fitts, and Warren (1955) in a
pursuit tracking task. The results of the
present experiment show that as long as
the presentation rate was no faster than 1.67
digits/sec., recall performance was essentially
asymptotic (Fig. 4); and it is postulated
that recall was probably limited by
some total rate capacity of the memory
storage.

The absence of a significant order effect 
might be attributed to S’s a priori knowledge 
of the series length for each trial.
Those studies that report decreases in percentage
of digits recalled with increases in
series length (Posner, 1963) were con- 
firmed. When the mean number of digits 
recalled was calculated from the percentage 
recalled at the various series lengths, it was 
found that the mean number of digits re- 
called in this study was close to the time-honored
magic number seven, plus or minus
two (Miller, 1956). 
TRUTH AND FALSITY OF VERBAL STATEMENTS AS CONDITIONED
STIMULI IN CLASSICAL AND DIFFERENTIAL EYELID CONDITIONING 1


ROBERT A. FLEMING,2 DAVID A. GRANT, AND JANE A. NORTH


University of Wisconsin


The eyelid responses of 120 Ss were differentially conditioned to
visually presented true and false statements. Each CS was
different, and the differential cue was the truth or falsity of the
statement. True statements evoked more CRs than false statements
when all were reinforced in a mixed series, and asymptotically, true
statements produced more CRs than false statements when only true or
only false statements were used. The “truth value” of the statements
served as a good differential stimulus but the differential
conditioning did not transfer to the words “true” and “false” even
though Ss explicitly verbalized the word during the training process.
The results generally paralleled those obtained with right and wrong
arithmetic problems in earlier studies.


The present investigation was aimed at
studying the phenomena of classical and
differential conditioning when the differential
stimulus was the truth or falsity of a
visually presented verbal statement. The experiment
extends and supplements earlier
studies in which the correctness of arithmetic
was the discriminandum (Fleming, Cerekwicki,
& Grant, 1968; Fleming, Grant,
North, & Levy, 1968). These earlier results
confirmed and extended those of El’kin
(1957) who reported that true statements
were more effective conditioned stimuli than
false statements. In addition, they showed
that better differential conditioning was obtained
when the wrong problems were given
slightly aversive classical reinforcement and
the right were not, than vice versa; whereas,
with differential instrumental reinforcement,
better differential conditioning was obtained
when the right problems were given positive
reinforcement and the wrong problems were
given negative reinforcement as opposed to
the reverse relationship.


1 This research was supported by grants to the
second author, by the United States Public Health
Service (MH 06792) and by National Aeronautics and
Space Administration through an institutional grant
to the Graduate School of the University of Wisconsin.

2 United States Public Health Service Predoctoral
Fellow now with the Naval Electronics Laboratory,
San Diego.

Method.—The apparatus was the same as
that used by Fleming, Grant, North, and 
Levy (1968). The conditioned stimuli were 
short simple true or false sentences of the 
form: BEES MAKE HONEY and HORSES CAN 
FLY. Sixty different true and 60 different
false sentences were used in the experiment,
and a different sentence was presented
on each trial. The sentences were projected
in white letters on a dark background, and
ranged from 2 to 5 words in length. Physically,
the letters were 13 mm. high and 8 mm.
wide, and the total width of a sentence
ranged from 90 to 215 mm. on the
screen, which was about 120 cm. in front
of S’s eye. The duration of the CS was
2,100 msec., and on reinforced trials the 
UCS, a corneal air puff, of 200-msec. dura- 
tion was delivered 1,900 msec. after the onset 
of the CS. All Ss were given 60 trials, four 
groups receiving a random sequence of 30 
true and 30 false sentences, one group receiving
60 true sentences, and one group receiving
60 false sentences. The experiment
employed the same experimental design as 
that used by Fleming, Cerekwicki, and Grant 
(1968). Group TF(100) received rein- 
forcement of all 60 true and false sentences; 
Groups T(100) and F(100) received all 
true or all false sentences, respectively, and 
all were reinforced; Group TF(50) re- 
ceived 50% random reinforcement on all the 
true and false sentences; Group T+ F— had 
all true sentences and none of the false 
sentences reinforced; and Group F+ T— 
had the false sentences reinforced. At the 
conclusion of the 60 trials, all Ss were 
given 10 generalization trials, 5 to the stimulus
word TRUE and 5 to the stimulus word
FALSE. During the training trials, Ss had 
been instructed to say as quickly and accu- 
rately as possible whether the sentence was 
“true” or “false.” The Ss were 60 men 
and 60 women students from introductory 
psychology classes at the University of 
Wisconsin, 10 of each being assigned ran- 
domly to each group. 

Results and discussion.—All responses
greater than 1 mm. in magnitude which oc- 
curred from 200 to 1,900 msec. after CS 
onset were scored as CRs. The Hartman 
and Ross (1961) criterion was used for 
classifying Ss as either voluntary form responders
(Vs) or conditioners (Cs). Of
the 120 Ss, 5 each in the T+ F—, TF(100),
and T(100) groups, 3 Ss each in the F+ T— and
F(100) groups, and 2 Ss in the TF(50) group
were classified as Vs. Without verbalization
almost 50% of the Ss are usually Vs (Flem- 
ing, Grant, North, & Levy, 1968), so that 
Ss’ verbalization of “true” and “false” was 
evidently an effective device for reducing the 
V form responding. Data on Cs only will 
be reported. 
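
The CR scoring rule stated at the start of this section (magnitude greater than 1 mm., occurring 200 to 1,900 msec. after CS onset) can be written out as a small sketch. The record layout and function names are hypothetical, and treating both ends of the latency window as inclusive is an assumption; the report does not say how boundary cases were handled.

```python
from dataclasses import dataclass


@dataclass
class EyelidResponse:
    latency_msec: float   # time from CS onset to the response
    magnitude_mm: float   # size of the recorded deflection


def is_cr(response, min_magnitude_mm=1.0, window_msec=(200.0, 1900.0)):
    """Score one response as a CR under the criterion in the text."""
    start, end = window_msec
    return (response.magnitude_mm > min_magnitude_mm
            and start <= response.latency_msec <= end)


def percent_cr_trials(trials):
    """Percentage of trials containing at least one scoreable CR.
    `trials` is a list of lists of EyelidResponse objects."""
    if not trials:
        raise ValueError("no trials to score")
    scored = [any(is_cr(r) for r in trial) for trial in trials]
    return 100.0 * sum(scored) / len(scored)


# Hypothetical records for four trials, for illustration only.
example_trials = [
    [EyelidResponse(latency_msec=150, magnitude_mm=3.0)],   # too early
    [EyelidResponse(latency_msec=900, magnitude_mm=2.5)],   # CR
    [EyelidResponse(latency_msec=1200, magnitude_mm=0.8)],  # too small
    [],                                                     # no response
]
print(percent_cr_trials(example_trials))
```

Responses before 200 msec. are excluded by the window, which keeps very short-latency blinks to CS onset out of the CR count.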

The chief results of the experiment are 
shown in Fig. 1. An immediately apparent 
difference between the results in Fig. 1 and 
those reported by Fleming, Grant, North, 
and Levy (1968) is the higher level of 
responding on the first block of trials in this 
study. Evidently Ss tended to blink im- 
mediately after reading each sentence. At 
the conclusion of the main experiment, 10 
additional Ss were given the same mixed 
true and false sentences used in the main ex- 
periment, but none were reinforced. For 
these Ss the initial response level was ap- 
proximately 50%, and this gradually de- 
clined, reaching 10% to 20% responding after 
30 trials. (With these Ss there were no dif- 
ferences in level of responding to true and 
false sentences, indicating the two types of 
sentences were essentially equivalent in the 
absence of reinforcement.) This decline in 
response rate with nonreinforcement as com- 
pared with the curves of Fig. 1 suggests that 
the role of reinforcement was to preserve the 
initial response level rather than to increase
it.

Analyses of variance were performed, 
comparing responding to true and false 
statements on all trials and on the last 30 
trials of the experiment. 

In Panel A, the within-Ss comparison of 
responding to T and F sentences showed that 
the T sentences produced significantly more 
CRs, t (14) = 3.69, p < .01. On the last 30
trials the difference between number of CRs
to T and F sentences was not significant,
t (14) = 1.89.

In Panel B, the between-Ss comparison of 
T and F conditioned stimuli produced no significant
difference over all trials, t (30) =
1.68, which corresponds to the results found
by Fleming, Cerekwicki, and Grant (1968), 
but on the last 30 trials there were more CRs 
to T than to F sentences, t (30) =2.06, p 
< .05. 

As was the case in the Fleming, Cere- 
kwicki, and Grant study (1968), intermittent 
reinforcement in the TF(50) group produced 
no significant differences in responding to T 
and F sentences as shown in Panel C. For 
all trials, t (16) = .86, and for the last 30
trials, t (16) = .49.

In Panel D there was significant terminal 
discrimination in both the T+ F— and the 
F+ T— groups, t (14) =3.46 and t (16) = 
4.28, respectively, over the last 30 trials. 
Although the discrimination was somewhat 
superior in the F+ T— group as compared 
with the T+ F— group, the difference was 
not significant, t (30) = .60 for all trials
and t (30) = .70 for the last 30 trials.

In contrast with the experiments where 
arithmetic problems were used as conditioned
stimuli, Ss showed inconsistency in
classifying some of the sentences as true or
false. Comparisons between the T(100) and
the F(100) groups showed that the unambiguous
T sentences produced significantly
more CRs on the last 15 trials than did the
unambiguous F sentences, t (30) = 2.41.
These groups did not differ, however, in responding
to T and F sentences which had
been inconsistently classified, t (30) = .95.
This observation further supports the gen- 
eral notion that the truth or falsity of the 
sentence had an important differential stim- 
ulus value. 

In generalization of responding to the 
words “true” and “false,” the only group 
which showed significant differential generalization
between the two words was the
F(100) group whose Ss gave more CRs to
the word “false” than to the word “true,” χ²
(1) = 5.08, p < .05.

Conclusion.—The findings of this investigation
provided confirmation of those of
El’kin (1957) in that true statements evoked
more CRs than false statements in a within- 
Ss comparison, just as the right arithmetic 
had evoked more CRs than the wrong 
arithmetic problems in the earlier work from 
the authors’ laboratory. In the present study, 
however, the T-F differential effect also oc- 
curred with a between-Ss comparison of 
results from groups reinforced exclusively 
on true or exclusively on false sentences, 

In contrast to conditioning the eyelid re- 
sponse to the arithmetic conditioned stimuli, 
where normal acquisition curves were ob- 
tained, reinforcing sentences with the UCS 
appeared here to prevent habituation of an 
initial high rate of blinking. In spite of 
this difference in the nature of the condi- 
tioning process, however, the results of this 
experiment corresponded generally to the re- 
sults obtained with arithmetic problems as 
far as the differential effect of truth or 
falsity of the conditioned stimuli was con- 
cerned. 
INTERACTION OF AVERSIVE STIMULI:
SUMMATION OR INHIBITION? 1


3 experiments were performed to determine the manner in which 2
aversive stimuli, noise and shock, combined to influence behavior.
The 1st 2 experiments showed that an irrelevant, unescapable electric
shock masked the reinforcing properties of noise reduction. The 3rd
experiment demonstrated that the aversive properties of noise and
shock summated, i.e., noise plus shock were more aversive than
shock alone. These results were interpreted as demonstrating
that aversive drives summated and that reinforcement was proportional
to the relative reduction in total aversiveness.


How do aversive stimuli combine in 
typical animal learning experiments? 
Do they summate? Are they mutually 
or unilaterally inhibitory? Is the in- 
teraction process governed by the rela- 
tive compatibility of the unconditioned 
responses elicited? These questions
have neither been resolved nor investi- 
gated extensively. 

Within the confines of animal learn-
ing theory, Clark Hull, in a classic
note appended to Chapter XIV of 
Principles of Behavior (1943, pp. 254- 
256), was the first to focus attention on 
how drives combine. In that note he 
proposed that all irrelevant drives com-
bined with the dominant drive to pro-
duce an increment in total drive
strength, which in turn interacted with
the dominant habit to increase reaction 
potential. This formulation stimulated 
a great deal of research particularly on 
the interaction of hunger and thirst, 
but relatively little on the interaction 
of aversive states with the exception of 
a series of studies by Spence (1958) 
and Taylor (1956). In those studies 
they demonstrated that anxiety in hu- 
man Ss, either as a personality char- 
acteristic or induced by experimental 
manipulation, enhances acquisition and 
performance of simple classical and in- 
strumental responding. From these re- 
sults they concluded that anxiety is a 
generalized drive state which strength- 
ens the relevant response unless com- 
peting response tendencies are elicited 
by the irrelevant drive. 

In human psychophysics, however,
much greater attention has been paid
to the question of how stimuli interact,
particularly in audition where a vast
amount of research has been done on 
masking. Here it is consistently re- 
ported that the addition of irrelevant 
stimuli decreases the detectability of 
specific stimuli. The more intense and 
greater the similarity of the irrelevant 
stimulus to the original stimulus, the 
greater the masking (Fletcher & Mun- 
son, 1937; Stevens & Davis, 1938). 
Similarly, in studies of cutaneous sen- 
sitivity, vibrotactile stimulation raises 
the threshold for electric shock-induced 
pain (Melzack, Wall, & Weisz, 1963), 
although at higher shock levels pain is 
apparently somewhat enhanced by si- 
multaneous vibrotactile stimulation 
(Sherrick, 1966). 

In the overview, research in learn- 
ing has tended to show that irrelevant 
drives summate with the relevant drive 
to increase performance, while research 
from other disciplines suggests that 
irrelevant stimuli make the relevant 
stimulus less detectable. The aim of 
the present research was to explore 
the interaction of aversive stimuli. 
Two types of experiments are reported. 
In the first experiment a design and 
procedure adapted from the animal 
learning literature is used (cf. Crespi, 
1942; Woods, 1967). The Ss were 
trained to perform a lever-pressing re- 
sponse either in the presence or ab- 
sence of an irrelevant stimulus and 
then, after a fixed number of trials, 
one-half of the animals were shifted 
to the opposite condition. The second 
and third experiments were patterned 
more after psychophysical procedures; 
S was presented with a choice between 
two aversive stimuli and the criterion 
of difference between stimuli was 75% 
preference for one stimulus over the 
other. 


EXPERIMENT I


Experiment I studied the effects of 
an unescapable, irrelevant electric 
shock on the acquisition and perform- 
ance of an operant response motivated 
by noise and reinforced by noise reduc- 
tion. 


Method 


Subjects—The Ss were 40 Caesarian- 
derived, Sprague-Dawley 250-300 gm. male 
rats purchased from the Charles River 
Breeding Laboratory, Wilmington, Massa- 
chusetts. 

Apparatus.—The behavioral apparatus was
a 10% in. × 84 in. × 74 in. cage with alumi-
num walls, a wire-screen top, and grid floor
of b-in. stainless steel grids spaced 4 in.
center to center. The instrumental response
recording device was patterned after one
recently described by Bolles (1964) and
consisted of a 1-in. hole centered 2 in. above
the floor at one end of the cage. The hole 
was bisected by two photocell beams which, 
when interrupted by S, activated recording 
and control equipment. 

The cage was suspended 11 in. from the 
floor of a sound-attenuated chamber (25-40 
db. attenuation). The chamber was 26 in. 
wide, 30 in. high, and 15 in. deep (interior 
dimensions), and was located in a quiet room, 
away from associated programming equip- 
ment. 

The sound system consisted of a Jensen 
“Hypex” DD-100A driver unit with a Uni- 
versity 4409 horn mounted in the sound 
chamber’s ceiling, a Grason-Stadler 901 noise 
generator (modified to produce a 600-ohm 
output), a Hewlett Packard 350 B attenua- 
tor, a Heath W-5M power amplifier, and an 
Allison Laboratories 2BR filter. The latter 
was used to shape a 1,200-4,800 cycle band of 
white noise with a 35 db. per octave rolloff 
on either side of the band.

Noise level calibration was accomplished 
with a Scott Model 412 sound level meter 
set on the slow response function and on the 
C weighting scale. The variation of the 
noise field within the cage was + 24 db. 

The shock source consisted of a variable 
transformer which controlled the voltage ap- 
plied to a step-up transformer. Output of 
the step-up transformer was connected to the 
grids through a 150-K ohm resistor, and 
shock intensity was specified in terms of the 
voltage at the secondary of the step-up trans- 
former. A 12-pole, 10-position stepping 
switch was used to scramble the pattern of
shock applied to the grids. 

Design and procedure.—Four groups of 10
Ss were run under the following condi-
tions: The sound level was set at 120 db. (re
.0002 dynes/cm²) for all groups, and a re-
sponse by S reduced the 120-db. white noise
to the ambient sound level in the sound-
attenuated chamber for 30 sec. Responses
occurring during the “off” period did not pro-
long the “noise off” period. Each experi-
mental session lasted 15 min. Number of
reinforcements per session and latency of the
response to the nearest second were recorded
on a printing counter. Each S was given
one training session daily for 10 days with 2
days of no training elapsing between the
fifth and sixth sessions. The Ss had ad-lib
access to food and water except during the 
daily experimental session. 
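
The reinforcement contingency described above can be sketched in a few lines. This is only an illustrative reading of the procedure, not the original programming equipment; the function name `count_reinforcements` and its parameters are hypothetical. It assumes that the noise is on at the start of the session, that a response made while the noise is on turns it off for 30 sec., and that responses made during the noise-off period are ignored and do not prolong it.

```python
def count_reinforcements(response_times_s, off_duration_s=30.0, session_s=900.0):
    """Count responses that switch the noise off in one session.

    A response is reinforced only if it occurs while the noise is on.
    Responses made during a noise-off period are ignored and do not
    extend that period (assumed reading of the procedure).
    """
    noise_back_on_at = 0.0  # assumption: noise is on when the session starts
    reinforcements = 0
    for t in sorted(response_times_s):
        if t < 0 or t >= session_s:
            continue  # outside the 15-min. (900-sec.) session
        if t >= noise_back_on_at:
            reinforcements += 1
            noise_back_on_at = t + off_duration_s
    return reinforcements


# Hypothetical response times in seconds from session onset.
example = count_reinforcements([10, 20, 45, 50, 80])
```

Under these assumptions a 15-min. session allows at most 30 reinforcements, which gives a ceiling against which the session means in Fig. 1 can be read.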

The training conditions for the four groups 
differed in only one respect: the presence or 
absence of a 115-v. unescapable “irrelevant”
shock. The Shock-Shock group received
shock during all 10 sessions; the Shock-No
Shock group received shock during the first
5 sessions, but no shock during the second
5 sessions. Conversely, the No Shock-No
Shock group received no shock at any time, 
and the No Shock-Shock group received 
shock only during the last 5 sessions. 
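
The four-group shift design can be written out as a small lookup table. The sketch below merely restates the schedule given in the text; the names `shock_schedule` and `GROUPS` are hypothetical and introduced only for illustration.

```python
SESSIONS = range(1, 11)  # 10 daily sessions


def shock_schedule(first_half_shock, second_half_shock):
    """Map each session number to True if the irrelevant shock is present."""
    return {
        session: (first_half_shock if session <= 5 else second_half_shock)
        for session in SESSIONS
    }


# Group names as used in the text; True means the 115-v. shock is on.
GROUPS = {
    "Shock-Shock": shock_schedule(True, True),
    "Shock-No Shock": shock_schedule(True, False),
    "No Shock-No Shock": shock_schedule(False, False),
    "No Shock-Shock": shock_schedule(False, True),
}

# The two groups whose condition changes between Sessions 5 and 6.
shifted = [name for name, sched in GROUPS.items() if sched[5] != sched[6]]
```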


Results 


Figure 1 shows the mean number of 
reinforced responses made by each 
group during the 10 successive 15-min. 
sessions. The two groups with no 
shock present during the first 5 days 
(No Shock-No Shock and No Shock- 
Shock) showed a gradual increase in 
the number of reinforcements per ses-
sion, while the two groups receiving an
irrelevant, unescapable shock showed 
a fairly rapid diminution in number of 
reinforcements over the 5 days of train- 
ing. On the sixth day of training 
when the experimental conditions were 
reversed for half the animals, the 
Shock-No Shock group showed a rapid 
increment in rate of responding, and 
the No Shock-Shock group showed a 
rapid decrement in responding.

When these data were compared
with original learning of naive Ss un-
der the same conditions, little differ-
ence in the shape of the curves was
apparent except that the point of origin
was lower for the Shock-No Shock
group and higher for the No Shock-
Shock group. Analysis of the latency
scores revealed the same pattern of
results. Analysis of variance on both
number of reinforcements and latency
scores showed that the combined irrele-
vant shock groups differed significantly
from the combined no-shock groups
during the first five sessions (p <
.001). Similarly, a 2 × 2 analysis of
variance on the total number of rein-
forcements during Sessions 6-10 showed
both training and current conditions to
be significant (p’s < .01) which indi-
cated, as the figure shows, that it took
some time for the groups shifted from
one condition to the other to reach the
performance level of the nonshifted
groups.

FIG. 1. Number of reinforcements per 15-
min. session as a function of irrelevant
shock. (The dashed vertical line indicates
the point at which the irrelevant shock condi-
tions were reversed for half the groups.)

This finding demonstrates that a re- 
sponse motivated by noise and rein- 
forced by noise reduction is not 
strengthened by the addition of an ir- 
relevant aversive stimulus. If one as- 
sumes that the reinforcement resulting 
from noise reduction in this experiment 
remained constant in the presence of an 
irrelevant shock, then these results are 
in direct opposition to the view that
drive stimuli summate (Hull, 1943). 
On the other hand, it is possible that 
the aversive properties of the two 
stimuli—noise and shock—summated 
and that reinforcement was propor- 
tional to the reduction in total aver- 
siveness. In the absence of irrelevant 
shock the instrumental response pro- 
duced a 100% reduction, but when the 
irrelevant shock was present the re- 
sponse produced only a small reduc- 
tion in total aversiveness which was, 
therefore, much less reinforcing 
(Campbell, 1955). The following two 
experiments investigated this possibil- 
ity in more detail. 


EXPERIMENT II


This experiment measured the effect 
of several intensities of irrelevant 
shock on noise aversion difference 
limens at 60, 80, 100, and 120 db. The 
noise aversion difference limen was de- 
fined as the change in stimulus inten- 
sity that produces a 75% preference 
for the less intense of the two noise 
stimuli. This experimental procedure 
is analogous to experiments in human 
psychophysics on masking where the 
effects of an irrelevant stimulus on the 
detectability of a specific stimulus 
change are studied. Here the aim was 
to study the effects of an irrelevant 
aversive stimulus (shock) on the rein- 
forcing properties of a reduction in 
sound intensity. 


Method 


Subjects—The Ss were 768 Caesarian-de- 
rived, Sprague-Dawley 250-300 gm. male 
rats purchased from the Charles River 
Breeding Laboratory, Wilmington, Massa- 
chusetts. 

Apparatus.—The behavioral apparatus was 
a cage 15 × 7 × 8 in. with aluminum sides, a
wire-screen top, and a shock-grid floor. The 
shock grids were inserted in a Plexiglas 
frame which was mounted on a central axle 
beneath the fixed aluminum cage. Move-
ment of the animal from one side to the
other activated a small mercury well switch 
which, in turn, controlled the intensity of 
sound reaching S and activated counters 
recording time S spent on the two sides of 
the cage. The speaker was suspended from 
the ceiling and the cage was mounted 6 in. 
above the floor of an Industrial Acoustics 
Construction Company chamber (30-50 db. 
attenuation of airborne sound). The interior 
dimensions were 28 in. wide × 24 in. deep ×
56 in. high. The chamber was mounted on 
vibration pads to minimize floor-transmitted 
sounds and was housed in a quiet room adja- 
cent to the room containing associated con- 
trol and recording equipment. The sound 
generating and transducing system was simi- 
lar to that used in Exp. I, except the noise 
generator was a Scientific Prototype Model 
650 and the amplifier a McIntosh Model 
M1-75. Variation in sound intensity within 
the cage was approximately ± 1.5 db. The
shock source and grid scrambling system 
were the same as in Exp. I. 

Design and procedure—Noise aversion dif- 
ference limen functions were obtained at 60, 
80, 100, and 120 db. of white noise with 
irrelevant shock levels of 0, 45, 115, and 
300 v. In addition, aversion difference limens 
(ADLs) were obtained at 70, 90, and 110 
db. with no irrelevant drive present. 

The design consisted of pairing each of the 
standard stimuli with a number of com- 
parison stimuli in a design similar to the 
constant stimulus method used in human 
psychophysics. The basic difference was
that each S was run only once to avoid se-
quence or adaptation effects, whereas in de- 
tectability experiments each S is typically 
run through the entire set of comparison 
stimuli several times. At all standard inten- 
sities, except 60 and 120 db., comparison 
stimulus intensities both higher and lower 
than the standard were used. At 60 db. 
only increments in intensity were used be- 
cause comparison stimuli less than 60 db. 
failed to produce reliable preferences for 
the less intense stimulus, and at 120 db. only 
decrements in sound intensity were used to 
avoid permanent hearing loss and damage to 
the sound generating equipment. Table 1 
shows the intensity of the various compari- 
son stimuli used at the 60-, 80-, 100-, and 
120-db. standard levels for four levels of 
irrelevant shock. At each standard level 
and irrelevant shock condition, a range of 
comparison stimuli designed to produce pref-
erences for the standard stimulus, ranging
from at least 25% to 75%, was selected (ex-
cept for the 60- and 120-db. groups where
the range was 50%-75%, and 50%-25%,
respectively). Comparison levels, similar to
those shown in the table for the 0-v. irrele-
vant shock groups, were used at the 70-, 90-,
and 110-db. standard levels.


TABLE 1

PERCENTAGE OF PREFERENCES FOR THE
STANDARD NOISE AS A FUNCTION OF
IRRELEVANT SHOCK INTENSITY AND
COMPARISON NOISE INTENSITY

Intensity of Irrelevant Shock
Comparison Levels and Percentage of Preference

    0 v.     |    45 v.    |   115 v.    |   300 v.
  db. |  %   |  db. |  %   |  db. |  %   |  db. |  %

Standard Noise Level: 60 db.

Standard Noise Level: 80 db.

   78 | 25   |   74 | 28   |   68 | 44   |   80 | 50
   79 | 23   |   77 | 44   |   71 | 11   |   95 | 63
   80 | 38   |   80 | 56   |   74 | 35   |  110 | 81
   81 | 11   |   83 | 64   |   80 | 54   |
   82 | 96   |   86 | 88   |   86 | 56   |

Standard Noise Level: 120 db.

  117 |  2   |  110 |  1   |  112 | 11   |  108 | 25
  118 | 25   |  112 | 32   |  114 | 26   |  112 | 32
  119 | 14   |  114 | 36   |  116 | 47   |  116 | 50
  120 | 50   |  116 | 18   |  118 | 42   |  120 | 47
  121 | 40   |  118 | 49   |  120 | 50   |

Each S was taken from the home cage and
placed in the center of the apparatus. The
noise- and shock-producing systems were
started a few seconds later. The S re-
mained in the cage for 100 min.; time in 
seconds spent on the standard and comparison 
noise sides was recorded on electric timers. 
The Ss were assigned to groups in coun- 
terbalanced order and the standard and com- 
parison sides of the cage were counterbal- 
anced to control for side preferences. 


Results 


Table 1 shows the mean percentage 
of time spent on the standard stimulus 
side during the last 20 min. of the 100- 
min. test period for all groups. In
general, the more intense the compari- 
son noise intensity, the greater the 
preference for the standard noise in- 
tensity and vice versa; and the higher 
the irrelevant shock, the greater the 
change in noise intensity required to 
produce preference or avoidance of the 
standard stimulus. 

To determine ADLs for each stan- 
dard noise level at each level of irrele- 
vant shock, the preference data were 
transformed into Z scores and a 
straight line fitted to the data using a 
least squares method. These fitted 
functions were used to interpolate the 
25% and 75% preference levels, and 
the ADL was taken as half the distance 
between these two points. This pro- 
cedure is analogous to that used in hu- 
man psychophysics where the DL is 
defined as one-half the distance be- 
tween the upper and lower limens 
(Guilford, 1954). 
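
The ADL computation described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' own computation: the function names `fit_line` and `aversion_difference_limen` are hypothetical, the Z transform is taken to be the inverse of the standard normal distribution function, and the line is fitted by ordinary least squares. Percentages of exactly 0 or 100 cannot be transformed and are not handled here.

```python
from statistics import NormalDist


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def aversion_difference_limen(levels_db, pct_preference):
    """Half the distance between the interpolated 25% and 75% points."""
    std_normal = NormalDist()
    # Transform percentage preference to Z scores.
    z_scores = [std_normal.inv_cdf(p / 100.0) for p in pct_preference]
    slope, intercept = fit_line(levels_db, z_scores)

    def level_at(pct):
        # Invert the fitted line to find the level giving this preference.
        return (std_normal.inv_cdf(pct / 100.0) - intercept) / slope

    return abs(level_at(75.0) - level_at(25.0)) / 2.0


# Synthetic preference data, invented for illustration only:
# Z rises by 0.5 per db. around an 80-db. standard.
levels = [78, 79, 80, 81, 82]
percents = [100.0 * NormalDist().cdf(0.5 * (x - 80)) for x in levels]
adl = aversion_difference_limen(levels, percents)
```

Because the 25% and 75% points lie symmetrically about the 50% point on the Z scale, the ADL obtained this way equals the Z score for 75% (about .67) divided by the absolute slope of the fitted line.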

Figure 2 shows ADLs, based on the
last 20 min. of the session, for all
groups, as a function of sound level.
With no irrelevant shock present,
ADLs range from .85 to 2.4 db. except
at 60 db. where the ADL is 9.3 db. The
fact that such small changes produced
a preference for the less intense stimu-
lus is quite remarkable for two rea-
sons: (a) variations within the sound
field were ±1.5 db., approximately
equal to the average DL, and (b) the
white noise used as a stimulus is in-
herently variable. Oscillographic ob-
servation of the stimulus showed it
varied approximately ±.5 db. from an
average level. These observations sug-
gest that preference for the less intense
side of the cage was strengthened pri-
marily by the reinforcing effects of the
rapid reduction of noise occurring
when S crossed to the less intense side
and by the punishing effects of the
rapid increase in noise as S crossed to
the more intense side. Other changes
in noise intensity were either more
gradual or not correlated with S’s
movement.

FIG. 2. Aversion difference limens as a
function of masking shock intensity.

Figure 2 shows, also, that the addi- 
tion of electric shock as an irrelevant, 
unescapable, aversive stimulus sys- 
tematically increases the magnitude of 
the ADL. The higher the irrelevant 
shock the greater the change in noise 
intensity required to produce a prefer- 
ence for the less intense of two auditory 
stimuli. 

Figure 3 shows the manner in which
the magnitude of the ADLs changes
during the course of the 100-min.
session. The ADLs were calculated in
the same way as before, but for every
20 min. of the session, rather than just
the last 20 min. The figure
shows that masking shock raises the
DL from the onset of the experimental
session. There is little tendency for
the functions to converge during the
course of the session, indicating that the
masking effect of the shock is not a
transitory phenomenon. Perhaps the
most striking feature of these data is
the rate at which asymptotic preference
is reached at different noise levels. At
the highest sound level, 120 db., asymp-
totic preference is apparently reached
within the first 20 min., but at lower
sound levels the rate at which asymp-
totic preference is reached is much
slower, regardless of the masking shock
intensity.

FIG. 3. Aversion difference limens during
successive 20-min. periods of the 100-min.
training session. (The parameter is intensity
of masking shock.)

The data from this experiment dem- 
onstrate that Weber’s law holds for 
most of the range in which noise is an 
aversive stimulus. When no irrelevant 
drive is present (0-v. masking shock), 
the ADL remains constant at approxi- 
mately 1.5 db. from 70 to 120 db. Be- 
cause the decibel is a ratio measure of 
sound pressures the Weber ratios also 
remain constant. When the ADLs are 
expressed as ratios of sound pressure 
(Δp/p) as shown by the right-hand
ordinate in Fig. 2, the average ratio is
approximately .20 from 70 to 120 db.
The shape but not the absolute values
of the Weber function for aversiveness
is nearly identical to the DL functions
obtained with human Ss (Miller,
1947). Both functions show charac-
teristically high ratios at low inten- 
sities followed by constant ratios at 
moderate to high intensities. 
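
The step from an ADL in decibels to a Weber ratio of sound pressure can be made explicit. Assuming the usual definition of sound-pressure level, L = 20 log10(p/p0), a change of ΔL db. corresponds to Δp/p = 10^(ΔL/20) − 1. The sketch below applies that conversion; the function name `weber_ratio` is hypothetical.

```python
def weber_ratio(delta_db):
    """Relative change in sound pressure for a level change of delta_db.

    Assumes sound-pressure level is defined as 20 * log10(p / p0).
    """
    return 10 ** (delta_db / 20.0) - 1.0


# The ADL of approximately 1.5 db. reported for 70 to 120 db.
ratio_at_typical_adl = weber_ratio(1.5)
```

For an ADL of 1.5 db. this gives a ratio of about .19, in line with the average of approximately .20 reported above. A constant ADL in decibels therefore implies a constant Δp/p, which is the sense in which Weber's law holds here.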

These data are of interest when they 
are compared to a previous study 
which also determined ADLs for noise 
(Campbell, 1955) because of the sub- 
stantial differences between the two 
studies. Several weaknesses in the
earlier study make the present research
far more definitive. Those deficiencies
included the use of a broadband sound
generating system which probably em-
phasized low-frequency components of
the sound stimulus to which the rat is
relatively insensitive, a short session
which did not allow the rat to reach
asymptotic preference levels, and the speci-
fication of changes in sound energy in watts
rather than in decibels. Both the aver-
sion threshold and the ADLs were 
much higher in the earlier experiment. 


EXPERIMENT III


The results of Exp. I and II leave 
unanswered one question of primary 
importance: Do aversive stimuli sum- 
mate on some common scale of aver- 
siveness to produce a sensation of aver- 
siveness greater than the aversive 
properties of either stimulus alone? 
Intuitively, and on the basis of the 
previous experiments, it seems most 
plausible that they should. Yet some
years ago Gardner, Licklider, and 
Weisz (1960) proposed and apparently 
demonstrated that high intensities of 
white noise inhibited dental drill-in- 
duced pain. Similarly, in folk medi- 
cine, counterirritants are still widely 
used to alleviate discomfort. Both of 
these observations suggest that the ad- 
dition of an irrelevant aversive stimu-
lus tends to mask or inhibit the painful
or irritating components of the original 
stimulus. Since the previous experi- 
ments presented no direct evidence 
that the aversive properties of noise 
and shock summated, the purpose of 
the present experiment was to deter- 
mine whether noise plus shock were 
more or less aversive than shock alone. 


Method 


The apparatus was the same as that used 
in Exp. II. Procedurally the apparatus
was arranged so that when S was on one 
side of the cage a 110-db. white-noise stimu- 
lus plus one of five standard levels of 
shock (45, 72, 115, 185, or 300 v.) was 
present. When S moved to the other side the 
noise was switched off and a comparison
shock switched on. Table 2 shows the com- 
parison shock levels run for each standard 
level. The comparison levels were selected 
on the basis of pilot research. Eight male, 
250-300 gm. Caesarian-derived, Sprague- 
Dawley rats were run at each comparison 
level. 

Each S remained in the apparatus for 100 
min. and the percentage of time S spent on 
the comparison side (shock only) was re- 
corded. All other procedural details were 
the same as in Exp. II. 


Results 


The point at which shock and shock 
plus noise were equally aversive was 
determined for each standard level as 
follows: The preference scores for the 
last 20 min. of the 100-min. session 
(shown in Table 2) were transformed 
to Z scores, a straight line fitted to the 
resulting function, and the comparison 
level which S preferred 50% of the 
time computed. The comparison volt- 
ages (shock only) which produced 
equal preference for the standard stim- 
uli (shock + 110 db. of white noise) 
were as follows: 45 v. + 110 db. =
52.6 v.; 72 v. + 110 db. = 79.4 v.; 115
v. + 110 db. = 122.3 v.; 185 v. + 110
db. = 200.6 v.; 300 v. + 110 db. =
330.0 v. Thus, shock plus noise are
clearly more aversive than shock alone,
but the increment in aversiveness pro-
duced by the addition of noise is rela-
tively small. In fact the increment in
aversiveness is either equal to or not
much greater than the ADLs for shock
at those intensities (Campbell, in
press).


TABLE 2

EXPERIMENTAL DESIGN AND PERCENTAGE OF
PREFERENCES FOR THE COMPARISON
STIMULUS

Standard Level: Shock Plus 110 db.
Comparison Levels and Percentage of Preferences

45 v.    72 v.    115 v.    185 v.    300 v.

More interesting, perhaps, is the 
finding that as standard shock inten- 
sity increased, greater absolute incre- 
ments in shock were necessary to make 
shock alone as aversive as shock plus 
noise. This again confirms the general 
principle that the higher the shock, the 
greater the change in shock necessary 
to produce a difference in aversiveness 
(Campbell, 1956). These findings
also show that, for rats, the addition of 
an irrelevant aversive stimulus (white 
noise) does not inhibit the painfulness 
of the primary stimulus (electric 
shock) as predicted by Gardner et al. 
(1960) or by the counterirritant hy- 
pothesis. 
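
The size of the noise-produced increment at each standard level follows directly from the equal-preference voltages reported in the Results. The sketch below tabulates the absolute and relative increments from those five values; the variable and function names are hypothetical.

```python
# Standard shock (v.) paired with 110-db. noise, mapped to the
# shock-only voltage (v.) found equally aversive, as reported in the text.
EQUIVALENTS = {45: 52.6, 72: 79.4, 115: 122.3, 185: 200.6, 300: 330.0}


def absolute_increments(equivalents):
    """Extra volts of shock needed to match shock plus noise."""
    return {std: round(eq - std, 1) for std, eq in equivalents.items()}


def relative_increments(equivalents):
    """The same increments expressed as a fraction of the standard."""
    return {std: round((eq - std) / std, 3) for std, eq in equivalents.items()}


increments_v = absolute_increments(EQUIVALENTS)
increments_rel = relative_increments(EQUIVALENTS)
```

On these figures the absolute increment is close to 7.5 v. at the three lower standards and rises to 15.6 v. and 30.0 v. at 185 and 300 v., so the growth in absolute increment is carried mainly by the two highest standards.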


Discussion 


The findings of this study clearly 
demonstrate that the addition of an irre-
levant aversive stimulus (shock) to a
response motivated by another aversive
stimulus (noise) and reinforced by reduc-
tion of that stimulus (noise reduction)
seriously impairs acquisition and per-
formance of that response. As such,
these findings are contrary to the hypoth-
esis that the heightened generalized
drive produced by the addition of an
irrelevant drive multiplies with habit
strength to enhance reaction potential
(Hull, 1943). Rather, the findings sug-
gest that the addition of an irrelevant
aversive stimulus serves to mask the re-
inforcing properties of the relevant rein-
forcement (noise reduction). This is
particularly evident in Fig. 1 where the
learning curves of the Shock-No Shock
group and the No Shock-Shock group
were nearly identical in type except
for the point of origin, suggesting that
once reinforcement is introduced acquisi-
tion takes place normally. Similarly, the
introduction of irrelevant shock in the
No Shock-Shock group produces a decre- 
ment in performance that appears to be 
a typical extinction function usually pro- 
duced by the omission of reinforcement. 
From this analysis one also might argue 
that the shift of irrelevant drive condi- 
tions in Exp. I should have produced 
“elation” and “depression” effects 
(Crespi, 1942). However, neither elation 
nor depression effects have been con- 
sistently obtained with shifts in amount 
of aversive stimulus reduction (e.g.,
Bower, Fowler, & Trapold, 1959; Woods, 
1967), and in this study the changes in 
amount of reinforcement coincide with 
changes in the total aversive stimulation,
a factor which easily could militate
against obtaining either Crespi effect.

The simplest explanation of the present 
data is to assume that the aversive prop- 
erties of noise and shock combine in 
some additive fashion to produce a height- 
ened level of aversive stimulation or drive, 
as demonstrated in Exp. III. Then, 
applying Weber’s law to the reinforcing 
properties of aversive stimulus reduction 
(Campbell, 1955, 1956), a much greater 
reduction in total aversiveness is neces- 
sary to produce the amount of reinforce- 
ment equivalent to that obtained by noise 
reduction in the absence of the irrelevant 
shock. Thus, the relative reduction in 
total aversiveness produced by reducing 
the 120-db. sound to zero is smaller the 
higher the irrelevant shock. 
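
The relative-reduction account can be illustrated with a toy calculation. The sketch below is not a model fitted to the data: it assumes, purely for illustration, that the aversiveness of noise and of shock can be expressed in common arbitrary units and that the two add. The source reports no such scale, and the function name `relative_reduction` is hypothetical.

```python
def relative_reduction(noise_aversiveness, shock_aversiveness):
    """Fraction of total aversiveness removed when the noise is turned off.

    Both arguments are in hypothetical, arbitrary units and are assumed
    to summate; the shock is unescapable and so remains after the response.
    """
    total = noise_aversiveness + shock_aversiveness
    if total <= 0:
        raise ValueError("total aversiveness must be positive")
    return noise_aversiveness / total


# Arbitrary illustrative values: the same noise with increasing shock.
without_shock = relative_reduction(10.0, 0.0)
with_equal_shock = relative_reduction(10.0, 10.0)
with_strong_shock = relative_reduction(10.0, 40.0)
```

With no shock the response removes all of the aversive stimulation; as the assumed shock term grows, the same noise reduction removes a progressively smaller fraction of the total, which is the pattern the explanation above requires.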

Another possible explanation of these 
results is to assume that shock elicits re- 
sponses which are incompatible with the
criterion responses, thereby preventing
multiplication of the heightened drive
with the developing habit. The initial
response to shock may be scrambling,
leaping, and other “flight” behavior fol-
lowed by “freezing” behavior. Both types
of behavior conceivably could minimize
the pain produced by shock and offer an
alternative source of reinforcement.
Moreover, the amount of reinforcement
derived from these behaviors might in-
crease as a function of irrelevant shock
level to produce the findings obtained. 

This explanation seems particularly ap- 
plicable to Exp. I where the irrelevant
drive lowered the level of responding to 
practically zero, suggesting that the 
animal was performing competing re- 
sponses to reduce the shock-produced 
pain. In the second experiment this 
explanation seems less likely since the 
animal could reduce both the noise and 
the shock (via crouching or other pain 
minimizing responses) by staying on the 
less intense noise side. Similarly, in the 
preference apparatus, shock increased the 
number of times S shuttled from one side 
to the other, thereby increasing the num- 
ber of times the animal was reinforced by 
noise reduction for crossing to the less 
intense side and the number of times S 
was punished for moving to the more in- 
tense side. Since the noise ADLs in-
creased as irrelevant shock intensity in-
creased, it seems more plausible to assume
that the proportional reduction in overall
aversiveness produced by noise reduction
decreased as shock intensity increased.

To what extent does the conclusion
that the addition of an irrelevant aversive 
stimulus reduces the effect of the relevant 
reinforcement generalize to other motiva- 
tional conditions? If one takes the sim- 
plistic view that all reinforcement is the 
result of stimulus intensity reduction 
(Dollard & Miller, 1950), then, ob- 
viously, the addition of any irrelevant 
stimulus would reduce the reinforcing 
properties of any drive-stimulus reduc- 
tion. On the other hand, it now is clear 
for appetitive conditioning that consum- 
matory responses (rg’s) play a substan- 
tial role in the reinforcement process, 
and it is possible that irrelevant drives 
could enhance those responses under 
some limited conditions. For example, 
the heightened autonomic activity asso- 
ciated with fear or anxiety may actually 
increase thirst by producing a dry mouth 
through vasoconstriction, thereby ac- 
counting for results such as those re- 
ported by Amsel (1950). Similarly, in 
eyelid conditioning, higher levels of sub- 
ject anxiety—either induced or selected— 
may potentiate the eyeblink response as 
part of a general defensive behavior pat-
tern.

In conclusion, it appears that the addi-
tion of an irrelevant drive can either en- 
hance or inhibit acquisition and perform- 
ance of the criterion response. On the 
basis of present and past data it is pos- 
tulated that enhancement occurs only 
when the specific consummatory or de- 
fensive response is amplified or sensitized 
by the irrelevant stimulus, and the far 
more typical effect of an irrelevant moti- 
vating condition is to reduce the reinforc- 
ing properties of the relevant incentive or 
stimulus reduction by increasing the over- 
all level of motivation. 


repeated massed exposures.


Inspection behaviors² and attention-
like processes play a central role in 
verbal learning and particularly when 
learning is from written stimulus ma- 
terials. The importance of attentive 
processes in these situations is due to 
the fact that S’s inspection activities 
determine the character of the effective 
stimuli that result from study. The 
effective stimuli, in turn, determine 
what is learned (for a more detailed 
discussion see Rothkopf, 1965). 

The purpose of the present experi-
ment was twofold. The first was to
see whether verbal constructions that
attract more attention than others, in
a prose context, also tend to be learned
more readily. The second was to ex-
plore the effect of repeated exposure
to written material on attentive proc-
esses, i.e., does repeated exposure to
text change the manner in which writ-
ten prose is inspected?

1 The authors wish to thank Carolyn Shef-
sky for her help in collecting the data.

2 These have also been called mathe-
magenic behaviors (Rothkopf, 1963, 1965).
The roots are Greek: mathema (that which
is learned), and gignesthai (to be born);
i.e., behaviors that give birth to learning.

These objectives were approached 
through two experiments. The experi- 
mental plan for both of these studies 
was to expose Ss to a written passage 
n times and then to add some addi- 
tional materials, in widely dispersed 
locations within the passage, on pres- 
entation n + 1. Experiment I was
an attempt to calibrate noticeability,
i.e., the attention-commanding prop-
erty, of the added sentence fragments
by directing Ss to find them during
presentation n + 1. Experiment II,
on the other hand, was a learning ex-
periment. Following exposure n + 1,
Ss were tested to determine how much 
new material had been learned during 
that presentation. 


METHOD 


Materials—The experimental passage con- 
sisted of a 1,333-word excerpt from a 
Government publication on Thailand (United 
States Department of Defense, 1965). Two 
versions of the passage were used. One
consisted of the basic 1,260-word text.
The other, called the augmented text, was
identical to the basic text but included, in 
addition, 26 new elements. Each added 
element consisted of a word or short phrase
ranging in length (L) from 1 to 6 words. The
numbers (N) of added sentence fragments
of various lengths were as follows: L = 1,
N = 8; L = 2, N = 7; L = 3, N = 2; L = 4,
N = 3; L = 5, N = 3; L = 6, N = 3. Each
new element, in combination with the basic
text, formed a grammatical English sentence.
All sentences in the basic text, even
without the added fragments, were well
formed grammatically. In fact, the basic
text was prepared by removing elements
from the augmented text.
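
The fragment counts given above can be checked against the two reported passage lengths. The following is only a bookkeeping sketch; the variable names are illustrative and are not part of the original report.

```python
# Fragment counts by length, as listed in the Materials paragraph:
# length in words (L) -> number of added fragments (N).
fragments_by_length = {1: 8, 2: 7, 3: 2, 4: 3, 5: 3, 6: 3}

BASIC_TEXT_WORDS = 1260      # reported length of the basic text
AUGMENTED_TEXT_WORDS = 1333  # reported length of the augmented text

# Total number of added fragments and total number of added words.
total_fragments = sum(fragments_by_length.values())
added_words = sum(length * n for length, n in fragments_by_length.items())

print(total_fragments)                 # 26
print(added_words)                     # 73
print(BASIC_TEXT_WORDS + added_words)  # 1333
```

The 26 fragments contribute 73 words, which accounts exactly for the difference between the 1,260-word basic text and the 1,333-word augmented text.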

General procedure.—Group procedures and 
self-pacing were used throughout. Group 
size varied from 13 to 65 Ss. Assignment 
of Ss to treatments was random with re- 
strictions to assure that all treatments of 
both the detectability and learning experiments
would be approximately proportionally
represented in every group. Both experiments
involved several phases that followed
each other in immediate succession.
The experimental materials for each phase
were placed in separate, numbered envelopes.
A package containing envelopes for all
phases of the experiment in which he was
participating was handed to each S at the
beginning of an experimental session. Each
envelope carried directions for use as well 
as directions as to what envelope to turn to 
next. The last envelope in each set contained
filler materials intended to occupy Ss
until it was clear that all Ss had completed
their experimental task.

On various occasions, Ss were directed to
record the time. They did this by copying 
a number projected on a screen. The num- 
ber was incremented by one every 5 sec. by 
advancing a filmstrip projector with a pair 
of Hunter decade timers. 
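
As an illustration of how the copied numbers translate into inspection times, the sketch below converts two recorded readings into elapsed seconds. The function name and the example readings are hypothetical; only the 5-sec. step comes from the procedure described above.

```python
SECONDS_PER_TICK = 5  # the projected number advanced by one every 5 sec.


def elapsed_seconds(start_reading: int, end_reading: int) -> int:
    """Elapsed time implied by two copied counter readings."""
    if end_reading < start_reading:
        raise ValueError("end reading precedes start reading")
    return (end_reading - start_reading) * SECONDS_PER_TICK


# Hypothetical readings copied by one S before and after a page.
print(elapsed_seconds(12, 48))  # 180
```

Because each reading is quantized to a 5-sec. step, an interval computed in this way can differ from the true interval by just under 5 sec. in either direction.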


Experiment I: Noticeability of Added 
Fragments 


Design—The three experimental treat- 
ments were exposed to the basic passage 1, 
2, or 3 times (n), respectively. The num- 
ber of Ss in these treatments, N, was, re- 
spectively, 29, 30, and 27. Time interval 
between exposures was approximately 30 sec. 

Procedure—Each S read the basic pass- 
age n times. Immediately after the nth 
presentation, Ss were directed to search 
through the augmented passage and to un- 
derline any newly added material. 


Experiment II: Learning


Design.—In the main experimental treatments
(see Table 1 for design and treatment
designations) the basic passage was
exposed to Ss n times. On exposure n + 1,
Ss were given the augmented passage, i.e.,
basic passage with added fragments, exactly
once. Values for n were 0, 1, 2, and 3.
Time interval between exposures was approximately
30 sec.

Learning was measured by a completion
test (e.g., Ebbinghaus, 1897; more recently
called cloze procedure, Taylor, 1953) that
was administered immediately after the exposure
to the augmented text. The completion
test consisted of the augmented text
from which 69 single content words had been
deleted according to an unbiased scheme.
Twenty-six of these deletions were from
new elements in the augmented text, i.e.,
one for each added fragment. The remaining
43 items came from the basic text segments
that had been exposed n times.

Three additional treatments served as ref- 
erence conditions (see Table 1 for treatment 
codes). One of these was given the com- 
pletion test without exposure to any experi- 
mental text. The other two were tested 
after seeing the basic passage for, respec- 
tively, 1 or 4 exposures each and never saw 
the augmented text. 

Subjects—Volunteer college students 
served as Ss for Exp. I, the detectability 
study (N=86), and Exp. II, the main 
learning experiment (N = 181). They came 
from the following institutions: Newark 
State College, Elizabeth, N. J., Rockland 
Community College, Suffern, N. Y., School 
of General Studies, Columbia University, 


TABLE 1
DESIGN, LEARNING EXPERIMENT

                 No. of Presentations   No. of Presentations
Treatment        of Basic Passage       of Augmented
Codes a          (n)                    Passage                 N

L-1A             0                      1                       30
L-1B1A           1                      1                       30
L-2B1A           2                      1                       31
L-3B1A           3                      1                       30
L-1B             1                      0                       15
L-4B             4                      0                       15
L-0              0                      0                       30

a This code simplifies references to treatments in the
text. L-3B1A means a learning treatment (L); three
exposures to the basic text (3B) followed by one
exposure to the augmented text (1A).


NOTICEABILITY AND LEARNING NEWLY ADDED FRAGMENTS 


N. Y., and Fairleigh Dickinson University, 
Madison, N. J.³
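
The treatment codes of Table 1 follow a regular pattern, which the sketch below decodes into the number of basic-text and augmented-text exposures. The parser and its names are illustrative assumptions rather than anything used in the original study; the group sizes are copied from Table 1.

```python
import re
from typing import Tuple

# Optional "<digits>B" (basic-text exposures) followed by an
# optional "<digits>A" (augmented-text exposures).
_CODE = re.compile(r"^L-(?:(\d+)B)?(?:(\d+)A)?$")


def parse_treatment(code: str) -> Tuple[int, int]:
    """Return (basic exposures, augmented exposures) for a Table 1 code."""
    if code == "L-0":  # test only, no exposure to any text
        return (0, 0)
    match = _CODE.match(code)
    if match is None or (match.group(1) is None and match.group(2) is None):
        raise ValueError(f"unrecognized treatment code: {code!r}")
    return (int(match.group(1) or 0), int(match.group(2) or 0))


# Group sizes (N) as listed in Table 1.
group_sizes = {
    "L-1A": 30, "L-1B1A": 30, "L-2B1A": 31, "L-3B1A": 30,
    "L-1B": 15, "L-4B": 15, "L-0": 30,
}

for code in group_sizes:
    print(code, parse_treatment(code))

print(sum(group_sizes.values()))  # 181
```

The group sizes sum to 181, the N reported for Exp. II.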


RESULTS 


Experiment I: Detectability of Added 
Sentence Fragments 


The main purpose of Exp. I was to
determine whether (a) the detectability
of new material on exposure n + 1
was affected by the number of exposures
(n) to the basic passage, (b)
various added sentence fragments differed
markedly in the likelihood that
they were noticed, and (c)
such differences held over the range of
values for n that were used in this experiment.

A detection was counted successful
if at least one word of an added sentence
fragment was underlined.⁴ The
average proportion of successful de- 
tections over all 26 new sentence seg- 
ments was plotted against number of 
exposures in Fig. 1. The data in Fig. 
1 indicate that proportion of correct 
detections was greater after two ex- 
posures than it was after one. There 
were no differences between two and 
three exposures. 

Also shown in Fig. 1 is the effect 
of 1-3 exposures on the incidence of 
false positive responses. A false posi- 
tive response was defined as any string 
of one or more consecutive words from 
the basic text that had been under- 


3 The authors are indebted to Lincoln F. 
Hanson for his help in obtaining Ss at 
Columbia University and at Rockland Com- 
munity College, and to Kathryn B. Daniel 
for similar aid at Newark State College. 

4 The results reported here are essentially
unchanged if a detection is called success- 
ful only if all the words of a new sentence 
particle have been underlined. A sub- 
stantial portion of each of the 26 added 
sentence particles consisted of function 
words. The weaker criterion was thought 
more meaningful because when only part of 
an added segment was underlined, the un- 
derlined portions generally consisted of all 
the content words. 
Fig. 1. Proportion of detection of newly
added sentence particles and proportion of 
false positive responses as a function of the 
number of exposures to the basic passage. 


lined. The proportions plotted in Fig. 
1 are the average number of false posi- 
tive responses per S divided by 54, 
this number being the number of sen- 
tences in the basic text to which no 
new elements had been added. This 
proportion was used in order to make 
the false positive plots comparable to 
the successful detection data. The in- 
cidence of false positives changed rela- 
tively little within the range of the ex- 
posure frequencies that were tested in 
the present experiment. 
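
The two proportions plotted in Fig. 1 can be expressed compactly as follows. This is a minimal sketch with hypothetical per-S counts; only the denominators (26 added fragments, 54 unmodified sentences) are taken from the text.

```python
NEW_FRAGMENTS = 26         # added fragments in the augmented text
UNMODIFIED_SENTENCES = 54  # basic-text sentences with no added element


def detection_proportion(fragments_detected: int) -> float:
    """Share of the 26 added fragments that one S underlined."""
    return fragments_detected / NEW_FRAGMENTS


def false_positive_proportion(false_positive_counts) -> float:
    """Mean number of false positives per S, divided by 54."""
    mean_per_subject = sum(false_positive_counts) / len(false_positive_counts)
    return mean_per_subject / UNMODIFIED_SENTENCES


# Hypothetical data: one S detected 13 fragments; four Ss made
# 2, 0, 5, and 1 false positive responses.
print(detection_proportion(13))
print(false_positive_proportion([2, 0, 5, 1]))
```

Dividing by 54 puts the false positive rate on a per-opportunity scale, so that it can be read against the detection proportions in the same plot.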

The conclusions stated above were
supported by an analysis of covariance
which indicated that correct detections
differed significantly among
groups of Ss who had been exposed
to the basic passage for different numbers
of times, F (2, 7) = 8.57, p <
.001. The control variable used in
the analysis was the number of false
positive responses. Frequency of false
positive responses was not significantly
different for the various exposure conditions,
although the attained F was
large, F (2, 78) = 3.05, .10 > p > .05.

The analysis of covariance indicated 
that exposure to the basic text tends 
to increase the likelihood that Ss will
notice newly added sentence fragments.
However, substantial differences in detectability
among fragments were observed
to persist even when n = 3.
An important finding was that the relative
detectability of various added elements
appeared to remain relatively
unchanged by the number of exposures
to the basic passage. This was indicated
by high product-moment correlations
between the detectability of the
various particles under the various exposure
conditions. They were as follows:
.90 (n = 1 vs. n = 2), .86 (n =
1 vs. n = 3), and .91 (n = 2 vs. n =
3).
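
For readers who wish to reproduce this kind of comparison, a product-moment correlation between two sets of per-fragment detection proportions can be computed as sketched below. The data shown are hypothetical and do not reproduce the values of Exp. I.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


# Hypothetical detection proportions for five fragments under
# two exposure conditions.
after_one_exposure = [0.10, 0.35, 0.50, 0.65, 0.90]
after_two_exposures = [0.20, 0.40, 0.65, 0.70, 0.95]
print(round(pearson_r(after_one_exposure, after_two_exposures), 3))
```

A high coefficient of this kind means that the fragments keep roughly the same rank order of detectability from one exposure condition to the next, even if the overall level of detection shifts.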


Experiment II: Learning 


Basic text—When the proportion of 
correct responses on the completion 
test for the 43 items from the basic 
passage was plotted against number of 
exposures to the basic text segments, 
a negatively accelerated curve was ob- 
tained. This plot was similar to those 
observed in previous experiments with 
written discourse (Rothkopf, 1968). 

New sentence fragment learning.— 
Proportion of correct responses on the
completion test was tabulated separately
for each of the 26 newly added
sentence fragments for each of the
treatments of the learning experiment.
The proportion of correct responses
that resulted from the single exposure
during presentation n + 1 was found
to increase with the likelihood that the 
fragment in question was noticed dur- 
ing the detection experiment. This 
correspondence increased with the 
number of exposures to the basic pas- 
sage in the learning experiment. 

These conclusions were based on an 
analysis of variance that is reported 
later in this paper, and on correlation 
coefficients that were calculated in the 
following manner. Proportions of successful
detection for each added element
after three exposures in Exp. I
were correlated with proportion cor- 
rect completion responses for the same 
added sentence fragment obtained from 
the several learning treatments on the 
post-practice test. Since learning gains 
produced during the n +1 exposure 
were the data of interest, the learning 
score for each new element was cor- 
rected in every case for constraint due 
to preexperimental experience. This 
was done by subtracting from the pro- 
portion of correct completion responses 
of each item the score that had been 
obtained for that element by Ss who 
had taken the completion test without 
ever reading the experimental passage 
(i.e., Treatment L-0). The obtained
product-moment correlations were for
L-1A, r = .496; for L-1B1A, r = .694;
for L-2B1A, r = .712; and for L-3B1A,
r = .781. All of these coefficients
were significantly greater than
zero.
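
The correction for preexperimental constraint described above amounts to an item-by-item subtraction of the L-0 score. A minimal sketch follows; the function name and all proportions are hypothetical.

```python
def corrected_gains(treatment_scores, baseline_scores):
    """Per-item learning gain: treatment proportion minus L-0 proportion."""
    if len(treatment_scores) != len(baseline_scores):
        raise ValueError("one baseline score is needed for every item")
    return [t - b for t, b in zip(treatment_scores, baseline_scores)]


# Hypothetical proportions correct for three added fragments.
after_augmented_text = [0.50, 0.30, 0.45]  # e.g., one learning treatment
without_any_text = [0.10, 0.05, 0.20]      # Treatment L-0

print([round(g, 2) for g in corrected_gains(after_augmented_text,
                                            without_any_text)])
```

The corrected gains, rather than the raw proportions, would then be the scores correlated with the detection proportions from Exp. I.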

It had been shown in Exp. I that 
the relative noticeability of various new 
elements remained approximately the 
same over all exposure levels. Never- 
theless, the possibility existed that the 
correlation between noticeability and 
amount learned increased as a function 
of n in the learning experiment because
the correlations were calculated from 
noticeability data that had been ob- 
tained after three exposures to the 
basic text. Hence the rise of these 
correlations with n may have been due 
to the increased correspondence be- 
tween the number of exposures during 
learning and the number of exposures 
in the reference detection experiment 
treatment. Equivalent correlations 
were therefore also computed from 
noticeability data from the treatment 
where n=1. The resulting coeffi- 
cients were as follows: L-1A, r = .429; 
L-1B1A, r = .604; L-2B1A, r = .602; 
and L-3B1A, r=.676. All of these 
correlations are slightly lower than 
those obtained when n in the detection
experiment was three, but the same 
trend toward higher r’s with higher 
values of n in the learning experiment 
can be observed. 

A better understanding of how pre- 
vious exposures to the basic passage 
brought about increased correspon- 
dence between noticeability and the 
new fragment learning effects can be 
obtained from Fig. 2. For this plot,
the new items were classified into
groups according to noticeability, i.e.,
their detection probabilities, Pa. Detection
probabilities when n = 3 were
used for this purpose. The classes
were: (a) Pa < .30; (b) .30 < Pa <
.50; (c) .50 < Pa < .70; and (d) Pa
> .70. Numbers of items in these four
groups were 6, 7, 5, and 8, respectively.
In Fig. 2, the mean proportion of 
Fig. 2. Proportion of correct completion
test responses for newly added sentence
particles. (The data are shown for treatments
differing in number of exposures to
the basic text. For all treatments except
that on the extreme left, the new sentence
particles were seen exactly once prior to
testing. The parameter is detection probability
of the newly added particles as determined
after three exposures to the basic
text in Exp. I.)


correct responses on the completion 
test for items in each of these four 
categories was plotted as a function of 
the number of previous exposures to 
the basic text. 
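
The four-way classification by detection probability can be sketched as below. The class boundaries come from the text; the assignment of a value falling exactly on a boundary to the higher class is an assumption, since the report gives only strict inequalities.

```python
def detectability_class(p: float) -> str:
    """Assign a detection probability to class a, b, c, or d.

    Assumption: a value exactly on a boundary goes to the higher class.
    """
    if not 0.0 <= p <= 1.0:
        raise ValueError("a probability must lie between 0 and 1")
    if p < 0.30:
        return "a"
    if p < 0.50:
        return "b"
    if p < 0.70:
        return "c"
    return "d"


# Class sizes as reported in the text.
class_sizes = {"a": 6, "b": 7, "c": 5, "d": 8}

total_items = sum(class_sizes.values())
noticeable_share = (total_items - class_sizes["a"]) / total_items

print(total_items)                    # 26
print(round(noticeable_share * 100))  # 77
```

The reported class sizes sum to the 26 added fragments, and 20 of the 26 (about 77%) fall above the lowest class.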

The obtained plots indicate that the 
learning effects produced by a single 
exposure to a new element increased 
systematically as a function of previous 
exposures to the basic passage (n). 
This was observed for the three classes 
of more easily detectable new sentence 
fragments. The fragments that were
difficult to detect (Pa < .30), however,
appeared to be governed by a
different rule. For these elements,
either n had no effect or repeated exposures
to the basic text actually reduced
the likelihood of learning during
the single inspection of the augmented
text. Increases in n therefore resulted
in greater differences among new frag- 
ments of various degrees of notice- 
ability, with respect to learning effects 
produced by one exposure. This is 
consistent with the previous observa- 
tion that correlations between Pa and 
learning increase with n. 

The conclusions about the interrelationships
among n, learning, and Pa
were supported by a two-factor analysis
of variance of mixed design. The
L-0 condition and Treatments L-1B and
L-4B were not included in this analysis.
The two main effects, Number of
Exposures, F (3, 116) = 5.37, p <
.05, and Detectability, F (3, 348) =
71.60, p < .01, as well as the Number
of Exposures × Detectability interaction,
F (9, 348) = 3.98, p < .01, were
found to be significant.

The significant detectability effects
in the Fig. 2 data support the conclusion,
presented earlier, that more is
learned about noticeable sentence fragments.
As indicated above, Fig. 2
also revealed two interesting consequences
of exposure to the basic passage
on learning about the new sentence
fragments.

The first of these effects was that 
learning about noticeable fragments 
(Pa > .30) was more pronounced in 
treatments with the greater number of 
exposures to the basic text. Seventy- 
seven percent of the new sentence 
fragments selected for this experiment 
fall into this category (i.e., Pa > .30).

The second effect involves the new 
elements that were relatively difficult to 
detect in Exp. I. The data support 
the strong suspicion that for fragments 
with Pa < .30, exposure to the basic 
text actually decreased learning. However,
while inspection of the Pa < .30
points in Fig. 2 indicates a decreasing
trend, statistical analysis was equivocal.
For a comparison between L-3B1A
and L-1A, t (116) was only 1.918,
.10 > p > .05, although these two
treatments differed most widely with
respect to number of exposures to the
basic text (n = 3 vs. n = 0) prior to
presentation of the augmented passage.
It should be noted, however, that the
interpretation of this t test was complicated
by the fact that performance
on Pa < .30 items following the L-3B1A
treatment was approaching
chance level, i.e., the same level as
those Ss who had never seen any experimental
text (L-0).⁵

If the learning about difficult-to- 
notice elements (Pa < .30) during a 
single exposure is, as a matter of fact, 
a decreasing function of n, then this 
may have been due to modifications of 
inspection behaviors that were func- 
tionally equivalent to the extinction 
of attention-like processes in segments 

5 In this connection it should be noted that 
one of the distinguishing characteristics of 
Pa < .30 items was that they had a somewhat 
higher chance level than others. This meant 
that preexperimental experience had pro- 
vided somewhat more useful response con- 
straints for these particles than for the re- 
mainder of the added elements. 
of the experimental passage during
earlier exposures to the basic text.
Modifications of inspection (mathema- 
genic) behavior have been suggested 
(Rothkopf, 1968) as one possible ac- 
count for the negative acceleration of 
certain acquisition curves. 

The finding that increasing n re- 
sulted in greater learning about the 
more noticeable new fragments may 
also be due to changes in the hypothe- 
sized inspection activities. The more 
noticeable new fragments may attract 
more attention when embedded in material
that no longer elicits very energetic
inspection activities. That increased
learning about noticeable elements
may be related to performance
on the basic text was indicated by the 
plots shown in Fig. 3. For this figure, 
proportion of correct completion test 
responses for the 20 test items involv- 
ing those new sentence elements where 
Pa > .30 was plotted for each S against 
proportion of correct responses for the 
43 items from the basic text. This was 
done separately for the main treat- 
ments of Exp. II. 

The data in Fig. 3 indicated fairly 
clearly that within any given treatment, 
learning of noticeable new particles was 
associated with high test performance 
on test items from the basic text. 
Product-moment correlations between
new particle learning and basic text
performance were for the several treatments
as follows: L-1A, r = .508; L-1B1A,
r = .591; L-2B1A, r = .595;
and L-3B1A, r = .753.

The findings that (a) new fragments 
(Pa > .30) and basic-text performance 
were positively related and (b) mean 
basic-text performance increased with 
n suggest why new fragments per- 
formance increased with n in Fig. 2. 
In order to determine whether a and 
b were sufficient to account for the 
new fragments data (Pa > .30) in Fig. 
3, the following procedures were used. 
Fig. 3. Proportion correct responses for
new particles (Pa > .30) as a function of
performance on the basic text. (The top
panel shows the interrelationship of straight
lines fitted to the various treatments. For
an explanation of the treatment codes, see
Table 1.)


A line was fitted, by the method of 
least squares, to each treatment group 
that is plotted in Fig. 3. From the 
equation for each treatment (also 
shown in Fig. 3), the new-particle 
performance mean was calculated that 
could be expected if the mean basic 
text performance of that treatment 
were the same as L-3B1A, the treatment
with the highest value of n. Calculated
proportions of correct responses
for new particles (Pa > .30)
were nearly identical for the several
treatments: L-1A = .40, L-1B1A =
.40, L-2B1A = .42. The observed
value for L-3B1A was .38. The results
of these calculations support the
view that the increasing effectiveness
of the n + 1 presentation depended on
rises in basic text performance and
only indirectly on n.
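
The adjustment procedure described above can be sketched as follows: fit a least-squares line to one treatment group and evaluate it at the mean basic-text score of the reference treatment. The function names and the data are hypothetical; they illustrate the calculation and do not reproduce the reported values.

```python
def fit_line(x, y):
    """Least-squares slope and intercept for y regressed on x."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxx = sum((a - mean_x) ** 2 for a in x)
    if sxx == 0:
        raise ValueError("x must vary for a line to be fitted")
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def adjusted_mean(basic_scores, new_scores, reference_basic_mean):
    """New-particle mean expected at the reference basic-text mean."""
    slope, intercept = fit_line(basic_scores, new_scores)
    return intercept + slope * reference_basic_mean


# Hypothetical per-S proportions correct for one treatment group.
basic = [0.20, 0.35, 0.50, 0.60]
new = [0.10, 0.22, 0.30, 0.41]

# Hypothetical mean basic-text score of the reference treatment.
print(round(adjusted_mean(basic, new, 0.70), 3))
```

If the adjusted means of the several treatments come out nearly equal, as reported, the differences among treatments in new-particle learning are accounted for by their differences in basic-text performance.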

Why is new fragment performance 
related to basic text score?—The de- 
pendence of new fragment performance 
on basic text performance may be due 
to direct transfer of training between 
basic text and new elements. Accord- 
ing to this hypothesis the learned skills 
(knowledges) that are acquired during 
exposures to the basic text and that 
produced high test performance on the 
basic text also produced high perform- 
ance on new fragment items. The di- 
rect transfer-of-training hypothesis im- 
plies that the treatment group that saw 
the basic text and only the basic text 
for four exposures (L-4B) should 
show higher new fragment (Pa > 
.30) performance than L-1B or L-0. 
Our data (L-4B = .017, L-1B = .052, 
L-0 = .041) were not in accord with 
this prediction. 

Some assumptions about the atten- 
tional mechanisms may provide an al- 
ternative explanation of the relation- 
ship between the learning of noticeable 
new fragments and basic text performance.
The present data are accommodated
if it is assumed that (a) the likelihood
of attending to a noticeable new
element increases as the likelihood of
effective attention to the basic text decreases,
and (b) effective attending
to any material decreases as its
content is mastered. It is also plausible
that the less noticeable new fragments
(Pa < .30) were better integrated
into the conceptual fabric of the
basic text and that therefore any deterioration
of effective attending to the
basic text is likely to be accompanied
by equal deterioration with respect to


6 It is noteworthy that, in the linear equa- 
tions that were fitted to the various treat- 
ments in Fig. 3, the constants vary mono- 
tonically with n. However, differences 
among treatments, as tested by analysis of
variance, did not prove reliable.
the less noticeable new fragments. The
observation that test performance without
prior exposure (i.e., L-0 treatment)
was higher for less noticeable
elements than other new particles lends 
further plausibility to this view. The 
data of the present experiment were 
not sufficient to reach useful conclu- 
sions about the quantitative charac- 
teristics that the above assumptions 
must have in order to predict new ele- 
ment learning data in an exact sense. 

The present findings, regardless of 
theoretical interpretation, suggest the 
following empirical rules: (a) more 
is learned about new material pre- 
sented together with recently exposed 
old material than when presented with 
other new material; and (b) more is 
learned about new material presented 
together with recently exposed, well- 
mastered material than when presented 
with recently exposed material that has 
not been mastered well. 

These rules suggest a new interpre- 
tation of the studies of incremental 
mechanisms in paired-associate learn- 
ing by Rock (1957) and others (Clark, 
Lansford, & Dallenbach, 1960; Wogan 
& Waters, 1959). Their experimental 
procedure is characterized by replace- 
ment by a new paired associate of any 
item that has been incorrectly antici- 
pated at any trial. This is a situation 
in which new items are presented with 
well-mastered, recently exposed old 
material. In keeping with the prin- 
ciples stated above this should lead to 
facilitated learning of the new item. 
This in turn may provide an alterna- 
tive explanation of why Rock (1957) 
and others have failed to obtain dif- 
ferences between their replacement 
procedure and repetitive paired-asso- 
ciate techniques. 

Inspection time.—Average inspection
time per page decreased with repeated
exposures in a manner similar
to that reported previously (Rothkopf,
1968). No systematic relationship between
inspection time and the addition
of new text particles was observed. 


DISCUSSION


The present experiment provided evi- 
dence that learning about sentence frag- 
ments depended on how noticeable they 
were. Noticeability may be related to the 
structure of the sentence particles or to 
the passage context in which they are 
located. These findings have been inter- 
preted to mean that inspection behaviors 
and other attention-like activities play an 
important role in determining what is 
learned from written discourse (for fur- 
ther evidence see Rothkopf & Coke, 1963, 
1966). The results of the present study 
also indicate that inspection behavior is 
modified by repeated exposure and that 
this may have interfering effects on the 
learning of newly added material. The 
interfering effect has some resemblance 
to the experimental extinction of atten- 
tion. 

The learning of the more noticeable 
new material was related to performance 
on the old material to which the new had 
been added. This relationship was not 
due to direct transfer of training. The 
possibility cannot be dismissed that per- 
formance on the old material is a limiting 
condition for how much can be learned 
about any newly added material during 
any one exposure. 

Finally it would appear that the tech- 
nique of inserting new fragments into a 
previously exposed text may prove use- 
ful in studies of attention-like processes 
and other learning phenomena in con- 
nection with written discourse. 


REFERENCES 


CLARK, L. L., LANSFORD, T. G., & DALLENBACH,
K. M. Repetition and associative
learning. American Journal of Psychology,
1960, 73, 22-40.

EBBINGHAUS, H. Über eine neue Methode
zur Prüfung geistiger Fähigkeiten und
ihre Anwendung bei Schulkindern. Zeitschrift
für Psychologie, 1897, 13, 401-459.

ROCK, I. The role of repetition in associative
learning. American Journal of Psychology,
1957, 70, 186-193.

ROTHKOPF, E. Z. Some conjectures about
inspection behavior in learning from written
sentences and the response mode problem
in programmed self-instruction.
Journal of Programmed Instruction, 1963,
2, 31-46.

ROTHKOPF, E. Z. Some theoretical and experimental
approaches to problems in
written instruction. In J. D. Krumboltz
(Ed.), Learning and the educational process.
Chicago: Rand McNally, 1965.

ROTHKOPF, E. Z. Textual constraint as
function of repeated inspection. Journal
of Educational Psychology, 1968, 59, 20-25.

ROTHKOPF, E. Z., & COKE, E. U. Repetition
interval and rehearsal method in learning
equivalences from written sentences.
Journal of Verbal Learning and Verbal
Behavior, 1963, 2, 406-416.

ROTHKOPF, E. Z., & COKE, E. U. Variation
in phrasing, repetition intervals, and the
recall of sentence material. Journal of
Verbal Learning and Verbal Behavior,
1966, 5, 86-91.

TAYLOR, W. L. "Cloze procedure": A new
tool for measuring readability. Journalism
Quarterly, 1953, 30, 415-433.

UNITED STATES DEPARTMENT OF DEFENSE.
A pocket guide to Thailand. Washington,
D. C.: U. S. Government Printing Office,
1965.

WOGAN, M., & WATERS, R. H. The role of
repetition in learning. American Journal
of Psychology, 1959, 72, 612-613.


(Received for early publication 
March 21, 1968) 


Journal of Experimental Psychology
1968, Vol. 78, No. 2, 200-207


SOME DETERMINANTS OF CONTROLLED-
ASSOCIATION TIMES¹


ELMER H. DAVIDSON AND CHARLES N. COFER

Oklahoma City University


Controlled-association times obtained under 6 kinds of instruction 
were compared with each other and with free-association times for 3 
sets of stimulus words, the characteristics of whose response distributions
in free association were known. 21 independent groups of Ss
were used. The chief findings were that the free-association frequency
of the correct responses given in controlled association is highly
correlated (r = -.83) with controlled-association response latency, as
is the number of responses having higher free-association frequencies
than the correct response given (r = .81). When primary responses
are compared, latencies do not differ significantly between free-asso- 
ciation and the controlled-association conditions requiring opposites or 
synonyms. When a response class or classes are excluded by instruc- 
tions, latencies typically increase, even though the primary responses to 
the stimuli are permissible under the instructions. An informal 
model of the process of controlled association, as revealed in this 
experiment, is briefly described.


Controlled associations are, typically, 
responses to word stimuli given under 
an instruction which limits the class 
of permissible responses. Thus, one 
may require S to respond to the stimu- 
lus with a synonym, an opposite, a 
superordinate, a coordinate, or some 
other type of response which has a 
designated relationship to the stimulus. 
This variant of word-association procedures
has been in use for 80 yr. (cf.
Cofer, 1967; Woodworth, 1938; Woodworth
& Schlosberg, 1954), and the
results obtained with it, as compared 
with the free-association technique in 
which no restrictions as to class are 
placed on S’s response, have been sum- 
marized as follows: 


The most remarkable fact about controlled 
associations experiments is that correct re- 


1 This study was supported, in part, by
Grant Nonr 656 (30) between the Office of 
Naval Research and the Pennsylvania State 
University, and was presented by the first 
author as a dissertation in partial fulfillment 
of the requirements for the PhD degree in 
Psychology to the Pennsylvania State Uni- 
versity. 


sponses to suitable stimulus words are given 
quickly and with few false responses. The 
subject becomes set for the assigned task, 
and this preparatory set operates very effi- 
ciently, facilitating certain associations and 
inhibiting other associations which may be 
equally strong [Woodworth & Schlosberg, 
1954, p. 57]. 


The evidence on which this summary 
statement is based comes mostly from 
studies conducted prior to 1920 with 
rather small and sophisticated groups 
of Ss. In addition little account was 
taken in these studies of the strength 
in free association of the responses 
meeting the requirements of the con- 
trolled-association instructions. In a 
recent study, Cofer (1967) has found 
that under conditions of group ad- 
ministration of paper and pencil con- 
trolled-association tests (conditions 
paralleling those under which most 
norms of free association have been 
obtained) there were many more re- 
sponses not consonant with the in- 
structional condition than the quota- 
tion would suggest. In addition, he 
found that the responses given under 
controlled association had many similarities
to those given under free association.

These findings indicated that it 
would be desirable to obtain new data 
on controlled-association times, with 
several kinds of instructions and with 
control of the free-association frequen- 
cies of the target responses to the 
stimuli used. Such evidence, if it con- 
forms to the conclusion quoted above, 
might help to elucidate the nature of 
the efficient “preparatory set” which 
is capable of facilitating some and in- 
hibiting other responses. That such 
a set may be a real one was suggested 
by the results of an experiment by 
Baker and Elliot (1948). They com- 
pared free-association and controlled- 
association times to the same stimuli 
(and by selecting the stimuli carefully 
were able to deal with the same re- 
sponses under both conditions), and 
found shorter association times under 
controlled than under free instructions. 
All of their data were obtained from 
the same Ss in a constant order (free 
association first), but in a partial re- 
plication (unpublished) of this experi- 
ment with independent groups, the 
present authors confirmed this finding. 
It is unknown, however, what the 
range of conditions is under which the 
shorter association times for controlled 
association would hold. 

It is possible to restrict associations 
in a variety of ways. Most common in
the literature is the designation of a 
class of responses to be given. One 
might, however, ask for either of one 
or two classes, or indicate that re- 
sponses of one class or responses of 
each of two classes are not to be given. 
All of these conditions can be classed 
as involving controlled association. 
They were used in the present experi- 
ment. In addition, stimuli were em- 
ployed with known propensities for 
eliciting responses in free association 


appropriate to one or another instruc- 
tional condition, and the responses 
obtained were analyzed in terms of 
their latencies, appropriateness, free- 
association frequency, and rank in the 
free-association hierarchy. 


METHOD 


Subjects.—The Ss were 586 students from
the introductory psychology classes at Pennsylvania
State University. Data from 567
(389 males and 178 females) of these were 
included in the analysis. Eleven were dis- 
carded because of apparatus malfunction, 
6 because of speech defects, and 2 because 
of mistakes in procedure by E. All Ss 
were volunteers, but were rewarded by lab- 
oratory credit for their psychology classes. 

Apparatus.—The stimulus words were
presented by use of a Stowe memory drum
(Model 459 B). When the shutter was
raised so that S could see the stimulus, an
electronically controlled Lafayette voice-reaction
apparatus (Model 302) was pulsed,
and a Standard Electric clock (Model S-1)
was started. The clock was calibrated in
1/100ths of a second. When S responded,
a throat microphone pulsed the voice-reaction
apparatus and the clock was stopped.
One second before the presentation of any
stimulus word, a timer automatically reset
the electric clock. If S did not respond, this
mechanism stopped and reset the clock for
the next trial.

When a continuous sound was supplied to 
the microphone, the shortest time obtainable 
(stopping latency of the apparatus) ranged 
from .11 sec. to .16 sec. This is about .15
sec. below the shortest association time
obtained in this study.

After six of the nine complete replica- 
tions, the voice-reaction apparatus was re- 
placed by a new Lafayette voice-reaction 
apparatus (Model 602A). The latency of 
this apparatus was even shorter than that 
of the first. Since both units of apparatus 
used to measure latency were capable of 
measuring latencies much shorter than any 
obtained, it was assumed that the change did 
not affect the data. 

Design.—The study was designed primarily
around three lists of 10 stimulus
words each. One list contained words 
whose primary responses (PRs) in free as- 
sociation were opposites (opposite-primary) ; 
another, words with synonyms as PRs 
(synonym-primary) ; and the third, words 
that had PRs that were neither synonyms
nor opposites, i.e., they had other words as
primaries (other-word-primary).2 The
normative sources were (a) Cofer;3 (b)
Bousfield, Cohen, Whitmarsh, and Kincaid
(1961); (c) Palermo and Jenkins (1964);
and (d) Deese.4

Each of the three lists of words was util- 
ized with seven different types of control 
instructions, as follows: 


1. Give opposites. 

2. Give synonyms. 

3. Give opposites or synonyms. 

4. Give the first word thought of (free as- 
sociation). 

5. Give any word except an opposite. 

6. Give any word except a synonym. 

7. Give any word except an opposite or a
synonym.


Independent groups were utilized for all 
conditions. With three lists of stimulus 
words and seven types of instructions, there 
were 21 basic groups. There were 27 Ss 
in each group. Also, the words of each of 
the three lists were presented in three dif- 
ferent random orders with 9 of the 27 Ss 
exposed to each order of each group. The 
three different orders were used to balance 
out any effects due to adaptation or cumu- 
lative exposure. 

The Ss were assigned to groups by the
following method. Sixty-three (3 Orders ×
3 Stimulus Types × 7 Instructions) numbered
disks were placed in a container. The
experimental condition for each S was determined
by blindly drawing a disk from
the container. All 63 conditions (one replication)
were assigned before any disk was
replaced in the container. The sex of S was
not considered in the assignment of Ss to
groups or in the analysis of data.
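
The drawing procedure just described amounts to block randomization: each block of 63 draws exhausts the container before any disk is replaced. A minimal sketch follows, assuming the nine complete replications mentioned under Apparatus; the function name, the integer codes for orders and instructions, and the seed are illustrative and not part of the original procedure.

```python
import itertools
import random
from collections import Counter

ORDERS = (1, 2, 3)
STIMULUS_TYPES = ("opposite-primary", "synonym-primary", "other-word-primary")
INSTRUCTIONS = tuple(range(1, 8))  # the seven instruction types, coded 1-7


def assign_subjects(n_replications, seed=None):
    """Return one (order, stimulus type, instruction) cell per S."""
    rng = random.Random(seed)
    cells = list(itertools.product(ORDERS, STIMULUS_TYPES, INSTRUCTIONS))
    assignments = []
    for _ in range(n_replications):
        disks = cells[:]      # refill the container with all 63 disks
        rng.shuffle(disks)    # blind drawing without replacement
        assignments.extend(disks)
    return assignments


assignments = assign_subjects(n_replications=9, seed=0)
# Collapse over presentation order to get the 21 basic groups.
per_group = Counter((stim, instr) for _, stim, instr in assignments)
```

With nine replications this yields 567 assignments, 27 Ss in each of the 21 basic groups and 9 Ss for each order within a group, which matches the group sizes reported above.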

In order to determine which response 
words were appropriate for the instructions 


2 A four-page table giving the stimulus
words used along with their PRs and one
other word acceptable in each of the other
categories along with the AP of each word
has been deposited with the National Auxiliary
Publications Service. Order Document
No. NAPS-00052, from ASIS National
Auxiliary Publications Service, c/o CCM
Information Sciences, Inc., 22 West 34th
Street, New York, New York 10001; remitting
$1.00 for microfiche or $3.00 for
photo copies.

3 Unpublished study, 1957. 

4 J. Deese, personal communication, 1963. 


given for each group, the following sources 
were used as authorities: Webster’s Third 
New International Dictionary of the English 
Language (1961), Webster's Dictionary of 
Synonyms (1942), Funk & Wagnall’s Stand- 
ard Handbook of Synonyms, Antonyms, and 
Prepositions (1947), and Jenkins and Pa- 
lermo’s (1964) study of contrasts. If a 
response fit the instructions in any sense indi- 
cated in the source, it was assumed that S was 
following instructions and the word was con- 
sidered a correct response. If none of the 
sources described the word in a way that fit 
the instructions, then that word was con- 
sidered an intrusion and the latency was 
not used in the group latency score. If S
failed to respond, the omission was not 
scored for latency. A few other latencies 
were missed because of apparatus failure 
and sounds made by S before the response 
was given. 

Procedure.—The S was seated before the 
memory drum and given appropriate in- 
structions verbally. After S reported that 
the instructions were clearly understood, a 
throat microphone was fastened around S’s 
neck and the experiment was begun. No 
practice trials were given. The stimuli were 
exposed on the memory drum at 10-sec. intervals.
After 5 sec., the shutter of the
memory drum covered the stimulus word 
and stayed down until the next stimulus was 
presented. The click of the clock-reset 
mechanism served as a 1-sec. warning signal 
that the next stimulus word was about to ap- 
pear. The E listened for the simultaneous 
sound of the beginning of the word and 
click of the clock stopping to determine if 
the time was a true measure of word latency. 


RESULTS


Scoring procedures—After deter- 
mining which responses were appro- 
priate, the mean latency (in 1/100ths 
sec.) of the correct responses was computed
for each word5 of each group.
Using this method, there were 10 
scores for each group with each of the 


5 At the suggestion of Robert Seibel, the 
analysis was redone using subject scores 
rather than word scores. No conclusion 
would be changed by use of the subject-score 
analysis. The word score is used because 
it seems that the particular latencies may be 
more peculiar to the word than they are to 
the subject. 
TABLE 1

GROUP MEAN ASSOCIATION PERCENTAGES OF THE CORRECT RESPONSES

                                            Type of Stimulus Word

Instructions                             Opposite    Synonym    Other-Word
                                         Primary     Primary    Primary

Give opposites                             54.5         7.1        10.2
Give synonyms                               2.8        35.2         5.9
Give opposites or synonyms                 53.7        22.4         9.5
Free association                           50.9        26.3        19.3
Give any except opposites                   2.0        27.5        20.5
Give any except synonyms                   47.5     [illegible]    18.7
Give any except opposites or synonyms       0.6         1.5        18.9


10 scores being the average of 2 to 27 
individual latencies. The average 
number of individual scores going into 
each word score was 20.3. These 10
word scores for each of the 21 groups 
were used as the basic data for all the 
following latency analyses. 

The strength of association between 
the stimulus word and the response 
obtained was measured by the frequen- 
cies of occurrence of the obtained re- 
sponses in the normative sources. For 
example, if S gave the response night 
to the stimulus BLACK, the frequency 
of this response to this stimulus in 
the free-association norms was used. 
Since the norms available differed in 
the number of Ss used, frequencies were 
converted to percentages. A mean as- 
sociation percentage (MAP) of the 
correct responses was computed for 
each stimulus word. If the correct re- 
sponse did not occur in the norms it 
was scored a zero. Nine of the 210 
word scores had MAPs of zero. Table 
1 shows the average associative 
strength of the correct responses for 
each group. 
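
The MAP computation described above can be sketched as follows. The norm frequencies, the norm sample size, and the function names are hypothetical and serve only to illustrate the arithmetic; they are not taken from the normative sources.

```python
def association_percentage(frequency, norm_sample_size):
    """Convert a normative frequency to a percentage of the norm sample."""
    return 100.0 * frequency / norm_sample_size


def mean_association_percentage(correct_responses, norms, norm_sample_size):
    """Mean association percentage (MAP) for one stimulus word.

    correct_responses: the correct responses given by the Ss of one group.
    norms: free-association frequency of each response to this stimulus.
    A response absent from the norms is scored zero.
    """
    if not correct_responses:
        raise ValueError("no correct responses for this stimulus word")
    percentages = [
        association_percentage(norms.get(response, 0), norm_sample_size)
        for response in correct_responses
    ]
    return sum(percentages) / len(percentages)


# Hypothetical norms for the stimulus BLACK, based on 1,000 Ss.
black_norms = {"white": 600, "night": 50}
black_map = mean_association_percentage(
    ["night", "night", "coal"], black_norms, norm_sample_size=1000
)
```

Here two Ss gave night (5% in the hypothetical norms) and one gave coal (absent from the norms, scored zero), so the MAP is the mean of 5, 5, and 0.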

Association time and free-association
frequency (MAP).—An analysis of
deviation from linearity showed that 
the relationship between association 
time and strength of the association 
was curvilinear, F (41, 167) = 2.95, 
p< .01 (Guilford, 1956, p. 294), so 


a logarithmic transformation was per- 
formed on MAP of the correct re- 
sponse scores with a constant of one 
added to each score to avoid the log 
of zero. After the logarithmic trans- 
formation, the relationship was not sig- 
nificantly different from a linear one. 
The product-moment correlation be- 
tween the transformed MAP and as- 
sociation time is —.83 (p < .001). 
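
The transformation and correlation can be illustrated with a short sketch. The base of the logarithm is not stated in the text; base 10 is assumed here, and the choice does not affect the correlation. The MAP and latency values are made up for illustration and are not the 210 word scores of the study.

```python
import math


def log_map(map_score):
    """Log-transform a MAP score, adding one to avoid the log of zero."""
    return math.log10(map_score + 1.0)


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical word scores: MAP (percent) and latency (1/100ths sec.).
maps = [0.0, 2.0, 9.0, 30.0, 55.0]
latencies = [230, 200, 165, 125, 100]
r = pearson_r([log_map(m) for m in maps], latencies)
```

A word score with a MAP of zero transforms to zero, and stronger associates go with shorter latencies, so the resulting correlation is negative, as in the study.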
Association time and number of 
more frequent responses in the free 
association hierarchy.—Since the cor- 
relation between controlled-association 
time and frequency of response in free 
association was so high, it seemed 
worthwhile to look at another measure 
obtainable from the free-association 
norms. This is the number of responses
in a given distribution with
higher frequencies than the correct re- 
sponse given. A count was made for 
each correct controlled-association re- 
sponse, but because of the different 
sample sizes in the norms it was de- 
cided to use four responses as the 
maximum, i.e., tables were made for 
1, 2, 3, or 4 (or more) responses with 
free-association frequencies higher than 
the response in question. The correla- 
tion (r) between this measure and 
latency in controlled association was 


.81 (p < .01).
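
The count can be sketched as below, with four as the maximum. The norm frequencies and the function name are hypothetical. A primary response has no more frequent competitor and therefore scores zero here; how the study tabulated that case is not stated, so this is an assumption.

```python
def more_frequent_count(response, norms, cap=4):
    """Number of normative responses more frequent than `response`, capped."""
    target = norms.get(response, 0)  # absent from the norms: frequency zero
    count = sum(1 for frequency in norms.values() if frequency > target)
    return min(count, cap)


# Hypothetical free-association norms for one stimulus word.
norms = {"white": 600, "dark": 120, "night": 50, "cat": 30,
         "coal": 10, "ink": 5, "sheep": 2}
counts = {word: more_frequent_count(word, norms)
          for word in ("white", "night", "sheep")}
```

In this example night has two more frequent competitors (white and dark), while sheep has six, which is recorded as four under the cap.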


6 The correlation between this measure
and free-association frequency for the same 
TABLE 2

LATENCIES OF THE CORRECT AND PRIMARY RESPONSES IN SPECIFICATION,
ELIMINATION, AND FREE-ASSOCIATION CONDITIONS

Type of Stimulus Word    Group     Latency of          Latency of
                                   Correct Response    Primary Response

Opposite Primary         S              100                  99
                         FA             117                 109
                         E:AES          133                 124

Synonym Primary          S              150                 147
                         FA             150                 135
                         E:AEO          166                 152

Other-Word Primary       S               --                  --
                         FA             136                 132
                         E:AEO          162                 150
                         E:AES          164                 149

Note.—The line beside the scores indicates a lack of statistical
significance between the groups joined by the line.


Types of controlled association — 
Certain of the instructions specify the 
kind of response to be given, e.g., give 
opposites, synonyms, opposites or syno- 
nyms; others specify kinds of responses 
not to be given, e.g., any word but an 
opposite, a synonym, or an opposite 
or a synonym. Since the first type 
specifies the acceptable response class, 
it will be referred to as S; the second 
type designates certain classes as not 
acceptable. These classes are elimi- 
nated, hence this type will be referred 
to as E. It is of interest to compare 
these types with each other and with 
free association (FA) for each stimu- 
lus set. The data are shown in Table 
2. 

Stimuli with opposites as primaries. 
—When latencies for all correct re- 
sponses are compared, the S group 
(give opposites) has the shortest la- 
tency (100), followed by the FA group 


responses was —.88, indicating that they 
measure much the same thing. Using these 
two measures in a multiple-regression equa- 
tion (Bartoo, Hiz, & Laird, 1958) together 
with two others (type of controlled-associa- 
tion instruction and the degree of response 
restriction) produced a multiple R of .86.
Clearly, this shows little improvement over 
the values obtained for log free-association 
frequency (—.83) or for number of more 
frequent responses (.81). 


and by the E group instructed to give 
any words except a synonym (AES), 
and all means differ significantly from 
one another at p< .05 according to 
Duncan’s multiple-range test (McGuigan,
1960). However, when primary
responses (the opposites) are
used, only Group AES differs signifi- 
cantly from the others. This compari- 
son based on primaries holds the asso- 
ciative frequency of the correct re- 
sponse constant across groups and thus 
is the best one to use. 

It should be noted that since these 
words all have opposites as primaries, 
the significant increase in latencies for 
AES (and the insignificant increase 
for Group FA) must be due to proc- 
esses invoked by the type of instruc- 
tion. 

Stimuli with synonyms as primaries. 
—There are no significant differences 
here by Duncan’s test, and the means 
for S and FA are close together for 
both all correct responses and for pri- 
maries. Although the E group 
(AEO) has an increased latency, des- 
pite the appropriateness of the syno- 
nym primaries as responses, it is not 
significant. The increased latency, 
however, parallels that for stimuli with 
opposite primaries in the case of Group 
E (AES). 
Stimuli with words as primaries
other than opposites or synonyms.— 
With these stimuli it was not possible 
to designate a primary response class 
that would hold for all the stimuli of 
the set. Hence, the S condition is not 
shown for these sets in Table 2. How- 
ever, the free-association latency for 
these stimuli (for all correct responses 
or for primaries alone) is significantly 
shorter than for either E instruction 
(AEO, AES); the two E conditions 
do not differ significantly from each 
other. 

These data suggest that there need 
be no difference in latency between FA 
and S groups (e.g., the synonym-pro- 


ducing stimuli) and that the E kind 
of instruction clearly produces longer 
latencies than the S or FA instructions. 
The latter is a clear case in which 
shorter latencies occur under FA than 
under controlled association. 

A further comparison of S and E 
conditions is provided in Table 3. 

Here the instructions did not permit 
the primary response for the stimulus 
word sets presented. Again there is
an elongated latency for the E groups 
(AEO and AES), as compared with 
the S groups, the difference reaching 
significance (p < .05) in one case but 
not in the other. This comparison is 


TABLE 3

LATENCIES OF THE CORRECT RESPONSES AND
THE MEAN ASSOCIATION PERCENTAGES
(MAP) OF SPECIFICATION AND
ELIMINATION CONDITIONS

                   Type of Stimulus Word

    Opposite Primary              Synonym Primary

Group    Latency    MAP       Group    Latency    MAP
S          218      2.8       S          165      7.1
E:AEO      236      2.0       E:AES      216      3.4

Note.—The line beside the scores indicates a lack of
statistical significance between the groups joined by the
line.

not as clear as one would like, because
the latency differences also parallel dif- 
ferences in associative frequency 
(MAP) of the correct responses. 
Number of response classes speci- 
fied or eliminated.—The authors’ in- 
structions specify that a response may 
come from one class or from two 
classes and that a response from one 
class or from two classes is not to be 
used. In the present analysis, the ef- 
fect of each of these variations was 
evaluated. Table 4 shows the data. 
Where a single class of response is 
designated (opposites, synonyms) 
there is no significant difference in 
latencies either for all responses or for 
primaries only as compared with the 
designation that the response must 
come from either of two classes (op- 
posite or synonym), although the mean 
latencies for the latter condition are 
all slightly longer than for the former 
conditions. With the E conditions, 
however, the elimination of two classes 
(any word except an opposite or synonym)
significantly (p < .05) elongates
the latencies as compared with the 
elimination of one class (AEO, AES). 
A similar finding was observed for the 
stimulus set with primaries other than 
opposites or synonyms. Here the 
means for AEO and AES are 162 
and 164, respectively, and for the 
double elimination condition (any ex- 
cept opposites or synonyms) is 184. 
Free association, in the present com- 
parison, could be considered as a case 
of neither specification nor elimination 
of response classes. Comparisons of
FA latencies were made with those for 
single restrictions (specification and 
elimination, e.g., opposites, AEO) and 
double restrictions (specification and 
elimination, e.g., opposites or synonym, 
any word except an opposite or syno- 
nym). For the stimuli with opposite 
primaries, the latencies were as fol- 
lows: FA, 116.9; single restrictions, 
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Bodh 167.4; double restrictions, 180.0. A 
$ fsa similar set of relations was found for 
the stimuli with synonym primaries, 
Plas No check was made for the third set 
E mE gS of stimuli, since there were no cases 
£ o 8 here which allowed for primary re- 
2 g sponses under single restrictions. 
Q > B 
(a — n 
ali 32 2 128 Discussion 
Oo || 58 Ones 
O | cs fx) tx) The most important findings from the 
te £2 present experiment may be summarized 
Bi 5% el pee as follows: (a) Controlled-association 
8 ER 4 |ac times are predicted well (r= —.83) by 
E the frequencies of the correct responses as 
B ees free associates to the same stimuli. (b) 
a E ae 8/83 Controlled-association times are also pre- 
p £ T RINT. dicted well (r=.81) by the number of 
B 3 $ Tesponses with greater frequencies in free 
fe gle association than the correct response to 
g & s |o6 | 3 the same stimuli. (c) These two pre- 
a È aa 5 dictors are highly intercorrelated (r= 
A oa l? —.88). (d) When attention is restricted 
2 3s to primary responses, there are no sig- 
~+ 8 4 8 nificant differences between free-associa- 
al bs a PETINE tion times and controlled-association times 
a Š E332] for the instructions to give opposites or 
g Z D f to give synonyms when the stimuli used 
A BHD is È have, respectively, opposites or synonyms 
3 @ | fee |x 5 as primaries in free association. (e) Use 
2|/6 jaag of all correct responses produces a sig- 
ty} g El nificant advantage for controlled asso- 
Z faa ian a Q & ciation (give opposites) over free asso- 
g ze 3 loas |7 ciation for opposite-producing stimuli; 
Hs oF ) a1: this is, perhaps, a consequence of lower 
p || 3 F free-association frequencies for responses 
Bl o i Bip aac 5 other than opposites given in free asso- 
| a Eu 5 a2 |ž ciation than is the case when only op- 
ô || 22 at U T i posites are considered. H), Significant 
g F g elongation of association times occurs 
a $ PAN en E primarily in the instances in which one 
5 g | Exs|ss |5 class or two classes of responses are 
a ESETEI a s eliminated by instruction, despite the fact 
5 & a that free-association primary responses 
a a Q = are permitted under several of these con- 
z i Kea lie ditions. This last result clearly indicates 
a e n 138 that controlled-association times may be 
ki É longer than free-association times, given 
B55 Fs certain conditions. 
BEE A i T These results are consistent with the 
3 BE SoS following general interpretation. It is 
ga E A hypothesized that under controlled-asso- 
ae ciation conditions, S, on seeing the stimu- 
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lus word, “scans” his associative hier- 
archy for the stimulus in question and 
emits the response which he “judges” as 
appropriate for that stimulus under the 
instruction given. The interpretation involving
scanning is suggested by the high
correlations of controlled-association latency
with both the free-association frequency
of the correct response and the number of
responses more frequent than it. Further,
the failure to find significant latency dif- 
ferences between the instructions to give 
opposites or to give synonyms and the 
free-association instructions for primaries 
is consistent with the scanning notion. 
However, the elongation of response la- 
tencies under instructions which eliminate 
one or two response classes, even when 
the instructions permit the free-associa- 
tion primary for the stimuli employed, 
implies that S must check the response 
which occurs to him, based on its free- 
association strength, for appropriateness 
to the stimulus under the given instruc- 
tional condition. 

It appears to us that the classical sum- 
mary of controlled-association latencies, 
as provided in the quotation given in the 
introduction, is faulty in that (a) free- 
association and  controlled-association 
times may not differ, given that the same 
responses (primaries) are compared and 
(b) there are conditions, viz., those that 
eliminate a response class or classes, 
under which controlled-association times 
are longer than are free-association times. 

Language switching in the bilingual’s speech production takes an observable
amount of time. He can reduce the amount of time if he is
able to anticipate a switch. Switching times are not a function of 
stimulus uncertainty, but they are a function of response uncertainty. 
Language switching did not differ in any measurable way from a uni- 
lingual form of response switching with which it was compared. 
These were the conclusions of a study of number naming by bilinguals. 
The theoretical implications of the findings were discussed with special 
reference to linguistic independence, the bilingual’s ability to keep his 


languages distinct. 


Within certain limits (Ervin-Tripp, 
1964; Fishman, 1964; Gumperz, 1967; 
Jacobovits & Lambert, 1961; Mackey, 
1962), all bilinguals manage to keep 
their languages distinct and can switch 
from one to the other. It follows that 
to some extent bilinguals experience 
their languages as psychologically dis- 
tinct systems, and that they have some 
device to control which one is used at 
any particular time. Penfield and 
Roberts (1959) described this device 
as a “curiously effective automatic 
switch,” which they seem to visualize
as ensuring that when one language
is on, the other is necessarily
off. However, Macnamara (1967a)
has pointed out that certain findings 
reported by Preston (1965) require a 
model which includes at least two 
switches, one at the beginning of the 
input or decoding process and one at 
the beginning of the output or encod- 
ing process. The study reported here 
was designed to throw light on the 
second of these switches and on its 
manner of functioning. 

The basic idea of the study was to 
present a bilingual with linguistically 
neutral stimuli such as numerals and 
have him name them first in one language
(L1), then in the other language
(L2), and then alternate between languages
(switching). The use of linguistically
neutral stimuli ensured that
the switch being operated was the “out- 
put” switch. The hypothesis was that 
switching times would be longer than 
unilingual ones, and that the difference 
would be due to the fact that the bi- 
lingual had to perform the added task 
of operating the switch when switching 
languages. However, when reading 
unilingually a list of numbers selected 
at random from 1 to 10, e.g., S need
have only 10 names in readiness; when 
reading the same list bilingually he 
must have 20 names in readiness. That 
is, the bilingual condition involves 
greater response uncertainty than the 
unilingual one. As part of a proced- 
ure for studying the relationship be- 
tween response uncertainty and lan- 
guage switching, Ss were asked to read 
unilingually lists of numbers selected 
from 1 to 10 and also lists selected 
from 1 to 20. 
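
The difference in response uncertainty between 10 and 20 names can be put in bits. The sketch below assumes that all names in readiness are equally likely, which the text does not state; under that assumption doubling the response set adds exactly one bit, whether the 20 names come from two languages or from a unilingual list ranging from 1 to 20.

```python
import math


def response_uncertainty_bits(n_alternatives):
    """Uncertainty, in bits, of a choice among equally likely responses."""
    return math.log2(n_alternatives)


unilingual_bits = response_uncertainty_bits(10)  # 10 names in readiness
bilingual_bits = response_uncertainty_bits(20)   # 20 names in readiness
added_bits = bilingual_bits - unilingual_bits
```

This is why the unilingual lists ranging from 1 to 20 serve as a control: they match the bilingual condition in the number of names in readiness without requiring a language switch.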

Two preliminary studies encouraged 
the authors to expect a quite marked 
switching effect and also provided a 
clue which influenced the design of 
the study. The authors found that 
when Ss were required to switch lan- 
guages in a regular and predictable 
pattern, their response times were 
faster than when the switches occurred 
in a random and unpredictable pattern. 
This effect shall be referred to as the 
anticipation effect. 

The anticipation effect can be tested 
by comparing response times for 
switching in a regular pattern with 
those for switching in an irregular 
pattern. However, on the hypothesis 
that anticipation is possible only when 
there is sufficient time for it to take 
place, the authors also tested the an- 
ticipation effect by varying the interval 
between response and stimulus. Thus, 
four conditions were included to check 
out this effect: switching in a regular 
pattern with and without an interval 
between response and stimulus; switching
in an irregular pattern with and
without such an interval. Because an- 
ticipation involves the reduction of re- 
sponse uncertainty, these conditions 
should also cast light on the relation- 
ship between response uncertainty and 
language switching. 

Further, to see if there is something 
special about language switching, the 
authors thought it desirable to compare
language switching with other
forms of switching in which a language 
switch is not involved. Finally, an at- 
tempt was made to study language 
switching in relation to degree of bi- 
lingualism, i.e., relative competence in 
the two languages. 


METHOD 


Subjects—The Ss, French-English bilin- 
gual college students who volunteered for 
the experiment, were selected on the basis of 
responses to a language background ques- 
tionnaire. They fell into three groups: (a) 
English speaking with some knowledge of 
French, (b) French speaking with some 
knowledge of English, and (c) balanced, or 
equally competent in both languages. It 
should be noted that this classification is 
very rough indeed (see Macnamara, 1967b). 

Materials and design.—Eighteen lists, each 
consisting of 10 randomly arranged numbers,
were prepared. In all lists except Numbers
3 and 4 (see Table 1), the numbers ranged 
between 1 and 10; in Lists 3 and 4 they 
ranged between 1 and 20. Four of the lists, 
Numbers 1 to 4, were to be read unilingually 
in English or French; four were used to 
measure language switching, Lists 5 to 8; 
two required Ss to add one to the number 
presented, Lists 9 and 10 (e.g., if 6 appeared, 
S was required to say seven); four were 
number-switching lists (either name the 
number presented, or add one to it) to be 
read in English; and four were number-switching
lists to be read in French, Lists
11 to 18. Four geometrical figures were
employed to indicate which type of response 
was required. All stimuli were set in the 
center of such a figure with circles and triangles
indicating, according to instructions,
either English or French and with squares 
and diamonds indicating either the naming 
of the number presented or the number pre- 
sented plus one. 

Two lists of each type of switching (lan- 
guage switching, number switching in Eng- 
lish, and number switching in French) were 
presented with a 2-sec. delay after each re- 
sponse. This was to allow time for the an- 
ticipation of switches. The remaining two 
lists of each type of switching were pre- 
sented without delay. Two lists of each type 
of switching involved switches according to 
a regular and predictable pattern; two lists 
TABLE 1

SUMMARY OF LISTS AND CONDITIONS

List   Type and Condition                               Symbols Used

       Unilingual
  1    Numbers ranging 1-10                             Triangle (T)
  2    Numbers ranging 1-10                             Circle (C)
  3    Numbers ranging 1-20                             T
  4    Numbers ranging 1-20                             C

       Language Switching
  5    Regular pattern of switches; no delay            T+C
  6    Irregular pattern of switches; no delay          T+C
  7    Regular pattern of switches; 2-sec. delay        T+C
  8    Irregular pattern of switches; 2-sec. delay      T+C

       Unilingual
  9    Add 1 to numbers presented                       Square (S)
 10    Add 1 to numbers presented                       Diamond (D)

       Number Switching
 11    Regular pattern of switches; no delay            S+D
 12    Irregular pattern of switches; no delay          S+D
 13    Regular pattern of switches; 2-sec. delay        S+D
 14    Irregular pattern of switches; 2-sec. delay      S+D
 15    Regular pattern of switches; no delay            S+D
 16    Irregular pattern of switches; no delay          S+D
 17    Regular pattern of switches; 2-sec. delay        S+D
 18    Irregular pattern of switches; 2-sec. delay      S+D


of each type involved switching in an irregu- 
lar and unpredictable pattern. 

The order of presentation of lists and tasks 
was as far as possible counterbalanced across 
Ss within each group. Half of the Ss fol- 
lowed an English-French order throughout ; 
the other half followed the reverse order. 
Half of the Ss took the language-switching
tasks before the numerical switching ones, 
while the other half took these tasks in the 
reverse order. For half of the Ss, triangles 
symbolized English and circles symbolized 
French responses; for the remaining Ss the 
symbolism was the other way around. Simi- 
larly, for half of the Ss, a square meant they 
should add one and a diamond that they 
should name the number as presented; for 
the remaining Ss the symbolism was the 
reverse.

Procedure—The stimuli were photo- 
graphed on filmstrips and projected on a 
screen by means of a Dukane Projector 
(Model 576-47B) which has the characteris- 
tic of very rapid action, less than .05 sec. 
The stimuli appeared one at a time and re- 
mained visible until E pressed a button which 
moved the filmstrip and presented the next 
stimulus. The E always attempted to press 
the button immediately after S completed
his response. The circuit was fitted with a 


timer which, by choosing the appropriate but- 
ton, E could use to allow a delay of 2 sec. 
between his pressing the button and the ap- 
pearance of the next stimulus. The S re- 
sponded verbally. The relay which operated 
the projector also operated a clock which 
timed responses to one-hundredth of a sec- 
ond. The clock was stopped by a voice key 
at the commencement of S’s response. The 
clock was used to measure cumulative re- 
sponse latencies for each list. Responses to 
all switching lists were tape recorded. Sub- 
sequently, the recordings were analyzed and 
cumulative times were corrected for errors as 
well as false starts. Errors in response to 
lists which required no switching were so 
infrequent as to be negligible.

Before Ss were shown any particular list,
its character was explained to them and also 
the types of response they were required to 
make. Before beginning lists involving 
switching, Ss were shown six practice items 
to which they responded. When they met 
a criterion of one faultless run through the 
practice items, they were permitted to go 
on to the test list.

Computation of times—One of the prin- 
cipal difficulties associated with the use of 
voice keys is that any sound which is of suf- 
ficient intensity will trigger them. It is 
therefore necessary to devise some method
of correcting for “false starts.” The method 
adopted was first to identify all errors in 
the tape recordings auditorily, and then run 
the tapes through a mingograph which 
printed out the sound wave patterns on paper 
calibrated in hundredths of a second. Next 
the errors were located in the paper print- 
outs. Since the presentation of a stimulus 
was marked on the tape by an electrical im- 
pulse which appeared in the printouts, it was 
possible to measure the time from the pres- 
entation of the stimulus to the beginning of 
an erroneous response and to correct the 
cumulative time accordingly. 

In order to estimate the time due to 
switches in both language and number switch- 
ing, times for the corresponding unilingual 
conditions were subtracted from the switch- 
ing conditions. An example will make the 
procedure clearer. Suppose an S’s language- 
switching time on a task involving irregular 
switches had to be estimated. When errors 
due to false starts had been eliminated from 
the cumulative time for the list, the number 
of correct responses in each language was
counted and also the number of language
switches; then, that S’s average response
latencies for a single item in each of the 
languages was calculated on the basis of 
his performance on Lists 1 and 2 (unilin- 
gual). Suppose that he gave five correct 
French responses and four correct English 
ones in the switching task, then his average 
“French” unilingual time was multiplied by 
five and his average “English” unilingual 
time by four. The sum of the two quantities 
thus obtained was then deducted from his 
corrected switching time to yield a measure 
of time due to switching only. Finally, if 
in the switching task he made five faultless 
switches, his switching time was raised pro-
portionately to make it equivalent to nine
switches. This was done to make results 
comparable across conditions; there were 
nine switches in the conditions involving 
switching according to a regular pattern. 
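
The correction procedure described above can be sketched in a few lines of Python. This is an illustration only, not the authors' own computation: the function name, argument names, and all numerical values are hypothetical, chosen so that the arithmetic is easy to follow.

```python
def switching_time(corrected_total, n_correct, unilingual_mean,
                   n_switches, target_switches=9):
    """Estimate the time attributable to switching, scaled to nine switches.

    corrected_total : cumulative list time after removing false starts (sec.)
    n_correct       : correct responses per language, e.g. {"French": 5}
    unilingual_mean : mean unilingual latency per item for each language (sec.)
    n_switches      : number of faultless switches actually made
    """
    # Baseline: time the same correct responses would take without switching.
    baseline = sum(n_correct[lang] * unilingual_mean[lang] for lang in n_correct)
    switch_only = corrected_total - baseline
    # Raise proportionately so that every list is expressed per nine switches.
    return switch_only * target_switches / n_switches


# Hypothetical values, not taken from the study.
example = switching_time(
    corrected_total=6.0,
    n_correct={"French": 5, "English": 4},
    unilingual_mean={"French": 0.5, "English": 0.5},
    n_switches=5,
)
print(round(example, 2))
```

With these made-up values the unilingual baseline is 4.5 sec., which leaves 1.5 sec. for five switches and scales to 2.7 sec. for nine switches.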
RESULTS


Number naming.—Analysis of vari- 
ance of unilingual scores (Lists 1-4) 
by group, language, and stimulus un- 
certainty revealed no significant differ- 
ences. This implies that contrary to 
expectations all three groups were bal- 
anced bilinguals with regard to number 
naming. It also implies that, at least 
in this task of number naming, the 
amount of uncertainty in the stimulus 
did not affect response latencies. 

Language switching—Mean lan- 
guage switching times, computed in 
the manner outlined above, are given 
in Table 2. These were analyzed by 
groups, by order of switching (regu- 
lar-irregular), by interval (delay-no 
delay), and by Ss. The only signifi- 
cant result obtained was the effect of 
order of switching, which was tested 
against the mean square associated 
with Order × Ss within groups, F (1,
33) = 42.88, p < .01. That is, times 
for regular and predictable switches 
were significantly faster than those for 
irregular and unpredictable switches. 
The combined results for the two tasks 
involving regular switches yielded a 
mean time per switch of .21 sec.; those 
for the two tasks involving irregular 
switches yielded a mean time per 
switch of .39 sec. The mean difference 
between these two figures was .18 sec. 
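
The per-switch figures can be checked against Table 2, whose entries are totals for nine switches. The sketch below assumes that the delay and no-delay conditions are weighted equally when they are combined; the variable names are illustrative and not part of the original report.

```python
SWITCHES_PER_LIST = 9

# Table 2, language switching, regular pattern: no-delay and delay means (sec.)
regular_totals = [1.91, 1.78]

# Average the two conditions, then divide by the nine switches per list.
mean_total = sum(regular_totals) / len(regular_totals)
per_switch_regular = mean_total / SWITCHES_PER_LIST

# Per-switch means as reported in the text (sec.)
reported_regular = 0.21
reported_irregular = 0.39
difference = reported_irregular - reported_regular

print(f"regular, from Table 2: {per_switch_regular:.3f} sec. per switch")
print(f"irregular minus regular: {difference:.2f} sec.")
```

The value recovered from the table is about .205 sec. per switch, in line with the reported .21 sec.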

The absence of a significant F value 
associated with interval is contrary to 
expectations. At first sight, it would 


TABLE 2
MEAN SWITCHING TIMES IN SECONDS FOR NINE SWITCHES

                                  No Delay                                Delay
Type of Switching      Regular             Irregular              Regular             Irregular
Language               1.91 (1.99)         3.[illegible] (2.82)   1.78 ([illegible])  3.68 ([illegible])
Numerical (English)    1.55 (1.85)         3.[illegible] (3.37)   2.23 (1.90)         3.94 (3.81)
Numerical (French)     1.83 ([illegible])  3.31 (3.39)            2.32 (1.89)         3.36 (2.72)


Note.—SDs in parentheses.

seem that Ss anticipated as effectively
without a delay as with one. How- 
ever, the likely explanation is that even 
in the no delay condition the succes- 
sion of stimuli was not rapid enough 
to prevent Ss from anticipating. 

Finally, all mean values for language 
switching given in Table 2 were sig- 
nificantly greater than zero. This 
means that the requirement to switch 
languages added significantly to Ss’ re- 
sponse latencies. 

Number switching —The mean times 
for the numerical switching tasks are 
also given in Table 2. These were
submitted to analysis of variance by 
Language, by Order of Switches (reg- 
ular-irregular), by Interval (delay-no 
delay), and by Ss. Once again the 
only significant main effect was Order 
of Switches, F (1, 34) = 23.98, p <
.01. Two of the interactions were also
significant: Language × Order, F (1,
34) = 7.14, p < .05, and Language ×
Interval, F (1, 34) = 4.51, p < .05.
Further study revealed that in the first 
of these the Order factor had a larger 
effect in English than in French; in 
the second, the Interval factor also had 
a larger effect in English. Since these 
significant interactions did not affect 
the main conclusion, the significance of 
Order, they will not be made the sub- 
ject of comment in the discussion sec-
tion.

The mean time for regular switches 
was .22 sec. while that for irregular 
switches was .39 sec. The mean dif- 
ference between the two was .17 sec. 
All eight mean times for number 
switching were significantly greater 
than zero, which implies that the re- 
quirement to switch contributes sig- 
nificantly to response latencies. 

Language and number switching.— 
The final analysis was made in order 
to compare language switching with 
numerical switching. This was done
by subtracting each S’s language
switching time from his corresponding
numerical switching one. By follow-
ing this procedure systematically, eight
new measures were obtained for each
S, each of which was corrected for his 
language-switching times. These new 
measures permit one to make a very 
sensitive test of the differences be- 
tween language switching and numeri- 
cal switching. 

The corrected measures were sub- 
mitted to analysis of variance by lan- 
guage, by order, by interval, and by 
Ss. None of the mean squares yielded 
a significant F ratio. Furthermore, 
none of the mean values for corrected 
measures differed significantly from 
zero. In short, in 16 tests no signifi- 
cant differences between language 
switching and numerical switching 
emerged. So the authors are entitled 
to conclude that the two sorts of task 
were accomplished by Ss in the same 
way, or at least that neither task 
brought into play any skill which was 
not involved in the other. 

Errors—In all, Ss made 6,480 re- 
sponses, of which 4,320 were made in 
switching conditions. The total num- 
ber of errors made in these conditions 
was 181. Of these, 105 were stumbles 
or false starts which S corrected him- 
self and 76 were wrong responses in 
which S used the wrong language or 
said the wrong number. Taken to- 
gether, the two types of error amount 
to about 4% of the switching responses. 
No statistical analysis of errors was 
made, but they appear to have occur- 
red at random throughout the various 
tasks. 
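
The error percentage follows directly from the counts reported above. A minimal check, using only figures given in the text (the variable names are illustrative):

```python
total_switching_responses = 4320
stumbles = 105          # false starts that S corrected himself
wrong_responses = 76    # wrong language or wrong number

total_errors = stumbles + wrong_responses
error_rate = total_errors / total_switching_responses

print(total_errors, f"{error_rate:.1%}")
```

The two error types sum to the reported 181, or roughly 4.2% of the switching responses.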


DISCUSSION 


The finding of central interest is that 
language switching takes an observable 
amount of time. Similar findings have 
been reported by Kolers (1966) and by
Macnamara (1967c). One might object
that the observed increase in response
latencies in the switching conditions can
be accounted for by explanations other
than the one proposed, i.e., the operation 
of a language-switching device. Some 
colleagues have suggested that the re- 
quirement to keep the switching instruc- 
tions in mind throughout a switching task 
might have a disruptive effect on per-
formance. The suggestion, however,
seems to stem from the fact that lexical 
items are clearly classified by language 
and that serial retrieval is facilitated by 
sticking to one language. This is, of 
course, the point of the authors’ inter- 
pretation. If, on the contrary, lexical 
items were stored in a random fashion, 
then the requirement to retrieve 5 specific
items from L1 and 5 from L2 would
present no greater difficulty than the re-
quirement to retrieve 10 specific items
from L1. In both the unilingual and
switching tasks, all Ss were required
to do was to retrieve 10 specific items 
from a total of 20. The point can be ex- 
pressed equally clearly in the language of 
set. Bilingual persons have well estab- 
lished but distinct sets for responding in 
L1 and in L2; what they do not have is
an equally well-established superordinate
set which can guide bilingual produc-
tion of the type these tests required. In-
deed, the whole study of bilingualism
gives one the distinct impression that
the bilingual’s linguistic performance is
similar to that of the musician who ob-
serves the notation for key at the begin-
ning of a piece of music and then forgets
about it though in his playing he per- 
forms the actions appropriate to the key. 
Similarly, the bilingual once started in 
one language can forget about which 
language he is speaking and yet obey the 
rules of that language. 

Both the time increases and the number 
of errors involved in switching from one 
language to the other suggest that there 
are greater difficulties attached to operat- 
ing the output switch than the input one. 
The latter appears to operate quite auto- 
matically as a function of the input. 

A second possible objection to the
theory being proposed is that the increase
in response latencies in the switching 
conditions is due to increase in response 
uncertainty. In these conditions Ss had 
to be ready to respond in any of 20 ways, 
whereas in the unilingual conditions, they
had to be ready to respond in only 10 
ways. However, the finding that re- 
sponse latencies for numbers drawn from 
1 to 20 are no greater than those for num- 
bers drawn from 1 to 10 shows that there 
is no simple relationship between re- 
sponse uncertainty and response latency. 
When responding to numbers drawn from 
1 to 20, Ss had to cope not only with 
increased response uncertainty but also 
with increased uncertainty in the stim- 
ulus, and yet these increases did not 
affect their performance. This particu- 
lar finding is supported by a consider- 
able number of studies (e.g., Forrin, 
Kumler, & Morin, 1966; Morin & Forrin, 
1965). 

The finding that bilinguals can improve 
their response times by anticipating a 
language switch seems to be like the ef- 
fect of a reduction in response uncertainty 
(for similar findings see Dillon, 1966; 
Schaffer, 1965, 1966, 1967; Schlesinger, 
1964). When responding to lists which 
demanded switching in a fixed pattern, Ss
performed as though they had already 
determined the response category before 
the stimulus appeared, or at least begun 
to move in that direction. The fact that
they were able to benefit from doing so, 
however, implies that their languages are 
psychologically distinct systems which 
can be primed for use even before the 
particular use has been specified. But 
this, of course, brings one back to the 
interpretation of the data in terms of 
set and of switching from one set to 
another. 

In this connection it is important to 
recall that in 16 comparisons of language 
switching with unilingual number switch- 
ing, no significant differences between 
the two were found. Thus, language 
switching of the type studied here seems 
to require no psychological skill peculiar 
to bilingualism, but rather a skill which 
is equally applicable in a large number
of operations in which persons are asked
to switch modes of response rapidly. 

In summary, then, there are three 
levels of performance which run quite 
parallel for both language switching and 
unilingual number switching: (a) uni- 
lingual number naming, the common 
base line from which both language and 
number switching depart; (b) switch- 
ing in a predictable pattern; and (c) 
switching in an unpredictable pattern. At 
the first level, simple number naming, it 
is found that increasing stimulus uncer- 
tainty, and consequently response un- 
certainty, does not affect performance. 
By contrast, in the switching tasks the 
reduction of stimulus and response un- 
certainty makes a marked difference to 
performance. It seems logical to con- 
clude that the basic reason why these two 
modifications have quite different effects 
lies in the manner in which S arrives 
at the correct response. In simple num- 
ber naming, responses belong to only one 
category; in switching they belong to 
two categories and S must choose be- 
tween them. In other words, the two 
conditions involve simple and choice reac- 
tion times, respectively. A reduction of 
response uncertainty is known to improve 
times in the latter condition but not in 
the former when stimulus-response transi- 
tions are as overlearned as they are in 
number naming. The same line of 
reasoning also leads to the conclusion that 
the major reason why latencies for 
switching according to a fixed pattern are 
longer than those for simple number nam- 
ing is the change from simple to choice 
reaction times. However, this also shows 
that the bilingual who had two sets of 
names for each stimulus list could suc-
cessfully exclude one set (L1) when re-
quired to respond unilingually in the
other (L2). Thus, again there is the
idea of distinct linguistic systems and 
a switching device which determines 
which system is on. 

The size of switching times bears a 
curious resemblance to other time mea- 
sures of linguistic performance. Re- 
cently, Lenneberg (1967) reviewed the 
relevant literature and proposed that 


J. MACNAMARA, M. KRAUTHAMMER, AND M. BOLGAR 


there is a quantity in the measurement of 
linguistic performance analogous to 
Planck’s h in physics (the analogy is the
authors’). Lenneberg suggests that there 
is some biological characteristic of the 
human brain which does not permit lan- 
guage operations occurring in the CNS 
in a period less than .18 sec. In the 
present studies the authors found that in 
conditions which involved switching in a 
regular pattern, the average time for 
individual switches is .21 sec. The 
average increase from this to conditions 
which involved switching according to an 
irregular pattern is .18 sec. These find- 
ings are support for Lenneberg’s thesis. 
However, it is necessary to point out that 
these figures, like those cited by Lenne- 
berg, are averages, and there is consider- 
able variation among individuals. 

Finally, these findings, although based 
on a very limited sample of language in a 
very formal setting, suggest an explana- 
tion of how in normal discourse the bi- 
lingual can frequently, without seeming 
to pause, have recourse to his second lan- 
guage for a word, a phrase, or an apt 
quotation. He anticipates the switch 
while speaking and thus avoids disrupting 
the flow of his speech.

OF MEDIATION


BRUCE EARHARD AND MARCIA EARHARD


Dalhousie University 


8 experiments dealing with interference and strategies in the study of 
mediation were reported. In Stage 1 of all experiments, Ss associated 
2 unrelated verbal units with a mediator. In Stage 2, performance of 
mediator-linked (M-L) pairs of associates was compared with per- 
formance on new and unrelated control (C) pairs of associates. The 
results indicated that M-L pairs were learned less rapidly than C 
pairs even if mediating linkages were overlearned. Increasing the 
time for mediational processing, and combining increased time with 
overlearning revealed no differences in the learning of M-L and C 
pairs. Superior performance on M-L pairs was found to result only 
when a mediating strategy was combined with overlearning and/or
increased time for processing.


In a typical mediation study, Ss are 
required to learn three successive 
paired-associate (P-A) lists: An A-B 
set of associations is learned in Stage 
1, a B-C set of associations in Stage 2, 
and an A-C set of associations in the 
third, or test stage. The more rapid 
acquisition of the A-C test-stage list 
in a mediation paradigm than in a 
standard control paradigm (A-B, D-C, 
A-C) is attributed generally to medi- 
ated facilitation. It is assumed by me- 
diation theorists that the common B 
responses which are learned to both 
the A and C members of the test-stage 
pairs in mediation paradigms during 
the first two stages of learning provide 
indirect mediating linkages that facili- 
tate learning (Horton & Kjeldergaard, 
1961; Jenkins, 1963). 

The mediation theory interpretation 
of interparadigm differences has been 
opposed by Earhard and Mandler 
(1965a) who have argued, on the basis 
of interference theory principles, that
all associations acquired to the A and
C members of the test list, during the 
learning of Stages 1 and 2, interfere 
with, rather than facilitate, test-list 
learning. According to Earhard and 
Mandler (1965a), the acquisition of 
the A-C test list is less rapid in control 
paradigms because more of the A-B 
associations learned in Stage 1 are 
available at the time of learning the 
A-C test list in control paradigms than 
in mediation paradigms. In mediation 
paradigms, the learning of B-C associa- 
tions in Stage 2 interferes with and 
causes forgetting of some of the A-B 
associations learned in Stage 1, where- 
as in standard control paradigms, 
Stages 1 and 2 are unrelated and for-
getting of Stage 1 A-B associations is
negligible (Earhard & Mandler, 1965b; 
McGovern, 1964). Since A-B asso- 
ciations learned in Stage 1 are assumed 
to interfere with the acquisition of the 
A-C test list, and more A-B associa- 
tions are available at the time of test- 
list learning in the control paradigm 
than in mediation paradigms, it follows 
that the A-C test list should be learned 
less readily in control paradigms than 
in mediation paradigms. Support for
this interpretation has been provided
by Earhard and Earhard (1967) who 
found that performance on the test 
stage of the standard control condition 
was inferior to performance on the 
test stage of a relatively interference- 
free, neutral control paradigm consist- 
ing of three unrelated P-A lists. Fur- 
ther support has been provided by in- 
vestigations which show mediationlike 
performance can be produced by inter- 
list interference effects (Carlson, 1966; 
Jenkins & Foss, 1965; Mandler & Ear-
hard, 1964).

While it seems clear that learning in 
the test stage of control paradigms is 
impaired in accordance with arguments 
of Earhard and Mandler, the more im- 
portant question as to whether linking 
two unrelated verbal units to a media- 
tor, by means of a standard mediation 
paradigm, facilitates or impairs the 
subsequent association of these units, 
remains unanswered. Attempts to an- 
swer this question by comparing per- 
formance on the test stage of a media- 
tion paradigm with performance on the 
corresponding stage of a neutral con- 
trol paradigm have not proved success- 
ful. Horton and Wiley (1967) re- 
ported the result of such a comparison 
to be superior performance in the me-
diation paradigm, but their use of non-
sense materials makes their comparison
difficult to interpret. With nonsense
materials as the items to be associated,
the A and C terms of the test list
would be available in the mediation
paradigm, since they had been encoun-
tered and learned in Stages 1 and 2,
but this would not be the case in the
neutral control paradigm which is com-
posed of three entirely different lists
of associates. This difference in avail-
ability at the beginning of the test stage
could be sufficient to produce the “me-
diated facilitation” they report. A
similar experiment by Earhard and
Earhard (1967) employed well-inte-
grated English words in order to make
mediation and neutral control para- 
digms more comparable with respect 
to availability factors, and they failed 
to find any difference between perform- 
ance on mediation and control para- 
digms. 

Although the problem of availability 
can be overcome to a large degree by 
making use of well-integrated verbal 
material, the major difficulty in ade- 
quately ascertaining whether linking 
two unrelated verbal units facilitates 
or impairs their subsequent association 
is the fact that substantial numbers of 
associations learned in Stage 1 of three- 
stage mediation paradigms are unavail- 
able at the time test-list learning is 
undertaken (Earhard & Mandler, 
1965b; McGovern, 1964). It is diffi-
cult to imagine how A-B associations
which are forgotten can play any role 
whatsoever in facilitating or retarding 
the acquisition of an A-C test list. Un- 
til steps are taken to ensure that all 
associations linking test-list terms to a 
common response are intact at the time 
of test-list learning, an adequate eval- 
uation of the positions taken by inter- 
ference theory and mediation theory 
cannot be achieved. 

The main objective of the present 
study was to evaluate the interference 
theory interpretations of Earhard and 
Mandler (1965a)—with respect to the 
effect of associating pairs of verbal 
units to be learned with a common re- 
sponse—under conditions where it was 
possible to be relatively certain that all 
associations were available at the be- 
ginning of acquisition of the test list 
for mediation effects. Under such cir- 
cumstances, interference theory pre- 
dicts maximum interference with test- 
list learning whereas mediation theory 
would predict maximum facilitation.

EXPERIMENT I
Method 


Design—The experimental task required 
Ss to learn two successive lists of six paired 
associates. The structure of these lists is 
shown in Table 1. 

List 1 was constructed so that three of 
the verbal units in the list (B, E, and I) acted 
both as the response to one term and as the 
stimulus to another term. These terms thus 
served to provide an indirect, mediating link 
between previously unrelated verbal units 
(A and C, D and F, H and J) which were 
also contained in the list. List 2 was com- 
posed of two different sets of pairs. Those 
items associated with the same third term in 
List 1 were paired with each other in List 2 
(A-C, D-F, H-J), and these mediator-linked 
(M-L) pairs constituted one-half of the 
associates presented for learning. The re- 
maining three pairs of associates in List 2 
(K-L, M-N, O-P) were control (C) pairs, 
i.e., new paired associates to which Ss had
not been exposed previously. The point of 
major importance about this design is that 
by carrying List-1 learning to a criterion 
of at least one perfect recitation, one could 
be relatively confident that all necessary 
mediating linkages were available at the time 
of the test for mediation effects in List 2. 
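
The relation between the two lists can be made concrete with a short sketch that derives the mediator-linked pairs of List 2 from the pairs of List 1. The letter labels are those of Table 1; the function name and the data layout are illustrative assumptions, not part of the original procedure.

```python
# List 1 as (stimulus, response) pairs, in the order given in Table 1.
list1 = [("A", "B"), ("H", "I"), ("D", "E"),
         ("B", "C"), ("E", "F"), ("I", "J")]


def mediator_linked_pairs(pairs):
    """Return the pairs whose members share a mediating term in `pairs`."""
    # Map each response term to the stimulus that elicits it.
    stimulus_for = {resp: stim for stim, resp in pairs}
    linked = []
    for stim, resp in pairs:
        # A term that is both a response and a stimulus acts as a mediator:
        # X-stim and stim-resp together link X to resp.
        if stim in stimulus_for:
            linked.append((stimulus_for[stim], resp))
    return sorted(linked)


print(mediator_linked_pairs(list1))
```

For the structure in Table 1 the mediators are B, E, and I, and the derived pairs are A-C, D-F, and H-J, the M-L half of List 2.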

Since well-integrated verbal materials were 
used, a comparison of the rate at which M-L 
pairs were learned with the rate at which 
new C pairs were learned provided an esti- 
mate of the effect that associating two dif- 
ferent terms with a common mediating re- 
sponse had upon the course of acquisition 
more precisely than has been possible with 
conventional three-stage paradigms.

Materials—All lists contained six paired 
associates and were constructed from a pool 
of 36 low-frequency English words selected 
in accordance with procedures described by 
Horton and Kjeldergaard (1961). Three 
different sets of pairings were constructed. 


TABLE 1
STRUCTURE OF LISTS 1 AND 2 USED IN EXP. I

List 1      List 2
A-B         A-C
H-I         D-F      Mediator-Linked Pairs
D-E         H-J
B-C         K-L
E-F         M-N      New Control Pairs
I-J         O-P

In each set of pairings, no pair of terms
began with the same initial letter, and there 
was no duplication of initial letters on either 
the stimulus or the response sides of each 
set of pairings. Five different orders of 
presentation of individual pairs were used 
in each list to reduce the possibility of serial 
learning. In the orders of presentation for 
List 1, steps were taken to ensure that the 
presentation of at least one unrelated pair 
of associates intervened between pairs of 
associates containing the same term. 

Procedure—Standard P-A instructions 
were given prior to List-1 learning. Before 
List-2 learning began, Ss were told that 
some of the words that they would see had 
been presented previously in List 1. Both 
lists were presented at a 2:2-sec. presenta- 
tion rate on a Stowe memory drum. The 
learning of both lists was carried to a cri- 
terion of one perfect trial, but on List 2 a 
minimum of 20 trials were given. There 
was no intertrial interval and the interlist 
interval was only long enough to permit 
instructions to be given. 

Subjects —Twenty-four female undergrad- 
uate Ss were drawn at random from a pool 
of first and second year university students. 
The data of two Ss were discarded and re-
placed because of their failure to learn List 1
in less than 60 trials. 


Results 


The Ss required an average of 21 
trials to learn List 1 to a criterion of 
one correct trial. In considering the 
learning of List 2, attention will be 
given to performance over the first 10 
trials and to the number of trials re- 
quired to reach a criterion of one per- 
fect trial. The data from Trials 10- 
20 were analyzed but provided no in- 
formation in addition to that afforded 
by consideration of the first 10 trials, 
and therefore will not be discussed. 

Figure 1 shows, trial by trial, the 
average number of correct responses 
made to M-L pairs and C pairs during 
the first 10 trials on List 2. It may 
be seen that C pairs were less readily 
acquired during the first few trials; but 
that after Trial 3, performance on 
C pairs of associates was consistently su-
perior to performance on M-L pairs.

Fig. 1. Average trial-by-trial performance
on mediator-linked and control pairs during 
Trials 1-10 of Exp. I. 


In view of the apparent interaction 
C × M-L Pairs × Trials, a trial-by-
trial analysis of performance over the
first 10 trials was carried out after the 
data had been subjected to an arc-sine 
transformation. The analysis revealed 
a significant Trials × Type-of-Pair in-
teraction, F (9, 207) = 2.01, p < .05.
The analysis also indicated that, on the 
average, significantly better perform- 
ance was displayed on C pairs, F (1, 
23) = 4.71, p < .05. Further analyses 
were carried out on the number of cor- 
rect responses made on the first 5 trials, 
the first 10 trials, and the number of 
trials to criterion. Control pairs and 
M-L pairs did not differ in terms of the 
number of correct responses made over 
Trials 1-5, t(23) = .61, p > .05, nor
in terms of trials to criterion, t(23) =
1.87, p > .05, but differences in favor
of C pairs were observed when the
average number of correct responses
made over the first 10 trials were given
consideration, t(23) = 2.08, p < .05.
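
For readers unfamiliar with the arc-sine transformation, the sketch below shows one common form of it and checks the degrees of freedom reported for the interaction. The text does not say which variant of the transformation was used, so the formula here is an assumption, as is the treatment of both Trials and Type of Pair as within-S factors.

```python
import math


def arcsine_transform(p):
    """Variance-stabilising transform of a proportion p in [0, 1] (radians)."""
    return 2 * math.asin(math.sqrt(p))


# Proportion correct out of the three pairs of one type on a given trial.
for k in range(4):
    print(k, round(arcsine_transform(k / 3), 3))

# Degrees of freedom for Trials x Type-of-Pair with 24 Ss and 10 trials.
n_subjects, n_trials, n_pair_types = 24, 10, 2
df_interaction = (n_trials - 1) * (n_pair_types - 1)
df_error = df_interaction * (n_subjects - 1)
print(df_interaction, df_error)
```

Under these assumptions the degrees of freedom come to 9 and 207, matching the reported F (9, 207).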


Discussion 


The initial disadvantage suffered by C
pairs was expected. Underwood and
Schulz (1960) have provided evidence
indicating that verbal units must first be-
come available as responses before asso-
ciative learning can take place. Control
terms were less available initially as re- 
sponses than were M-L pairs. Once C 
terms became available, however, C pairs 
were learned more readily than were M-L 
pairs. Experiment I thus provided clear 
support for the contention of Earhard and 
Mandler (1965a) that associating two un- 
related terms independently with a com- 
mon term retards, rather than facilitates, 
the subsequent association of these un- 
related terms. 

Experiment I left open the possibility, 
however, that under more opportune cir- 
cumstances, mediated facilitation, i.e., the 
more rapid acquisition of M-L pairs than 
C pairs, would be observed. Earhard and
Mandler (1965a) had suggested that
overlearning of the various mediation 
linkages may be important in determining 
whether mediation will occur. It has also 
been suggested that mediation may re- 
quire more time than is available during 
the 2-sec. anticipation interval employed 
conventionally in P-A studies (Under- 
wood, 1964). Three additional experi- 
ments were designed, therefore, to evalu- 
ate the effect of overlearning and time 
factors on mediation. 


EXPERIMENTS II, III, AND IV


The purpose of Exp. II was to as- 
certain whether strengthening the vari- 
ous mediation linkages by requiring Ss 
to overlearn these linkages during List 
1 would facilitate acquisition of M-L 
pairs in List 2. 

Experiment III was designed to de- 
termine whether providing Ss with ad- 
ditional time to work through the vari- 
ous mediation linkages during the 
anticipation interval of List 2 would 
facilitate acquisition of M-L pairs. 

Experiment IV took into considera- 
tion the possibility that both an in- 
crease in the strength of mediation 
linkages and additional processing time 
are required in order for mediational 
effects to be observed.

Method


Experiments II, III, and IV employed 
the same method and materials as Exp. I. 
They differed from one another, and from 
Exp. I, only in the number of trials given 
on List-1 learning and/or in the presentation 
rate used in List 2. List 2 was learned to 
a criterion of one correct trial, but a mini-
mum of 20 presentations were made in all
three experiments.

Procedure—1. Experiment II. List-1 
learning was carried to a criterion of one 
correct trial after which 30 additional over-
learning trials were given. List 1 and List
2 were presented at a 2:2-sec. presentation
rate.

2. Experiment III. List 1 was learned
to a criterion of one perfect trial at a
2:2-sec. presentation rate. List 2 was pre-
sented at a 4:2-sec. presentation rate.

3. Experiment IV. The Ss were given 
30 additional learning trials on List 1 after 
a criterion of one correct trial had been 
reached. List 1 was presented at a 2:2-sec.
presentation rate; List 2 was presented at 
a 4:2-sec. presentation rate. 

Subjects—Eighteen female students were
drawn at random from a subject pool of first 
and second year university students to serve 
in each of the three different experiments. 
The data from four Ss were discarded and 
replaced because of equipment failure, or 
because they were not able to complete first-
list learning in less than 60 trials.


Results 


The mean number of trials of List-1 
learning given in Exp. II, III, and IV 
were 54.3, 23.6, and 58.1, respectively. 
The level of overlearning was in excess 
of 100% in both Exp. II and IV 
(123% and 107%, respectively). 
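
The overlearning percentages can be recovered from the mean trial counts, on the assumption that the level of overlearning is the number of additional trials expressed as a percentage of the trials needed to reach criterion. The computation below works from the group means given in the text and not from individual Ss' data; the names are illustrative.

```python
ADDITIONAL_TRIALS = 30

# Mean total List-1 trials (criterion plus overlearning) reported in the text.
mean_total_trials = {"II": 54.3, "IV": 58.1}


def overlearning_percent(total_trials, extra=ADDITIONAL_TRIALS):
    """Additional trials as a percentage of the trials needed to reach criterion."""
    trials_to_criterion = total_trials - extra
    return 100 * extra / trials_to_criterion


for exp, total in mean_total_trials.items():
    print(exp, round(overlearning_percent(total)))
```

Under this assumption the means give about 123% for Exp. II and 107% for Exp. IV, in agreement with the reported values.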

The average number of correct re- 
sponses on each of the first 10 trials of 
List 2 is shown in Panels A, B, and C 
of Fig. 2 for Exp. II, III, and IV, re- 
spectively. As in the preceding experi- 
ment, an analysis of Trials 10-20 added 
no additional information and will not 
be given further consideration. 

Performance in Exp. II was very 
similar to that observed in Exp. I. On 
the first two trials, C pairs differed 
little from M-L pairs; but, after Trial
Fig. 2. Panels A, B, and C show the
average trial-by-trial performance on media- 
tor-linked and control pairs during Trials 1- 
10 of Exp. II, III, and IV, respectively. 


2, performance on C pairs consistently 
surpassed performance on M-L pairs. 
A trial-by-trial analysis of variance of 
the C and M-L learning curves was 
carried out after an arc-sine trans- 
formation of the data. This analysis 
indicated that performance on M-L 
and C pairs did interact with trials, 
[F ratio illegible in source]. The
analysis also revealed that, on the av- 
erage, C pairs were learned more 
readily than M-L pairs, F (1, 17) = 
17.30, p < .001. The most reasonable 
interpretation of the interaction re- 
mains the explanation advanced for 
Exp. I. On early trials, C terms were 
not as available as M-L terms, and 
thus the initial learning of these types 
of pairs was not different. With ad- 
ditional trials, C terms became availa- 
ble, and C pairs were learned more 
rapidly than M-L pairs. 
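
The arc-sine transformation applied before the analysis of variance is commonly computed as the inverse sine of the square root of a proportion. A minimal sketch, assuming that form was the one used (the article does not give the formula) and assuming a hypothetical count of eight pairs of each type:

```python
import math


def arcsine_transform(correct: int, total: int) -> float:
    """Variance-stabilizing transform of a proportion correct, in radians."""
    if not 0 <= correct <= total or total <= 0:
        raise ValueError("need 0 <= correct <= total and total > 0")
    return math.asin(math.sqrt(correct / total))


# Hypothetical scores for one S on one trial: correct responses out of 8 pairs.
PAIRS_PER_TYPE = 8  # assumed, not taken from the article
for n_correct in (0, 2, 4, 8):
    print(n_correct, round(arcsine_transform(n_correct, PAIRS_PER_TYPE), 3))
```

The transform stretches proportions near 0 and 1, where raw scores are compressed, so that trial-by-trial variances are more nearly equal.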

Additional confirmation of the rela- 
tive superiority of performance on C 
pairs in Exp. II was obtained when 
M-L pairs and C pairs were compared 
with respect to the total number of cor- 
rect responses over the first 5 trials, 
t(17) = 2.74, p < .02; over the first 
10 trials, t(17) = 4.34, p < .001; and 
in terms of the number of trials to cri- 
terion, t(17) = 3.38, p < .01. 

The results of Exp. III and IV dif- 
fered from those obtained in Exp. II. 
It is evident in Panels B and C of Fig. 
2 that there was virtually no difference 
between performance on M-L and C 
pairs. This impression was confirmed 
by statistical analysis. Regardless of 
whether performance on M-L and C 
pairs was analyzed in terms of trials 
to criterion, number of correct re- 
sponses made over the first 5 trials, 
the first 10 trials, or on a trial-by-trial 
basis, no difference between the two 
sets of terms reached the 5% level of 
significance. 
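
The within-S comparisons of M-L and C pairs reported in this section have 17 degrees of freedom, which is what a paired t test on 18 Ss yields. A minimal sketch of that computation, assuming a paired test on difference scores was used; the function name and the demonstration scores are hypothetical:

```python
import math
from statistics import mean, stdev


def paired_t(scores_a, scores_b):
    """Return (t, df) for a paired t test on two equal-length score lists."""
    if len(scores_a) != len(scores_b) or len(scores_a) < 2:
        raise ValueError("need two equal-length lists with at least two Ss")
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    standard_error = stdev(diffs) / math.sqrt(n)  # stdev uses n - 1
    return mean(diffs) / standard_error, n - 1


# Hypothetical correct-response totals for four Ss on C and M-L pairs.
t_value, df = paired_t([3, 5, 7, 9], [1, 2, 3, 4])
print(round(t_value, 3), df)
```

With 18 Ss per experiment the same function returns df = 17, matching the values given in the text.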


Discussion 


The experimental manipulations em- 
ployed in Exp. II, III, and IV must be 
judged unsuccessful in that they did not 
produce better performance on M-L pairs 
than on C pairs. There is little reason to 
think that practice on List 1 beyond the 
30 additional trials given, or further in- 
creases in the anticipation interval would 
have resulted in better performance on 
M-L pairs. The level of overlearning 
was beyond the 100% level in Exp. II and 
IV, and the 4-sec. anticipation interval 
used in Exp. III and IV should have pro- 
vided more than enough time for Ss to 
work through mediation chains of the 
length used in the present experiment. It 
seems more reasonable to assume that 
overlearning of mediation links and/or an 
increase in anticipation interval are not 
sufficient, in themselves, to produce medi- 
ated facilitation. 

One way of accounting for the above 
findings might be to assume that media- 
tion is fundamentally a conceptual process 
involving the use of rules, plans, and 
strategies by Ss (Earhard & Mandler, 
1965a; Miller, Galanter, & Pribram, 
1960). According to this view, the fail- 
ure to find superior acquisition of M-L 
pairs in P-A experiments of the present 
type may be attributed to the fact that 
Ss cannot discover, or fail to employ, 
mediation rules or strategies during learn- 
ing. In order to examine this possibility 
a fifth experiment was carried out. 


EXPERIMENT V 


Experiment V used the same method 
and materials as the preceding experi- 
ments. It differed from them, how- 
ever, in terms of the instructions which 
were given before List-2 learning. The 
special instructions employed in Exp. 
V were designed to provide Ss with 
a mediating rule that would permit 
mediation effects to be observed if 
mediation were fundamentally a con- 
ceptual mechanism. 


Method 


Procedure—List 1 was learned to a cri- 
terion of one correct trial, and 30 overlearn- 
ing trials were given. The presentation rate 
of List 1 was 2:2 sec. List 2 was presented 
at a 4:2-sec. rate until a criterion of one 
correct trial was reached, but a minimum of 
20 trials were given. 

Instructions—The standard P-A instruc- 
tions used in the previous experiments were 
given prior to List-1 learning. However, 
immediately before List-2 learning, special 
instructions about how to mediate were 
given. These mediation instructions in- 
formed Ss that some of the terms they had 
encountered previously, in List 1, would ap- 
pear also in List 2 on the left-hand side of 
the memory drum. Whenever they en- 
countered such a term (i.e., A) they were 
to think of the response they had learned 
to that word (i.e., A-B) and then they 
were to think of the word associated with the 
word of which they were thinking (i.e., 
B-C). They were told that this latter 
word (C) would be the word that would 
appear on the right-hand side of the memory 
drum and be correct. A suitable example of 
this procedure was given using terms which 
did not appear in the experiment. The Ss 
were also informed that new pairs of words 
that they had not encountered previously 
would be presented, and it was equally 
as important to learn these pairs. 
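
The mediating rule given in these instructions amounts to a two-step lookup through the first-list associations. A minimal sketch, assuming that List 1 contained both the A-B and the B-C links; the words and the function name are hypothetical and do not come from the experimental materials:

```python
from typing import Optional, Tuple


def mediate(stimulus: str, list1: dict) -> Optional[Tuple[str, str, str]]:
    """Follow A -> B -> C through List 1; return None for a new (control) term."""
    b = list1.get(stimulus)
    if b is None:
        return None  # stimulus was not a List-1 stimulus
    c = list1.get(b)
    if c is None:
        return None  # no second link, so the rule does not apply
    return (stimulus, b, c)


# Hypothetical List 1 holding one A-B link and one B-C link.
list1 = {"river": "stone", "stone": "candle"}
print(mediate("river", list1))   # mediator-linked stimulus
print(mediate("window", list1))  # control stimulus, must be learned directly
```

For a mediator-linked stimulus the last element of the returned chain is the List-2 response; for a control stimulus the rule gives no answer and the pair has to be learned as new.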

Subjects—The 18 Ss were drawn at ran- 
dom from a subject pool of first and second 
year undergraduate female students. 


Results 


The average number of correct re- 
sponses made on M-L pairs and C 
pairs during the first 10 trials of List 
2 is shown in Fig. 3. It appears that 
the instructions providing Ss with a 
mediating strategy were successful. 
During early trials, substantially su- 
perior performance was observed on 
M-L pairs. 

Fig. 3. Average trial-by-trial performance on mediator-linked 
and control pairs during Trials 1-10 of Exp. V. 

Statistical analysis indicated that sig- 
nificantly more correct responses were 
made to M-L pairs over the first 5 
trials, t(17) = 5.12, p < .001, over the 
first 10 trials, t(17) = 2.98, p < .01, 
and in terms of the number of trials 
to a criterion of one perfect trial, t(17) 
= 2.12, p < .05. The initial superior- 
ity of M-L pairs did not persist over 
all trials. A trial-by-trial analysis of 
performance on M-L and C pairs over 
the first 10 trials, which was carried 
out after an arc-sine transformation of 
the data, showed that the difference 
between M-L and C pairs interacted 
with trials, F (9, 153) = 12.97, p < 
.001. On the average, however, better 
performance was observed on M-L 
pairs than on C pairs, F (1, 17) = 
7.99, p < .05. 


Discussion 


Although instructions had an immedi- 
ate facilitating effect on M-L pairs, it is 
evident from Fig. 3 that the curves for 
M-L and C pairs converged rapidly be- 
fore an asymptotic level of performance 
had been reached. The reason for this 
rapid convergence is not immediately 
apparent. One very likely explanation lies 
in the comments made by a number of Ss 
to the effect that after several trials they 
tended to forget or confuse terms involved 
in mediating linkages and suffered a per- 
formance loss. 

In spite of the short-lived superiority of 
M-L pairs, the data of Exp. V provide 
the first clear evidence of superior per- 
formance on M-L terms in this series of 
studies. In interpreting these data, how- 
ever, a degree of caution is necessary. 
Before it may be concluded that the ap- 
plication of a mediating strategy facili- 
tated performance on M-L pairs, it is 
necessary to consider an alternative pos- 
sibility. It may be the case that the differ- 
ence between M-L and C pairs is not due 
to the application of a mediation strategy, 
but rather that the instructions lead Ss to 
concentrate on the learning of M-L pairs 
at the expense of C pairs. In other words, 
the differences between M-L and C pairs 
may represent impairment in the learning 
of C pairs rather than facilitation in the 
learning of M-L pairs. This possibility 
can be evaluated by comparing perform- 
ance on C pairs in Exp. V with perform- 
ance on C pairs in Exp. IV, which dif- 
fered from Exp. V only in that special 
mediating instructions were absent. The 
fact that first-list performance of the Ss 
in Exp. IV and V was not different (F 
< 1) indicates that the learning ability of 
the two different groups of Ss was simi- 
lar, and that a comparison of the learning 
of the second-list control pairs is valid. 

A comparison of the learning of C pairs 
in Exp. IV and V in terms of the number 
of correct responses made over the first 
5 trials, the first 10 trials, and the num- 
ber of trials required to learn all C pairs 
was carried out. Differences in perform- 
ance levels on C pairs in the two experi- 
ments were extremely small, and on none 
of the measures analyzed did any differ- 
ence approach the 5% level of signifi- 
cance. It would appear reasonable to 
conclude, therefore, that the difference 
between M-L and C pairs found in Exp. 
V represented a facilitation in the acqui- 
sition of M-L pairs and not an impair- 
ment in the learning of C pairs. 

The results of Exp. V have important 
implications. In Exp. IV, Ss overlearned 
the various mediational linkages, and 
were provided with long anticipation in- 
tervals to work through mediation chains, 
and no evidence of mediated facilitation 
was observed. When these same proce- 
dures were used in Exp. V, and Ss were 
provided with a mediation strategy, su- 
perior performance was observed on M-L 
pairs. This suggests that it is necessary 
for Ss to be presented with, or discover 
and apply a mediation rule or strategy in 
order for mediation effects to be observed. 
What remains to be determined is 
whether the employment of a strategy is 
itself sufficient to produce evidence of 
mediation, or whether in addition to a 
strategy, overlearning and/or additional 
time to work through mediation chains is 
required. The remaining three experi- 
ments in this series addressed themselves 
to this problem. 


EXPERIMENTS VI, VII, AND VIII 


Experiment VI was designed to de- 
termine whether the mediated facilita- 
tion manifested in Exp. V would be 
observed if Ss overlearned the various 
mediation linkages and were given a 
mediation strategy, but were not pro- 
vided with the long anticipation inter- 
val used in Exp. V. 

Experiment VII attempted to deter- 
mine whether the application of a me- 
diation strategy would be sufficient to 
facilitate learning of M-L pairs if over- 
learning was not employed, but Ss 
were given a long anticipation interval 
to work through the various media- 
tional linkages in List 2. 

Experiment VIII was designed to 
observe whether mediation effects 
would be observed if a mediation strat- 
egy was employed in the absence of 
both overlearning and long anticipation 
intervals. 


Method 


Experiments VI, VII, and VIII employed 
the same materials and general methods used 
in the preceding experiments. They differed 
from one another only in terms of the num- 
ber of trials given on List-1 learning and/or 
in the presentation rate used in List 2. List 2 
was learned to a criterion of one correct 
trial, but a minimum of 20 presentations were 
made in all three experiments. The same 
special mediating instructions used in Exp. 
V were presented immediately before List-2 
learning in all three experiments. 

Procedure—1. Experiment VI. List 1 
was learned to a criterion of one correct trial 
after which 30 additional overlearning trials 
were given. Both List 1 and List 2 were 
presented at a 2:2-sec. rate. 

2. Experiment VII. List 1 was learned 
to a criterion of one correct response at a 
2:2-sec. presentation rate. List 2 was pre- 
sented at a 4:2-sec. rate. 

3. Experiment VIII. List 1 was learned 
to a criterion of one correct trial. Both 
List 1 and List 2 were presented at a 2:2-sec. 
presentation rate. 
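
The procedures of Exp. II to VIII can be summarized as combinations of three factors: overlearning of List 1, the List-2 presentation rate, and mediating instructions. The sketch below tabulates them and looks up the uninstructed experiment that serves as the control for an instructed one. The class and function names are illustrative, and the settings for Exp. II are inferred from the statement that Exp. II and III differed from Exp. VI and VII only in the absence of mediating instructions.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class Design:
    overlearning: bool   # 30 additional List-1 trials after criterion
    list2_rate: str      # List-2 presentation rate in sec.
    instructions: bool   # special mediating instructions before List 2


DESIGNS = {
    "II": Design(True, "2:2", False),   # inferred, see lead-in
    "III": Design(False, "4:2", False),
    "IV": Design(True, "4:2", False),
    "V": Design(True, "4:2", True),
    "VI": Design(True, "2:2", True),
    "VII": Design(False, "4:2", True),
    "VIII": Design(False, "2:2", True),
}


def uninstructed_control(exp: str) -> Optional[str]:
    """Find the experiment that matches `exp` except for the instructions."""
    design = DESIGNS[exp]
    if not design.instructions:
        return None
    target = replace(design, instructions=False)
    for name, other in DESIGNS.items():
        if other == target:
            return name
    return None  # e.g., Exp. VIII, whose counterpart is not in this table


for exp in ("V", "VI", "VII", "VIII"):
    print(exp, uninstructed_control(exp))
```

The lookup reproduces the pairings used in the analyses of C pairs: Exp. V with IV, VI with II, and VII with III.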

Subjects—Eighteen female students were 
drawn at random from a subject pool of 
first and second year undergraduate students 
for each of the three experiments. The 
data of three Ss were replaced because they 
failed to learn the first list in less than 
60 trials. 


Results 


The average number of correct re- 
sponses made on M-L and C pairs on 
each of the first 10 trials is shown for 
Exp. VI, VII, and VIII in Panels A, 
B, and C of Fig. 4. There appears to 
have been an initial advantage asso- 
ciated with M-L pairs in all three ex- 
periments. This suggests that the in- 
structions did have a facilitating effect 
on M-L pairs, this effect being most 
pronounced in Exp. VI and VII in 
which overlearning and long anticipa- 
tion intervals were employed, respec- 
tively, and least evident in Exp. VIII 
where neither overlearning nor long 
anticipation intervals were used. 

Fig. 4. Panels A, B, and C show the average trial-by-trial 
performance on mediator-linked and control pairs during 
Trials 1-10 of Exp. VI, VII, and VIII, respectively. 

In all experiments, the advantage 
associated with M-L pairs was of short 
duration. After a small number of 
trials, the curves of both types of pairs 
converge, and in Exp. VI and VII, the 
curves for M-L and C pairs cross over 
with superior performance being ob- 
served on C pairs on later trials. In 
view of the pronounced interaction be- 
tween performance on M-L and C 
pairs suggested by Fig. 4, a trial-by- 
trial analysis of performance of Ss over 
the first 10 trials was performed after 
an arc-sine transformation of the data. 

In Exp. VI and VII, a significant 
interaction was found between per- 
formance on M-L pairs and C pairs 
and trials, F (9, 153) = 4.24, p < 
.001, and F (9, 153) = 4.76, p < .001, 
respectively. In neither experiment 
was there any evidence of a significant 
overall difference in performance be- 
tween M-L and C pairs. This is not 
surprising, for as Fig. 4 shows, the ad- 
vantage associated with M-L pairs is 
confined to early learning trials. On 
later trials, the performance level on 
M-L pairs falls below that of C pairs 
reducing the average difference be- 
tween the two types of pairs to a level 
of negligible importance. 

If performance on M-L and C pairs 
is examined prior to the point at which 
the two learning curves cross, facili- 
tated acquisition of M-L pairs is ob- 
served. In Exp. VI, significantly more 
correct responses are made to M-L 
pairs on Trials 1-5, t(17) = 2.12, p < 
.05, and in Exp. VII, significantly 
more correct responses are made to 
M-L pairs on Trials 1-4, t(17) = 
2.52, p < .05. If the number of cor- 
rect responses over the first 10 trials, 
or trials to criterion, are analyzed, how- 
ever, no difference between M-L and C 
pairs approaches significance in either 
experiment. 

Experiment VIII differed from Exp. 
VI and VII in that a trial-by-trial anal- 
ysis of performance revealed neither a 
significant main effect for the different 
types of pairs (F <1) nor a signifi- 
cant interaction between performance 
on M-L and C pairs and trials, F (9, 
153) = 1.62, p > .05. 

To ensure that differences which 
were apparent in the early trials of 
Exp. VI and VII were not due to Ss 
concentrating on the learning of M-L 
pairs at the expense of learning C 
pairs, additional analyses were carried 
out. The Ss' performance on C pairs 
during the first five trials of Exp. VI 
was compared with performance on C 
pairs on corresponding trials in Exp. 
II. A similar comparison was made 
between performance on C pairs in 
Exp. III and VII. Experiments II 
and III differed from Exp. VI and VII 
only in that mediating instructions 
were not given in the former experi- 
ments. A finding of inferior perform- 
ance on C pairs in experiments using 
mediating instructions would mean 
that the initially superior performance 
on M-L pairs in these experiments re- 
sulted from neglect of C pairs. 

There was no difference between 
Exp. II and VI in terms of the number 
of correct responses made on C pairs 
over the first five trials of List 2, 
t(17) = .21, p > .05, or with respect 
to the number of trials required to 
reach a criterion of one correct trial 
on List 1, t(17) = .19, p > .05, indi- 
cating that the two groups did not 
differ in initial learning ability. 

The comparison of Exp. III and 
VII was complicated by the fact that 
there was a significant difference in 
favor of Exp. III in the number of 
trials to criterion in first-list learning, 
t(17) = 2.47, p < .05. When first- 
list learning scores were included as 
a covariate, it was found that C pairs 
were learned significantly more rapidly 
over the first five trials in Exp. VII 
than in Exp. III, F (1, 33) = 12.06, 
p < .005. 

While the reason for finding better 
performance on C pairs in Exp. VII 
than in Exp. III is not immediately ap- 
parent, it is evident that the difference 
between M-L and C pairs in Exp. VI 
and VII did not arise because Ss 
learned M-L pairs at the expense of 
C pairs. In both Exp. VI and VII, C 
pairs were learned at least as well as 
C pairs in experiments where no medi- 
ating instructions were employed. 


CONCLUSIONS 


The conditioning model of mediation, 
originally developed by Hull (1930, 1931, 
1939), has been applied extensively as an 
interpretative device in studies of verbal 
processes employing P-A procedures 
(e.g., Horton & Kjeldergaard, 1961; 
Jenkins, 1963). The adequacy of this 
model in accounting for performance on 
P-A transfer paradigms is called into 
question by the results of the first four 
experiments reported. These data do not 
support mediation theory; but, rather, 
they add substantially to evidence 
assembled by Earhard and Mandler 
(1965a), by Carlson (1966), and by Ear- 
hard and Earhard (1967) indicating the 
importance of interference factors in 
P-A studies of mediation. In Exp. I, 
associates linked to a mediator were 
learned significantly less rapidly than 
control pairs which were unrelated. Ef- 
forts to eliminate interference and to 
facilitate acquisition of M-L pairs by 
increasing the strength of associative 
linkages through overlearning and/or by 
increasing the amount of time available 
to Ss were not successful. The over- 
learning employed in Exp. II resulted in 
the same impairment in the learning of 
M-L pairs observed in Exp. I. The in- 
creased anticipation interval used in Exp. 
III and IV did serve to diminish inter- 
ference to the degree that differences be- 
tween M-L and C pairs were not sig- 
nificant, but failed to provide any evi- 
dence of facilitation. 

The results of Exp. I to IV are par- 
ticularly interesting because the design 
of the experiments overcame a major dif- 
ficulty in interpreting the results of con- 
ventional multistage paradigms. Nor- 
mally, at the time of the test for media- 
tion, substantial numbers of necessary 
associative linkages are not available. In 
the present series of experiments, all asso- 
ciations were available at the time of the 
test for mediation. Under such circum- 
stances, the data indicate that the associa- 
tion of two unrelated verbal units to a 
mediator does not lead to facilitation, but 
to interference. 

These findings are related closely to 
recent experiments which have dealt with 
transfer from a modified double function 
P-A list (A-B, B-C, C-D, D-E, ...) 
to a serial anticipation list in which the 
pairs form an interlocking chain 
(A-B-C-D-E ...). If the learning of the 
various links of a mediation chain by 
means of the P-A method were effective 
in establishing an integrated chain of 
associations, then almost perfect per- 
formance would be expected on the serial 
anticipation list. Transfer effects are 
extremely small, however (Horowitz & 
Izawa, 1963; Primoff, 1938; Young, 
1959). Both the findings of the present 
studies and the results of P-A to serial 
anticipation transfer studies cast con- 
siderable doubt upon the applicability of 
the conditioning model of mediation to 
verbal processes in general and to P-A 
transfer designs in particular. Perhaps 
also questionable are efforts to employ 
findings based on P-A studies of media- 
tion to account for grammatical phe- 
nomena (Jenkins, 1964). 

In contrast to the data of Exp. I-IV 
which show that mediation does not fol- 
low automatically from simple condition- 
ing of associations, the data of Exp. V, 
VI, and VII show that mediation can oc- 
cur if a rule or a strategy is employed. 
This lends support to the position that 
mediation is fundamentally a conceptual 
mechanism requiring Ss to make use of a 
plan or a strategy (cf. Miller, Galanter, 
& Pribram, 1960). If the experimental 
task is based, for example, on overlearned 
linguistic relationships (Jenkins, Foss, 
& Odom, 1965) or on overlearned logical 
relationships (Mandler & Cowan, 1962), 
and Ss discover and employ these rules, 
then mediated facilitation is possible. 
If Ss fail either to discover or employ an 
adequate plan or strategy, then interfer- 
ence is likely to result. 

The failure to find evidence of media- 
tion in Exp. VIII does indicate, however, 
that the application of a rule or plan in 
itself will not guarantee mediation. The 
mediated facilitation in the present ex- 
periments occurred only if associative 
linkages were overlearned (Exp. VI), 
where Ss were provided with additional 
time during anticipation intervals (Exp. 
VII), or where both increased time and 
overlearning were employed (Exp. V). 
This suggests that if the number of asso- 
ciative linkages involved is beyond the 
span of immediate memory, as is typically 
the case when multitrial P-A procedures 
are used, then associative linkages must 
be made highly available through over- 
learning, or Ss must be provided with 
additional time to work through the 
various mediational linkages to be em- 
ployed. This fits well with a number of 
studies which have provided clear evi- 
dence of mediation under conditions 
where task complexity and memory load 
have been reduced by the use of matching 
tasks (Schulz, Weaver, & Ginsberg, 
1965) or by the use of single pairs of 
associates (Peterson & Blatner, 1963). 

The original purpose of mediation 
studies was to provide a bridge between 
simple conditioning and complex thought 
processes. The results of the present 
study, and of a substantial number of 
other experiments (Carlson, 1966; Ear- 
hard & Earhard, 1967; Earhard & Mand- 
ler, 1965a, 1965b; Jenkins & Foss, 1965; 
Mandler & Earhard, 1964) raise the ques- 
tion as to whether this purpose is served 
by investigations of P-A mediation para- 
digms. The answer would appear to be 
that it is not. The study of mediation 
paradigms has contributed little to the 
understanding of the mechanisms under- 
lying inference, grammar, or thought 
processes in general. 

The vast majority of P-A studies of 
mediation are better handled in terms of 
interference theory than in terms of a 
conditioning model of mediation. Those 
studies which are not open to interpreta- 
tion in terms of interference mechanisms 
can be accounted for by the application 
of rules or strategies of learning by Ss 
(see Earhard & Mandler, 1965a for a 
consideration of these issues). Of what 
importance, then, is the study of multi- 
stage P-A mediation paradigms? The 
answer to this question is that they can 
add to the understanding of the role 
interference and strategies play in trans- 
fer tasks, and not that they contribute to 
knowledge about how ideas become con- 
nected with one another. 

It seems clear that either radical 
changes in the conditioning model of 
mediation must be made, or else one must 
have recourse to “rules” or “strategies” 
or to some other organizational mecha- 
nism as explanatory principles. It must 
be admitted that reference to rules or 
strategies oversimplifies a very compli- 
cated issue in some respects (cf. Berlyne, 
1965). There appears good reason to 
expect, however, that an analysis of the 
development and application of such 
mechanisms will provide a more satis- 
factory advance of knowledge of concep- 
tual processes than will endeavors to 
salvage the mediation model (e.g., Hop- 
kins & Schulz, 1967; Horton & Wiley, 
1967). 

The 2nd and 3rd syllogistic figures correspond to stimulus- and re- 
sponse-equivalent mediation paradigms, respectively. It was predicted 
that increasing the associative strength between the mediating term 
and the subject and predicate of syllogisms would facilitate validity 
judgments of the S-equivalent syllogisms but interfere with R-equiva- 
lent judgments. 54 college Ss participated in a 3 × 3 × 3 × 2 design 
which showed that errors in reasoning increased as associative strength 
increased. The R-equivalent syllogisms produced more errors. Asso- 
ciation and syllogistic figure interacted as predicted. Increasing the 
amount of incidental verbiage in the syllogisms tended to improve 
reasoning, but not significantly. Variation in the time between the 2 
premises, and between the last premise and conclusion, did not in- 
fluence reasoning. 


The ability to reason deductively is 
an important component of intelligence, 
but reasoning can be influenced by 
many extralogical factors. Some of 
the general factors explored previously 
include the “atmosphere effect” (Sells, 
1936), the influence of preconceived 
belief in the conclusions (Janis & 
Frick, 1943), and the effect of emo- 
tional content on judging the validity 
of syllogisms (Lefford, 1946). 

1 This research is part of the first author’s 
Master’s thesis submitted to the Graduate 
Faculty of the University of Massachusetts, 
under the direction of the second author. 
Part of this research was supported by a 
research grant through the Faculty Research 
Council of the University. 

In a recent study, Frase (1968) in- 
vestigated some specific sequential fac- 
tors in reasoning that mediation theory 
suggested might be important. He de- 
scribed the four syllogistic figures as 
being analogous to the forward chain, 
stimulus (S) equivalence, response 
(R) equivalence, and reverse chain 
three-stage mediation paradigms. The 
first syllogistic figure has the form 
“All M are P. All S are M. There- 
fore, all S are P.” which can be dia- 
grammed as a forward chain of asso- 
ciations in which M is associated to S, 
and P is associated to M; S-M-P. 
The association is unidirectional. The 
fourth syllogistic figure has the form 
“All P are M. All M are S. There- 
fore, all S are P.” which can be dia- 
grammed as a reverse chain of associa- 
tions (P-M-S) in respect to the asso- 
ciations asserted in the conclusion that 
S must judge as being a valid or in- 
valid deduction from the premises. 
Frase predicted that the forward-chain 
paradigm (first syllogistic figure) 
would produce the least errors in rea- 
soning, and the reverse chain (fourth 
syllogistic figure) the most. In two 
studies (Frase, 1966, 1968) these pre- 
dictions were confirmed even though 
the qualifiers used in the syllogisms 
had no semantic content (they were 
letters of the alphabet)—a condition 
certainly not conducive to mediation. 
Differential facilitation of reasoning 
should be stronger when words, rather 
than abstract symbols, are used. One 
experiment has systematically com- 
bined words to produce more or less 
errors in reasoning, but no paradig- 
matic effects were under consideration 
(Frase, 1966). Early research has in- 
dicated that syllogistic reasoning is 
facilitated with common words as op- 
posed to abstract symbols or unusual 
terminology, the latter being most diffi- 
cult (Wilkins, 1928). 

In the present study the associative 
strength (Bilodeau & Howell, 1965) 
of the words used in syllogisms was 
varied using the second (S-equivalent) 
and third (R-equivalent) syllogistic 
figures. The second syllogistic figure 
has the form “All P are M. All S are 
M. Therefore, all S are P,” and can 
be diagrammed as S → M ← P. The third 
syllogistic figure has the form “All M 
are P. All M are S. Therefore, all 
S are P,” and can be diagrammed as 
S ← M → P. The assumption is that the 


association M-P (where P is associated 
to M, but not M to P) corresponds to 
the first stage of mediation, M-S to the 
second stage, and S-P to the third 
stage. Theory and research in verbal 
learning (Goss, 1961; Jenkins, 1963) 
suggest that because the third syllogis- 
tic figure requires different responses 
(S and P) to the same stimulus (M), 
it can produce interference and hence 
more errors in reasoning. These inter- 
ference effects should increase as the 
association between the mediating term 
M and the terms S and P increases, 
providing that the initial association 
between S and P is controlled. Con- 
versely, the S-equivalent (second) fig- 
ure should facilitate reasoning, and 
facilitation should increase as the asso- 
ciation between the mediating term and 
S and P increases. One purpose of 
this study was to determine if this ex- 
trapolation from verbal learning proc- 
esses to deductive judgment is war- 
ranted. 
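
The correspondence between syllogistic figures and mediation paradigms described above depends only on where the middle term M stands in each premise. A minimal sketch of that classification, with each premise written as a (subject, predicate) pair; the function and table names are illustrative:

```python
PARADIGMS = {
    1: "forward chain",
    2: "stimulus (S) equivalence",
    3: "response (R) equivalence",
    4: "reverse chain",
}

# Key: (M is subject of first premise, M is subject of second premise).
FIGURE_BY_POSITION = {
    (True, False): 1,   # M-P, S-M
    (False, False): 2,  # P-M, S-M
    (True, True): 3,    # M-P, M-S
    (False, True): 4,   # P-M, M-S
}


def syllogistic_figure(first, second, middle="M"):
    """Return the figure number for two (subject, predicate) premises."""
    if middle not in first or middle not in second:
        raise ValueError("the middle term must appear in both premises")
    return FIGURE_BY_POSITION[(first[0] == middle, second[0] == middle)]


for premises in ((("P", "M"), ("S", "M")), (("M", "P"), ("M", "S"))):
    number = syllogistic_figure(*premises)
    print(premises, number, PARADIGMS[number])
```

In the third figure M is the subject of both premises, which is the case of one stimulus paired with two different responses that the interference prediction rests on.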

Another factor explored in the pres- 
ent study was the influence of irrele- 
vant words interpolated into the prem- 
ises and conclusion of the syllogisms. 
This verbiage, while embedding the 
task in a more naturalistic setting, 
also increases the number of possible 
irrelevant associations. Jensen (1966) 
found that embedding P-A items in 
sentences (e.g., the HAT fell off the 
TABLE) reduces learning time. He 
also found that the more verbally rich 
the sentence (e.g., The red felt HAT 
silently rolled off the TABLE), the 
greater the facilitation. The studies 
of Jensen (1966) and Wilkins (1928) 
suggest that facilitative effects of addi- 
tional words might be obtained even 
though the words are irrelevant to the 
validity of the syllogism. 

Time relations also were investigated 
in the present study. Frase (1968) 
varied the overall time permitted to 
judge the validity of syllogisms and 
found that 30 sec. was optimal. In 
the present study Ss were allowed at 
least 30 sec. to make judgments, but 
the time between the two premises, and 
between premise and conclusion, was 
varied. Peterson (1965), using non- 
sense syllables in a paired-associate 
task, supported the hypothesis that 
mediation effects decrease as the delay 
between acquisition stages increases. 
It was anticipated that reasoning would 
be more influenced by interacquisition- 
stage intervals (IASI)—between Pre- 
mise 1 and Premise 2, and Premise 2 
and the conclusion—than by the over- 
all judgment time as used in Frase’s 
(1968) study. 


METHOD 


Subjects—Fifty-four introductory psy- 
chology students at the University of Massa- 
chusetts participated as a class requirement. 
An additional 18 Ss were discarded randomly 
to equalize groups. 

Test items—Of the following moods² of 
invalid syllogisms used in the experiment, 
EAI, EEI, AOI, III, IEI, all five violate 
similar logical rules across the two syllogis- 
tic figures (S- and R-equivalence). For 
each of the 10 mood-figure combinations 
under investigation, two levels of associative 
strength using words (Bilodeau & Howell, 
1965) and a third level using nonsense 
syllables (Witmer, 1935) were employed. 
Under the high-associative word conditions, 
the probability of the first element of a pre- 
mise eliciting the second element was greater 
than .10 with a mean across all moods of 
.30. The associative strength between ele- 
ments of the conclusion in the high-asso- 
ciative condition, as with the other associa- 
tive conditions, was generally zero. Prob- 
abilities of first elements of premises elicit- 
ing the second term in the low-associative 
condition generally were zero, with a mean 
across all moods of .005. In the control 
nonsense syllable condition, sets of three 
CCCs of low-associative value (p < .04) 
and no more than one letter in common with 
each other (a control for similarity) con- 
stituted the syllogistic qualifiers. There was 
a total of 30 invalid syllogisms. Twelve 
valid syllogisms of the moods EIO, OAO, 
and AOO were used as filler items. 

2 “Mood” refers to the formal logical 
words, such as “All,” “Some,” used in 
syllogisms. Propositions can be expressed 
by sentences of the form, All X are Y (an 
A proposition), No X are Y (an E proposi- 
tion), Some X are Y (an I proposition), 
and Some X are not Y (an O proposition). 
Mood AAA then indicates that both 
premises and the conclusion begin with 
“All”; mood AAE indicates that the two 
premises begin with “All,” but the conclu- 
sion must be of the form “No X are Y.” 

Verbiage was manipulated by varying the 
number of words preceding the quantifiers 
some, all, no, or some/not. The Ss in the 
low-verbiage conditions were shown in all 
figure-associations-mood combinations the 
basic logical syllogism, e.g., 


No grass is red. 
All grass are plants. 
Some plants are red. 


Four to five words preceded the basic state- 
ments in the medium-verbiage conditions: 


Henry David Thoreau surmised that no 
grass is red. 

He further recorded that all grass are 
plants. 

From this we conclude that some plants 
are red. 


Whereas, in the high-verbiage conditions, 
a string of nine to ten words was added: 


Henry David Thoreau in his leisurely 
recordings of nature surmised that no 
grass is red. 

Further observations in his isolated para- 
dise led to the recording that all grass 
are plants. 

We, after reading his works many times, 
have concluded that some plants are red. 


Materials—Each premise and conclusion 
was separately typed on 3 × 5 in. index
cards, then photographed on high-contrast 
copy film. The mounted negatives were dis-
played by a projector (Kodak Carousel)
on a standard high-contrast projection screen 
so that lettering was 14 in. high. The Ss 
were seated at an average distance of 12 ft. 
from the screen. The IASI was regulated 
by a three-speed automatic changer. Ex- 
posure time was manipulated by a solenoid 
shutter attachment wired to timers (Hunter 
Model 111-C). 

The sequence of presenting the syllogisms 
was randomized with the following restric- 
tions: within each of three blocks of 14 
syllogisms each mood appeared twice; each 
figure appeared three to four times in com- 
bination with approximately equal numbers 
of associative strength items. Each S thus 
viewed 42 syllogisms (30 invalid and 12 
valid). 

The Ss recorded their validity judgments 
on paper after each syllogism. 

Procedure—The Ss were directed to a 
standard classroom equipped with a slide 
projection screen, where alternate seats were 
taken. Pencils and answer sheets were 
placed on the desk tops. The E then read 
the standard instructions on the method for 
recording validity judgments. After two 
practice syllogisms, about which correct 
answers were not given, E asked if there 
were any further questions. The experiment
then proceeded.

The IASI between Premise 1 and Premise 
2, and Premise 2 and the conclusion was in- 
dependently varied while holding exposure 
time for each slide constant at 6.0 sec. 
The three levels of IASI were 2.0 sec., 5.0 
sec., and 10.0 sec. Ten seconds elapsed be- 
tween offset of a conclusion and the onset of 
the following syllogism to equate for judg- 
ment time in all groups. Each of the nine 
groups was run in a separate session over a 
period of 24 hr. 
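
The timing just described fixes how long one syllogism occupied. The Python sketch below works out the cycle length for each IASI level. It assumes, as the three-level Time factor suggests but the text does not state outright, that a group received the same interval at both positions; the constant and function names are illustrative only.

```python
EXPOSURE_SEC = 6.0          # exposure time for each of the three slides
INTER_SYLLOGISM_SEC = 10.0  # offset of conclusion to onset of next syllogism
N_SYLLOGISMS = 42           # 30 invalid plus 12 valid filler items


def cycle_seconds(iasi_1, iasi_2):
    """Seconds from onset of Premise 1 to onset of the next syllogism."""
    return 3 * EXPOSURE_SEC + iasi_1 + iasi_2 + INTER_SYLLOGISM_SEC


# ASSUMPTION: the same IASI is used between both pairs of slides.
for iasi in (2.0, 5.0, 10.0):
    per_item = cycle_seconds(iasi, iasi)
    session_min = per_item * N_SYLLOGISMS / 60
    print(f"IASI {iasi:4.1f} sec: {per_item:.0f} sec per syllogism, "
          f"about {session_min:.1f} min for {N_SYLLOGISMS} items")
```

Under these assumptions the presentation cycle grows from 32 to 48 sec. as the IASI increases, while the 10 sec. available for judgment after the conclusion stays the same.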

Design—A 3 (Time) × 3 (Verbiage) × 3
(Association value) × 2 (Figure) factorial
design with repeated measures on the last 
two factors was used to analyze number of 
errors. 
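
The degrees of freedom attached to the F ratios in the Results can be derived from this design. The following Python sketch assumes the usual partition for a mixed factorial in which Ss are nested within the between-S cells; the function name and dictionary keys are illustrative and do not come from the original report.

```python
def mixed_design_df(n_subjects, between_levels, within_levels):
    """Degrees of freedom (effect, error) for main effects in a mixed design.

    between_levels and within_levels map factor names to numbers of levels.
    Assumes Ss are nested in the between-S cells (split-plot ANOVA).
    """
    n_cells = 1
    for k in between_levels.values():
        n_cells *= k
    error_between = n_subjects - n_cells  # Ss within groups

    df = {}
    for name, k in between_levels.items():
        df[name] = (k - 1, error_between)
    for name, k in within_levels.items():
        # A within-S main effect is tested against Factor x Ss within groups.
        df[name] = (k - 1, (k - 1) * error_between)
    return df


df = mixed_design_df(
    n_subjects=54,
    between_levels={"Time": 3, "Verbiage": 3},
    within_levels={"Association": 3, "Figure": 2},
)
for factor, (effect_df, error_df) in df.items():
    print(f"{factor}: F({effect_df}, {error_df})")
```

With 54 Ss in nine groups the between-S error term has 45 df, which agrees with the F (2, 45), F (1, 45), and F (2, 90) values reported.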
TABLE 1
MEANS AND SDs OF ERRORS ON FIVE SYLLOGISMS FOR ALL CONDITIONS

                 S-Equivalent
                 Control         Low             High
Condition        M      SD       M      SD       M

IASI (sec.)
  2              2.06   0.80     2.50   0.99     1.67
  5              1.78   1.00     1.83   1.10     1.11
  10             1.89   0.96     2.11   1.23     1.89
Verbiage
  Low            1.94   1.21     2.61   1.09     2.11
  Medium         1.33   1.19     2.00   1.24     1.89
  High           1.72   0.70     1.83   0.92     1.39

S-Equivalent     R-Equivalent
High             Control         Low             High
SD               M      SD       M      SD       M      SD

1.02             1.94   1.06     2.56   1.58     2.89   1.32
0.83             1.83   0.92     2.33   1.41     2.61   1.33
1.23             1.67   1.37     1.83   1.10     2.56   0.98

Note.—N = 18 for each mean.


RESULTS 


Table 1 presents summary data for 
all conditions. The data reported are 
the mean number of errors made on 
the five experimental syllogisms in each 
cell of the design. There was a sig- 
nificant figure effect, F (1, 45) = 16.4, 
p < .001. The mean number of errors 
for the R-equivalent paradigm was 
2.25; for the S-equivalent paradigm it 
was 1.87. 

Fig. 1. Number of reasoning errors as a
function of meaningfulness and syllogistic
paradigm.

Associative strength was also a sig-
nificant factor, F (2, 90) = 3.84, p <
.05. When association was present,
errors increased. Error means under
the high, low, and nonsense syllable 
control conditions were 2.12, 2.19, and 
1.86, respectively. There was also a 
significant interaction between syllo- 
gistic figure and associative strength, 
F (2, 90) = 18.64, p < .001. Figure 
1 displays this interaction. It can be 
seen that interference under the R- 
equivalent paradigm (syllogistic figure 
three) was maximal when the mediat- 
ing term was highly associated with 
the subject and predicate terms of the 
syllogism. 
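
The marginal means quoted in this section can be recovered from the cell means of Table 1. The Python sketch below is only a check on the arithmetic. It assumes that the three unlabeled seven-value rows of the table carry the R-equivalent cells in the order Control, Low, High; the variable names are illustrative.

```python
from statistics import mean

# Cell means (errors out of five syllogisms) as printed in Table 1.
# S-equivalent values are taken from the three IASI rows.
s_equiv = {
    "control": [2.06, 1.78, 1.89],
    "low": [2.50, 1.83, 2.11],
    "high": [1.67, 1.11, 1.89],
}
# ASSUMPTION: the unlabeled seven-value rows hold the R-equivalent cells.
r_equiv = {
    "control": [1.94, 1.83, 1.67],
    "low": [2.56, 2.33, 1.83],
    "high": [2.89, 2.61, 2.56],
}

figure_means = {
    "S-equivalent": mean(v for cells in s_equiv.values() for v in cells),
    "R-equivalent": mean(v for cells in r_equiv.values() for v in cells),
}
association_means = {
    level: mean(s_equiv[level] + r_equiv[level]) for level in s_equiv
}

for name, value in {**figure_means, **association_means}.items():
    print(f"{name}: {value:.2f}")
```

Under that reading of the table the figure means come to 1.87 and 2.25 and the association means to 1.86, 2.19, and 2.12, the values given in the text. The interaction is visible in the same numbers: the high-associative column averages about 1.56 errors for S-equivalent items but about 2.69 for R-equivalent items.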

The verbiage factor was not signifi- 
cant, F (2, 45) = 2.42; however, there 
was a tendency for errors to increase 
with less verbiage. An average of 2.38 
errors occurred under low verbiage, 
2.03 and 1.77 under medium and high 
verbiage, respectively. 

The effect of time intervals was not 
significant, F (2, 45) = .02. 


Discussion 


The process of deductive reasoning 
proceeds, at least in part, in accordance 
with what we know about P-A tasks. 
The S-equivalent syllogisms resulted in 
significantly fewer reasoning errors than 
the R-equivalent syllogisms. The results 
of the present study also showed that
reasoning errors increase as the asso- 
ciative relation between the mediating 
term and other terms in the syllogism in- 
creases, at least for invalid syllogisms. 

The fact that there was little difference 
between the S- and R-equivalent syllog- 
isms under both the control and low-asso- 
ciative conditions confirms the results of 
Frase (1968), but the differences be-
tween the two syllogistic figures under
high-associative conditions suggest that
he might also have found differences 
with those paradigms if he had used 
meaningful words. The nature of the 
interaction between syllogistic figure and 
association indicates that competing re- 
sponses, which are required to some 
mediating verbal event, interfere with 
reasoning. 

The results of adding extraneous verbi- 
age to the syllogisms indicated a slight 
tendency for the richer verbal context to 
facilitate reasoning rather than to pro- 
duce interference, but this did not ap- 
pear to be a significant factor. The 
critical verbal units are those that Ss 
manipulate in deducing the validity of the 
conclusion, i.e., the subject, predicate,
and middle terms of the syllogism. Log- 
ically, the semantic content of these 
words is irrelevant to the validity of the 
argument; however, as this study has 
shown, these extralogical factors influ- 
ence reasoning in a systematic manner. 

In summary, the present study has 
shown that mediated associations play a 
significant role in deductive reasoning, 
and that some of our knowledge of basic 
verbal learning processes is applicable to 
the associative processes involved in
such reasoning. 
Input and output speed were investigated to determine if they met
the temporal criterion as components of learning to learn. 160 college 
students served as Ss. 16 paired-associate practice lists consisting of 
high-frequency words were presented for 2 trials each to 4 acquisition 
groups in 2 sessions a day apart. Each acquisition group received in-
put (i.e., study interval) at either a fast (2 sec.) or slow (5 sec.)
rate. Ss were required to respond to a light occurring at either a fast
(.8 sec. after the onset of anticipation interval) or slow (3 sec.) rate.
On the 3rd day each of the 4 acquisition groups was divided into 4 
parts; 1 group was changed to the conditions received by each of the 
other groups and 1 group continued under acquisition conditions. Ac- 
quisition data were consistent in showing reliable learning to learn in 
each of the 4 groups. On the Day 3 transfer task, the data showed 
that changing input and/or output speed resulted in a reliable decre-
ment in learning (ps < .01). These results supported the hypothesis
that both speed parameters are components of learning to learn. The 
findings were discussed in terms of their relationship to micromolar
theory.


The temporal criterion generally 
used to distinguish between learning 
to learn and warm-up (Irion, 1949; 
Thune, 1950) has been considered 
somewhat inadequate (e.g., Postman 
& Schwartz, 1964). Traditionally, it 
has been postulated without supporting 
data that learning to learn involves the 
acquisition of instrumental habits 
(e.g., acquiring effective techniques of
mediation) related to learning a par-
ticular task. Learning-to-learn proc-
esses are relatively permanent, whereas
warm-up refers to more transitory 
effects presumably related to perform-
ance “set.” Accordingly, improve-
ment in learning successive paired-
associate (PA) lists reflects learning
to learn only insofar as this improve-
ment persists over reasonably long
intervals of time.

1 This paper is based on a dissertation
submitted to the graduate school of the
University of New Mexico in partial fulfill-
ment of the requirements for the PhD de-
gree. The author is indebted to Frank A.
Logan for helpful suggestions and guidance
during the course of this study. This study
was conducted while the author was recipient
of a National Defense Education Act Title
4 fellowship.

2 Now at National Aeronautics and Space
Administration, Marshall Space Flight Cen-
ter, Huntsville, Alabama.

Both the development of an optimal 
rhythm for observing stimuli and the
latency of giving overt responses have 
been regarded as warm-up phenomena 
in PA learning. The rhythm of ob- 
serving stimuli depends on the speed 
of stimulus and response presenta- 
tion, and certainly the “optimal rhy- 
thm” is to assimilate items at the speed 
they are presented. The speed of giv- 
ing overt responses depends on the an- 
ticipation interval and, to some extent, 
instructions. An optimal rhythm in
this sense means that the response is 
given during the anticipation interval. 

Although study time and output rhy- 
thms have been treated as warm-up 
effects, they may just as well be con- 
sidered component processes of learn-
ing to learn. By “component proc-
esses” is meant that as learning to learn
occurs in the sense of making fewer 
errors on successive PA lists, a part of 
what is learned is a rhythm of observ- 
ing stimuli and a latency of giving overt 
responses. The temporal requirements 
of PA learning are quite unlike any 
others S is likely to have encountered, 
and he must somehow learn to accom- 
modate to them. 

Micromolar theory (Logan, 1956) 
contends that quantitative variations in 
either of the speeds define different re- 
sponses. Though assimilation speed 
(study time) has not previously been 
identified as a learnable component of 
a molar response, a logical extension 
of the theory is that if the response of 
assimilating input is practiced at dif- 
ferent speeds by two groups, then each 
group will learn to assimilate at a par- 
ticular speed and subsequently assimi- 
late best at that same speed. The same 
prediction holds for changing response 
latencies, though it should be noted 
that one is here dealing with a more 
general effect than has previously been 
considered. The contention in the 
learning-to-learn context is not that a 
particular list is assimilated and repro- 
duced at learned speeds but that gen- 
eral habits, with respect to input and 
output speeds, may be developed so 
that learning new material is best if 
those habits are appropriate. 

Speed of input (presentation rate) 
has been varied in a number of studies 
(e.g., Nodine, 1965; Nodine, Nodine,
& Thomas, 1967), but no efforts were 
made to provide sustained and/or con- 
sistent practice at a specific input speed. 
Since these studies were not designed 
to study learning to learn, only a few 
lists were presented with a minimal 
number of items. The S was not given 
sufficient practice to determine whether 
general habits with respect to assimila- 
tion or output rates were acquired. 

JON G. ROGERS

Although several studies have re-
ported latency (e.g., Suppes, Groen,
& Schlag-Rey, 1966) or response speed 
(Shiffrin & Logan, 1965) in PA learn- 
ing, these studies were not addressed 
to the question of whether output speed 
is a component of learning to learn.
To answer this question, a general re- 
sponse speed factor must be separated 
from response speed as related to spe- 
cific items in a list. 

The purpose of this experiment was 
to determine if speed of assimilating 
input and speed of producing responses 
are components of learning to learn. 
Groups of Ss learning with different in- 
put and output speeds were given a 
number of practice lists to insure 
acquisition of components on a two- 
trial PA task. At least 12 hr. after 
being exposed to these lists, experi- 
mental groups were given three more 
lists with one or both of the rate para- 
meters changed. It was predicted that 
if the rate parameters were components 
of learning to learn, a decrement in 
performance would be manifested. If 
the rate parameters were warm-up 
phenomena, on the other hand, they 
should be lost within 60 min. after the 
experiment (Hamilton, 1950), and 
consequently no performance decre- 
ment should be obtained. 


METHOD 


Subjects—The Ss were 160 undergrad- 
uates from an introductory psychology 
course, randomly assigned in order of ap-
pearance to each of 16 groups.

Design—The design was a 2⁴ × 3 factorial
design with repeated measurements. The 
levels of the variables were as follows: ac- 
quisition input speed (fast and slow), acqui- 
sition output latency (fast and slow), trans- 
fer input speed (fast and slow), and trans- 
fer output latency (fast and slow). Three 
transfer lists were used as the repeated mea- 
sure with each S in the transfer conditions. 
The design also involved a layering pro- 
cedure. One S from each cell was com- 
bined into a layer, resulting in a total of 
10 layers. Each layer received a different
set of 22 lists. 
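
The cell structure of this design can be enumerated directly. The Python sketch below lists the cells formed by the four two-level speed factors and counts the Ss implied by 10 layers; the factor labels and helper name are illustrative rather than taken from the original.

```python
from itertools import product

SPEEDS = ("fast", "slow")
FACTORS = ("acq_input", "acq_output", "transfer_input", "transfer_output")
N_LAYERS = 10          # one S per cell in each layer
N_TRANSFER_LISTS = 3   # repeated measure

# Every combination of the four two-level factors is one cell.
cells = [dict(zip(FACTORS, combo)) for combo in product(SPEEDS, repeat=len(FACTORS))]


def changed(cell):
    """Report which speed components differ between acquisition and transfer."""
    return {
        "input": cell["acq_input"] != cell["transfer_input"],
        "output": cell["acq_output"] != cell["transfer_output"],
    }


n_subjects = len(cells) * N_LAYERS
unchanged_cells = [c for c in cells if not any(changed(c).values())]
n_unchanged_subjects = len(unchanged_cells) * N_LAYERS

print(len(cells), "cells,", n_subjects, "Ss")
print(len(unchanged_cells), "unchanged cells,", n_unchanged_subjects, "control Ss")
print(n_subjects * N_TRANSFER_LISTS, "transfer-list scores")
```

The enumeration gives 16 cells and 160 Ss, of which the four cells with identical acquisition and transfer speeds supply 40 unchanged control Ss.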
Material—A total of 440 high-frequency
words were selected from Thorndike-Lorge 
(1944) lists by a semirandom procedure. 
Proper nouns and contractions were ex- 
cluded. A semirandom procedure was also 
used in combining the 440 words into 220 
pairs. The procedure was not completely 
random in that obvious first order associa- 
tions and words beginning with the same 
first letter were excluded. From these 
220 pairs, 22 lists of 10 pairs each were 
formed. Treating each pair as a unit within 
a list, the units were randomly rearranged 
for the second trial order. The 220 pairs 
were divided at random into lists nine more 
times, so that there were 10 sets of 22 lists. 
Since the performance of Ss across days was 
to be examined to determine loss of warm-up 
and amount of learning to learn, an attempt 
was made to avoid any performance fluctua- 
tions attributable to the composition of a 
specific list. By having 10 sets of lists, 
idiosyncrasies of particular lists were aver- 
aged out. The 10 sets constituted the layers 
of the factorial design with 1 S from each cell 
in each layer. The first 16 Ss constituted the 
first layer, and each S was randomly assigned 
to one of the cells. The Ss in this first layer 
received the first set of 22 lists according to 
the conditions of the cell to which they were 
assigned. 
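
The list-construction steps above can be sketched in Python. This is an illustration under stated assumptions and not the procedure actually used: the word pool is a set of placeholder strings, only the same-first-letter exclusion is modeled (screening out obvious first-order associations requires judgment), and the greedy pairing with restarts is one convenient way to satisfy that constraint.

```python
import random


def build_lists(words, n_lists=22, pairs_per_list=10, seed=0):
    """Pair words at random, rejecting same-first-letter pairs, then cut into lists."""
    n_pairs = n_lists * pairs_per_list
    if len(words) != 2 * n_pairs:
        raise ValueError("word pool must hold exactly two words per pair")
    rng = random.Random(seed)
    while True:  # restart with a fresh shuffle if the pairing reaches a dead end
        pool = list(words)
        rng.shuffle(pool)
        pairs = []
        while pool:
            stimulus = pool.pop()
            # First remaining word that does not share the stimulus's first letter.
            idx = next((i for i, w in enumerate(pool) if w[0] != stimulus[0]), None)
            if idx is None:
                break
            pairs.append((stimulus, pool.pop(idx)))
        if len(pairs) == n_pairs:
            break
    lists = [pairs[i:i + pairs_per_list] for i in range(0, n_pairs, pairs_per_list)]
    # Second-trial order: the same pairs, randomly rearranged within each list.
    trial_2 = [rng.sample(lst, len(lst)) for lst in lists]
    return lists, trial_2


# HYPOTHETICAL word pool: 440 placeholder items, 22 for each of 20 initial letters.
words = [f"{chr(97 + i % 20)}{i:03d}" for i in range(440)]
lists, trial_2 = build_lists(words)
print(len(lists), "lists of", len(lists[0]), "pairs")
```

Calling `build_lists` with ten different seeds would give ten sets of 22 lists. Note that this re-pairs the words on each call, whereas the text describes redistributing the same 220 pairs, so the sketch is a simplification on that point.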

Apparatus—The apparatus was a Stowe 
memory drum with the light connected to 
Hunter timers. The light in the memory
drum served as the cue to respond, and was 
wired to come on .8 sec. after the onset of 
the anticipation interval for the fast group 
and 3 sec. after the onset of the anticipation 
interval for the slow group. The light re-
mained on for 1 sec. for both groups.

After the 5-sec. anticipation interval that 
was the same for all groups, the input inter- 
val or the stimulus-response interval was 
either 2 sec. for the fast group or 5 sec. 
for the slow input group (see Fig. 1). 

Procedure—There were six groups of Ss 
in original learning: two slow input groups 
and four fast input groups. Two of the fast 
input groups were trained with short inter- 
trial intervals (ITI) and two with long 
ITIs. It is possible that differences between 
the fast input and the slow input groups 
were because the slow input groups had a 
longer total learning time (110 sec/trial) 
than the fast input (80 sec/trial) groups. 
To determine if total learning time pro- 
duces an effect in this type of task, the fast 
input groups were initially divided into two 
parts, with half of each group receiving a 
10-sec. ITI and the other half receiving a 
40-sec. ITI. This effectively lengthened the 
total learning time for the fast groups to that 
of the slow input groups. The ITI for both
slow groups was 10 sec. 
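
The total learning times quoted above can be reproduced with a short calculation. The Python sketch below takes one reading of the figures, namely that a trial consists of 10 pairs, each with a 5-sec. anticipation interval and its input interval, plus the ITI; the names are illustrative.

```python
ANTICIPATION_SEC = 5  # same for all groups
N_PAIRS = 10          # items per list


def trial_seconds(input_sec, iti_sec):
    """Total time for one trial through a list, including the intertrial interval.

    ASSUMPTION: the reported sec/trial figures include the ITI.
    """
    return N_PAIRS * (ANTICIPATION_SEC + input_sec) + iti_sec


print("fast input, 10-sec. ITI:", trial_seconds(2, 10))
print("slow input, 10-sec. ITI:", trial_seconds(5, 10))
print("fast input, 40-sec. ITI:", trial_seconds(2, 40))
```

On this reading the fast and slow input groups come to 80 and 110 sec. per trial, and the 40-sec. ITI brings the fast input groups up to 110 sec. as well.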

Each S was given two training sessions 
from 12 to 24 hr. apart before the test 
task. Each training session consisted of 
eight lists of 10 items each. Each list was 
presented for two trials. On the third day, 
after combining the two ITI subgroups 
within each of the fast input groups, each 
training group was divided into fourths and 
three more lists were presented for two trials 
each. One-fourth of each group continued 
under the acquisition conditions and one- 
fourth was switched to each of the other 
three conditions. A short ITI (10 sec.)
was used with all groups on Day 3.

Fig. 1. Time intervals for slow output, slow input group.

Though it was predicted that changing 
speeds would initially result in a perfor- 
mance decrement, it is likely that S would 
gradually adapt to the new conditions. To 
determine the relative independence of acqui- 
sition habits, the changed groups were re- 
turned to their practice conditions after the 
three test-task lists and exposed to three
more lists. 

Instructions were given prior to each ses- 
sion. The instructions were quite specific 
to the task in that they informed S how much 
time would elapse after the onset of the an- 
ticipation interval before he could respond 
on Trial 2 and how long the stimulus-re- 
sponse interval would be. The S was in- 
structed that only responses occurring when 
the light was on would be considered cor- 
rect. Day 3 instructions specifically in- 
formed S that the time interval(s) were 
different from the previous 2 days. 

Since the scoring involved E’s judgment 
about whether the response occurred within 
the light-on interval, all of the Day 3 ses- 
sions were recorded and the tapes were 
scored by an independent scorer. All inter- 
val changes were audible on the tapes. 
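
The scoring rule can be stated compactly in code. The Python sketch below classifies a single Trial 2 response from its latency, using the cue onsets and the 1-sec. light-on period given under Apparatus. The category labels, the treatment of the exact boundaries as correct, and the function name are assumptions made for illustration.

```python
LIGHT_ON_SEC = 1.0
CUE_ONSET_SEC = {"fast": 0.8, "slow": 3.0}  # after onset of anticipation interval


def score_response(latency, output_speed, is_correct_word=True):
    """Classify one response; latency is in sec. from anticipation-interval onset.

    latency is None when S gave no response.
    ASSUMPTION: responses exactly at light onset or offset count as correct.
    """
    if latency is None:
        return "omission"
    if not is_correct_word:
        return "response error"
    onset = CUE_ONSET_SEC[output_speed]
    if latency < onset:
        return "correct too soon"
    if latency > onset + LIGHT_ON_SEC:
        return "correct late"
    return "correct"


print(score_response(1.2, "fast"))  # inside the .8 to 1.8 sec. window
print(score_response(2.5, "fast"))  # after the light has gone off
print(score_response(2.5, "slow"))  # before the light comes on at 3 sec.
```

The same response latency can therefore be scored as late under the fast output condition and as too soon under the slow one.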


RESULTS 
Acquisition 


Figure 2 shows Day 1 and Day 2 
performance for the four acquisition 
groups. The average score on Trial 2
of List 1 of Day 1 was 1.75, with the 
groups relatively homogeneous at this 
point. By List 8 of Day 1, the average 
performance was 4.47. At the end of 
Day 1 a relatively consistent separa- 
tion was seen between the 3-sec. and 
.8-sec. output groups; the former per-
formed somewhat better than the lat-
ter. The average performance across
groups at the beginning of Day 2, List 
9 was 3.21; it increased to 5.49 by 
List 16. For the groups continuing 
under their acquisition conditions on 
Day 3, the average score for List 17 
was 4.88, and for List 19 was 5.25. 
To determine if there was a signif- 
icant amount of learning to learn be- 
tween Day 1, Day 2, and Day 3, t tests 
were used to compare the average per- 
formance on the first list of Day 1 
with the average performance on the 
first list of Day 2. The average per- 
formance for the unchanged control 
Ss was compared on List 9, Day 2 and 
List 17, Day 3. Both t tests were sig-
nificant, t(159) = 10.95, p < .01;
t(39) = 7.18, p < .01, respectively.
Since warm-up loss should be maximal
between the last list of each day and
the first list of the following day, aver-
age performance was compared be-
tween List 8 of Day 1 and List 9 of
Day 2, t(159) = 11.14, p < .01, and
for the unchanged groups, between the
last list of Day 2 and the first list of
Day 3, t(39) = 3.70, p < .01. In each
case the amount of learning to learn
and the amount of warm-up loss were
significant.

Fig. 2. Day 1 and Day 2 performance for the four acquisition groups.

The acquisition data for the short 
and long intertrial intervals showed 
that no performance effects could be 
attributed to this temporal factor. 


Transfer 


Figure 3 shows Day 3 transfer data 
for each of the training groups. It is 
evident that a performance decrement 
was obtained from changing input 
speed or output speed. There is an 
indication of some negative transfer 
since all but two of the changed groups 
performed at a lower level on the first 
list of Day 3 than the average per- 
formance on Day 1, List 1. Points 
were also plotted on these figures show- 
ing average performance on List 15 
and List 16 of Day 2. 

The analysis of variance showed that 
the most powerful effect resulted from 
changing output speed, F (1, 144) = 
165.75, p < .01. The performance dec-
rement obtained from changing input
speeds was also reliable, F (1, 144) =
70.96, p < .01. The slight negative
transfer effect noted above was not
persistent since there was reliable im-
provement across the three transfer
lists, F (2, 288) = 25.61, p < .01.
One four-way interaction (Training
Input × Transfer Input × Training
Output × Transfer Output) was sig-
nificant, F (1, 144) = 70.15, p < .01.
This effect seemingly implied that the
Training Input × Transfer Input
interaction depended on levels of the
Training Output × Transfer Output
interaction. This interaction could re-
sult from some groups not showing as
great a decrement as others in transfer.
It was not possible, however, to give a
meaningful interpretation of this effect. 

Figure 3 also shows points that give 
the results (average performance 
across two lists) of changing back to 
acquisition conditions. Though there 
was some evidence of a slight negative 
transfer effect, there was a relatively 
rapid shift back to a performance level 
attained in original acquisition.
Since there was a relatively rapid shift 
back to the practice level, the habits 
could be assumed to have been inde- 
pendent. 

Of the 4,800 responses on Day 3,
4,796 were scored identically to E’s by
the independent scorer. Four re- 
sponses were given so loudly it was 
impossible to hear the sound of the 
light going off. These responses were 
scored correct, which was against the 
prediction. 
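
The response count and the level of scorer agreement follow from figures already given. The short Python check below assumes that the 4,800 responses are the Trial 2 anticipations of all 160 Ss on the three 10-pair transfer lists; the variable names are illustrative.

```python
N_SUBJECTS = 160
N_TRANSFER_LISTS = 3
N_PAIRS = 10

# ASSUMPTION: one scored response per pair, on Trial 2 of each transfer list.
total_responses = N_SUBJECTS * N_TRANSFER_LISTS * N_PAIRS
agreed = 4796
agreement_pct = 100 * agreed / total_responses

print(total_responses, "responses;", f"{agreement_pct:.2f}% scored identically")
```

The four disputed responses thus amount to less than one-tenth of 1% of the Day 3 data.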


Error Analysis Acquisition 


A complete error analysis was made 
on both the acquisition and transfer 
data. The major error type for all ac- 
quisition groups was omissions. Be- 
ginning as low as 67% for the 3-sec.- 
output-2-sec. input (3-2) group and 
as high as 84% for the .8-2 group,
the number of omissions reached an 
average of 90% for all groups quite 
rapidly. Overt errors were most similar 
within output groups. The largest 
error type for both fast output groups 
(.8-2 and .8-5) was correct responses 
occurring after the light was off. 
Though both of the slow output groups 
(3-2 and 3-5) showed an initial 
tendency to respond too soon, this 
error type diminished very rapidly;
extraneous response errors were con- 
sistently the more frequent overt errors 
for the slow output groups. All overt 
error types, however, decreased across 
the course of learning to learn as the
percentage of omissions increased. This
implied that Ss were learning to re- 
spond within the temporal require- 
ments of the task. Response intrusions 
were a relatively consistent error in the 
3-2 groups. Table 1 gives the error 
types made by each acquisition group 
averaged across the last three lists of 
Day 2. 


Error Analysis Transfer 


Output changed—Table 1 also
shows the error types made during
transfer averaged across three lists.
When output speed was faster, most
of the overt errors were either correct
late responses or intrusions. When the
output speed was slower, the error
types were dependent on the particular 
acquisition conditions. When changed 
from .8-2 to 3-2, 70% of the errors
were omissions, compared to 51% for 
the .8-5, 3-5 group. When changed to 
a slower output, the largest overt error 
type was correct responses occurring 
too soon. This type of error was con-
siderably more frequent, however, in
the .8-5, 3-5 group. The .8-2, 3-2
group made more response errors 
(39%) than the .8-5, 3-5 group 
(11%). 
Input changed—When the input
was faster, error type was again de- 
pendent on acquisition condition. 
Slightly more omission errors were 
made by the .8-5, .8-2 group than by
the 3-5, 3-2 group. For the .8-5, .8-2
group, by far the largest overt error 
was correct late responses. When in- 
put time was lengthened, the overt 
errors were dependent on acquisition 
conditions. The .8-2, .8-5 group pro- 
duced a considerably larger percentage 
of correct late responses than did the 
3-2, 3-5 group, which produced more 
response errors. 

Input and output changed—When 
both intervals were lengthened, correct 
responses occurring too soon consti- 
tuted the largest overt error group. 
When both intervals were shortened, 
correct late responses accounted for the 
largest number of overt errors. When 
the interval change was mixed (one 
interval faster and the other slower), 
the pattern of errors conformed, for 
the most part, to groups in which the 
output shift was similar. 


Discussion 


The results support the hypothesis that 
input and output speeds are components 
of learning to learn. As S learns how 
to learn paired associates in this para- 
digm, he is learning a speed of assimila- 
tion and a speed of emitting overt re- 
sponses. This learning is not “list spe- 
cific” since it is not confined to a partic- 
ular list of words; rather, it is operative 
when new lists are acquired. These find- 
ings support the micromolar contention 
that learning is best at practiced speeds 
and that changing speed components in 
either direction makes the task more dif- 
ficult. Exposure to different values of 
speed components in a new task effec- 
tively requires the learning of quantita- 
tively different microspeed responses. 

The magnitude of the obtained decre- 
ment suggested that a major part of per- 
formance in this task involved the optimal 
use of pacing techniques. Additionally, 
these pacing components were relatively 
persistent. Exposure to the three trans- 
fer lists did not substantially interfere 
with S’s ability to perform at the prac- 
ticed intervals when changed back. 


The fact that changing to a faster speed 
component increased task difficulty has 
a plausible explanation. Performance 
decrements resulting from being changed 
to a slower speed, on the other hand, 
are more difficult to handle. The require- 
ments of the new task may have been 
different, depending on the direction of 
the change. 

First, changing to a faster output speed 
required S to give the response sooner 
than practiced. During the anticipation 
interval, S had to search, find, perhaps
hold, and then make the response within 
the light-on interval. Requiring S to 
produce the correct response sooner could 
interfere with learned techniques of 
eliciting associations, and, as a result, 
disrupt recall. One might, then, predict 
a high frequency of correct late re- 
sponses; in fact, this was the predom- 
inant overt error type. Furthermore, if 
the light came on sooner than in train- 
ing it might distract S in the recall 
process. This could result in an increase 
in response-type overt errors, i.e., re-
sponse intrusions, stimulus intrusions,
and/or extraneous responses (importa- 
tions). Again, there were a number of 
these error types. After the Day 3 ses- 
sion, all of the changed groups were asked 
if they felt that the change in the time 
interval(s) affected their performance, 
and if so, how. The most frequent re- 
sponse given by the groups changed to 
a faster response time was that they could 
not think of the word soon enough, and, 
secondly, that the light coming on so 
much sooner was annoying. Changing 
to a faster input interval could be viewed 
as making the task more difficult since 
there was less time for rehearsal. The 
overt error analysis for these groups 
indicated that the correct associations 
were perhaps weaker since there were a 
number of correct responses occurring too 
late to be considered correct and also a 
number of intrusions. 

Second, performance decrements ob- 
tained from changing to slower speeds 
may have resulted from disrupting 
learned pacing techniques or paced proc-
esses. It seems unlikely that slowing
output would make it more difficult for 
S to search and find the correct response, 
since more time was available for these
processes. The interval change might
have its detrimental effect by requiring
S to hold the correct response longer.
In transfer, the fast output groups were 
required to hold the correct response 
2 sec. or more (longer than in training). 
Apparently this was a difficult require-
ment, since the largest overt error cate-
gory was correct responses occurring too
soon. Almost all of the replies about the 
effect of this interval change were related 
to a fear of forgetting the word. Some 
Ss said they found themselves directing 
their attention to the light, and by the 
time it came on they had forgotten the 
word. 

Decrements occurring from lengthen- 
ing input time may also have resulted 
from disrupting stable pacing behavior. 
The groups that received the fast input 
interval in training may have developed 
a stable technique of associating the 
stimuli paced so that it could be em- 
ployed within the interval. If this were 
the case, then changing the speed could 
interfere with this technique. The in- 
crease in late correct responses and both 
types of response errors suggested that 
Ss’ pacing was disrupted. The most 
frequent replies to the posttest question 
by these groups were that the interval 
was too slow and that their minds wan- 
dered. 

On the basis of the total time hypothe-
sis, it might have been expected that
changing groups to a longer input inter- 
val would improve performance. Further, 
one might also predict from this hypothe- 
sis that there would be a separation be- 
tween the fast and slow input groups 
during training. Since Ss were given 
only two trials on each list in this experi- 
ment, the prediction would not be ap- 
propriate. The separation in perform- 
ance favoring longer input intervals (e.g.,
Murdock, 1960) may appear at a later
stage of list learning.

This extension of micromolar theory 
has certain implications in the search for 
“optimal” temporal units (e.g., Nodine,
Nodine, & Thomas, 1967). In the pres-
ent study, one might contend that the
practiced units, regardless of length, were
optimal to Ss practicing them since Ss
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performed best at these intervals. The 
question of optimal temporal units, at 
least for this type of paradigm, would 
perhaps best be asked in terms of ease 
of acquisition of particular temporal com- 
ponents. In any event, consideration 
should be given to the finding that Ss 
perform best at practiced intervals, re- 
gardless of any superiority in perform- 
ance among particular groups. 
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STIMULUS SELECTION AND MEANINGFULNESS IN 
PAIRED-ASSOCIATE LEARNING WITH STIMULUS
ITEMS OF HIGH FORMAL SIMILARITY

Ss learned a paired-associate list at 1 of 3 levels of stimulus meaning-
fulness (SM). The stimulus items were 8 CVC trigrams constructed
of all possible combinations of only 2 different letters in each position.
Thus, within each list, a given pair of stimulus items had either 0, 1,
or 2 letters in common. Under these conditions, the extent to which
a response is elicited by a fractional component of the stimulus is
reflected in the distribution of response errors. The results confirm
the prediction that stimulus selection increases as SM decreases.


The term stimulus selection refers 
to the association of a response with 
some fractional component of the given 
or nominal stimulus. Underwood’s 
(1963) analysis of stimulus selection 
is closely related to the principles 
underlying the two-process models de- 
veloped by Atkinson (1960) to account 
for certain types of discrimination be- 
havior. This model defines two per- 
ceptual states: one in which S responds 
to the pattern of stimulation and one 
in which he responds to the separate 
components of the stimulus. The 
former state corresponds to the stim- 
ulus item functioning as an integrated 
unit, the latter to Underwood’s notion 
of stimulus selection. 

In the Atkinson model, the percept- 
ual state adopted by S and his change 
from one state to another is related di- 
rectly to a specified schedule of rein- 
forcement. In the typical paired-asso- 
ciate (PA) verbal learning task, how- 
ever, the reinforcement schedule with 
respect to the pattern-component dis- 
tinction is unspecified. Under condi- 
tions of low formal stimulus similarity
(FSS) the reinforcement of a partic-
ular response occurs irrespective of
whether the response is elicited by the
entire (nominal) stimulus or by some
structural component of it. In this 
situation the state adopted (pattern or 
component) is presumably governed 
by other properties of the stimulus 
items. One such property which is 
likely to be of importance is the extent 
to which prior learning has established 
the stimulus element as an integrated 
unit. Murdock (1961), e.g., has
demonstrated that a CCC trigram func- 
tions in short-term memory as three 
distinct units whereas a highly inte- 
grated item such as a CVC word func- 
tions as a single unit. 

Stimulus meaningfulness (SM) and
pronounceability may be regarded as
indexes of stimulus integration. Under-
wood (1963) has predicted that stim-
ulus selection varies inversely with SM
and Postman and Greenbloom (1967)
have shown that it varies inversely
with stimulus pronounceability. Stim-
ulus integration may, of course, de-
velop within the training session itself
and thus stimulus selection might be
expected to decrease late in learning.
Such a result was obtained by James
and Greeno (1967) who found stim-
ulus selection to be less during over-
training trials.

Under conditions of high FSS, rein-
forcement ceases to be independent of
the degree of stimulus selection. Stim-
ulus selection will then result in a fail-
ure to discriminate within a certain
subset of the stimulus items, and dis-
crimination across the entire set of
stimuli will require that each item be
learned as an integrated unit. This
has led Underwood (1963) to predict
that with high FSS the effects of
variation in SM should be more pro-
nounced than with items of low FSS.
That is, an interaction FSS x SM is
predicted.

This prediction is not supported by 
the available evidence. Although in 
Exp. IA and IB of Goss, Nodine, 
Gregory, Taub, and Kennedy (1962) 
a significant interaction was obtained 
between two levels of SM and of 
FSS, the form of this interaction is 
opposite to that predicted by Under- 
wood. Considering the four combina- 
tions of high and low SM with high 
and low FSS, it is clear that the inter- 
action is a consequence of the differ- 
ence between high and low FSS condi- 
tions being greater for the high than 
for the low SM condition. After re- 
viewing the literature, Goss and Nodine 
(1965, p. 148) concluded that “there 
is no convincing evidence of inter- 
actions of similarity and meaningful- 
ness of stimulus members.” 

The difficulty in interpreting this 
negative evidence in relation to Under- 
wood’s formulation is that it does not 
permit an evaluation of the stimulus 
selection hypothesis along with the hy- 
pothesized differential role of stimulus 
selection with changes in SM. The 
present study provides a test of these 
hypotheses under conditions where
stimulus selection is related directly to
the effects of FSS.
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METHOD 


Construction of Stimulus Items 


The present experiment is based on the 
use of stimulus items generated according 
to the method suggested by Shepard (1963).
A list of eight high similarity items is gen- 
erated by regarding the positions of the 
trigram as three binary elements. If, e.g., 
{F, R} {A, E} {V, S} are, respectively, the 
possible values of the initial consonant, the 
vowel, and the final consonant of a CVC 
syllable, then the eight CVCs generated are 
FAS, FAV, FES, FEV, RAS, RAV, RES, 
and REV. These trigrams may be grouped 
pairwise into three classes as a function of 
the number of letters a given pair has in 
common, i.e., in terms of their degree of
formal similarity. Thus FAS, FAV repre- 
sents a pair with common values on two of 
the three dimensions while FAS, REV is a 
pair with no common values. For the set 
of eight elements, the number of distinct 
pairs with 2, 1, and 0 common values will 
be 12, 12, and 4, respectively. Suppose 
that eight CVC trigrams generated in this 
fashion are paired as stimulus elements with 
a set of eight response members (say the 
numbers 1-8) and presented as a PA list 
to be learned. A response item which be- 
longs to this set but which is given by S 
as the response to an incorrect stimulus 
will be termed a “stimulus confusion” 
error. Such a response reflects either guess- 
ing or a failure to discriminate between the 
two stimuli involved (i.e., the stimulus to
which the response is given and the stimulus 
for which this response is correct). Such 
errors may be classified into three disjoint 
sets corresponding to the three classes of 
stimulus pairs defined previously in terms 
of their degree of formal similarity. Inso- 
far as stimulus selection occurs, confusion 
errors will not be randomly distributed 
across these three classes. If, e.g., a single
letter operates as the functional stimulus,
then the result will be a smaller than chance
number of errors corresponding to a con-
fusion of stimuli with no letters in common.
In general, stimulus selection will result in 
confusion errors, the relative frequency of 
which will be a direct function of the de- 
gree of FSS. If stimulus selection increases 
as SM decreases, then the strength of this 
relationship will increase as SM decreases. 
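
The construction described above can be checked with a short enumeration. The following is only an illustrative sketch, not part of the original procedure: the function names `build_items` and `common_letters` are hypothetical, and the letter sets are those of the F, R / A, E / V, S example in the text.

```python
from collections import Counter
from itertools import combinations, product


def build_items(initials, vowels, finals):
    """Return every CVC formed from two letters per position (2 x 2 x 2 = 8)."""
    return ["".join(letters) for letters in product(initials, vowels, finals)]


def common_letters(a, b):
    """Degree of formal similarity: letters shared in the same position."""
    return sum(x == y for x, y in zip(a, b))


# Letter sets taken from the example in the text.
items = build_items("FR", "AE", "SV")

# Count the distinct stimulus pairs in each similarity class.
pair_counts = Counter(common_letters(a, b) for a, b in combinations(items, 2))

print(items)
print(dict(pair_counts))
```

Under these assumptions the 28 distinct pairs fall into the three classes in the proportions stated in the text (12 pairs with two common letters, 12 with one, and 4 with none).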

Lists—Unmixed lists of CVC stimulus 
items at three levels of meaningfulness were 
constructed according to the binary prin- 
ciple outlined above and are set out in 
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Table 1. List 1 comprises the eight words 
given by Shepard (1963) and which, in- 
cidentally, is the only satisfactory list of 
eight CVC words that can be generated 
in this fashion. Lists 2 and 3 comprise
eight CVC nonwords of high m’ (Noble, 
1961) and low m’, respectively. For all 
lists, the response items were the digits 
1-8 paired randomly with the stimulus 
items. 

Procedure—The Ss were allocated to one
of the three lists according to a random
schedule. Each S was tested individually
and given standard anticipation format PA 
instructions except that additional stress 
was placed on the desirability of their guess- 
ing when uncertain of the correct response. 
The lists were presented on a Stowe memory 
drum at a 2:2-sec. rate with an intertrial 
interval of 4 sec. To minimize serial learn- 
ing effects, each list was presented in four 
different random orders. The Ss were run 
to a criterion of two consecutive error-free 
trials or to a maximum of 60 trials.

Subjects—The Ss were 27 college stud- 
ents (9 per list) enrolled in an introductory 
psychology course at Pennsylvania State 
University. They received additional grade 
points for their participation in the experi- 
ment. 


RESULTS


Confusion errors as a function of 
SM and FSS.—Of the 27 Ss tested all 
but 2 reached criterion within 60 trials. 
One S from each of the high- and low-
m’ CVC groups was still making
errors at the sixtieth trial and was 
stopped at that point. The number 
of confusion errors in each of the three 
classes was obtained for all Ss. Since 
in each of the two classes correspond- 
ing to one and two letters in common 
there are three times as many potential 


TABLE 1
STIMULUS ITEMS

                 Initial                 Final
List             Consonant    Vowel      Consonant    Mean m’*

1 (words)        B, R         A, U       G, T         3.99
2 (high m’)      F, R         A, E       S, V         2.89
3 (low m’)       X, Z         A, E       H, J         0.73

*See Noble, 1961.


TABLE 2

MEAN CONFUSION ERROR SCORES AS A
FUNCTION OF FSS AT THREE LEVELS
OF MEANINGFULNESS

             Degree of FSS (No. of Common Letters)

List             0          1          2

Words           9.56       7.78       9.78
High m’        13.78      11.11      23.22
Low m’         13.89      17.67      30.30

errors as there are in the class cor- 
responding to zero letters in common, 
error scores in the former two classes 
were divided by three. Thus, the final 
error score represents the ratio of num- 
ber of errors to the number of poten- 
tial errors per item in a given class. 
The mean error scores for the three 
levels of meaningfulness are presented 
in Table 2. 
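
The correction for unequal opportunity can be sketched as follows. This is an illustration only: the raw counts in `raw_errors` are hypothetical and are not data from the experiment, and the helper names are assumptions introduced here.

```python
from itertools import product


def common_letters(a, b):
    """Number of letters two CVCs share in the same position."""
    return sum(x == y for x, y in zip(a, b))


def potential_errors_per_item(items):
    """For one stimulus, count the other stimuli in each similarity class."""
    target, others = items[0], items[1:]
    counts = {0: 0, 1: 0, 2: 0}
    for other in others:
        counts[common_letters(target, other)] += 1
    return counts


items = ["".join(t) for t in product("FR", "AE", "SV")]
potential = potential_errors_per_item(items)

# Hypothetical raw confusion-error totals for one S (not data from the study).
raw_errors = {0: 10, 1: 24, 2: 45}

# Divide by the number of potential errors per item in each class, so that
# the one- and two-letter classes are divided by three as in the text.
error_scores = {k: raw_errors[k] / potential[k] for k in raw_errors}

print(potential)
print(error_scores)
```

For any one stimulus there are three other stimuli sharing two letters, three sharing one, and a single stimulus sharing none, which is the source of the divisor of three.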

Relevant to the hypotheses under 
consideration are the within-Ss vari- 
ance components for the FSS effect, 
F (2, 48) = 28.0, p < .01, and the
interaction SM x FSS, F (4, 48) =
6.9, p < .01. The significant inter-
action makes it necessary to evaluate
the significance of the FSS effect at
each level of SM. With words as
stimulus items, the FSS effect is not
significant (F < 1), while with CVC
nonwords of high m’, F (2, 48) =
14.6, and for low m’, F (2, 48) = 26.8,
p < .01 in both cases. Thus with CVC
nonwords of high and low Noble m’, 
confusion errors are significantly re- 
lated to degree of FSS while for CVC 
words confusion errors are independent 
of FSS. 

For the high-m’ CVC condition 
confusion errors in relation to degree 
of FSS occur with increasing frequency 
in the order 1, 0, and 2 letters in com- 
mon. The difference between the first 
two is small and not significant. For 
low-m’ CVCs the frequency of con- 
fusion errors is directly related to de- 
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gree of FSS, errors increasing in the 
order 0, 1, and 2 letters in common. 


DISCUSSION 


In the present experiment acquisition 
involves discrimination among stimulus 
items which, taken in pairs, vary in their 
degree of formal similarity. To the ex- 
tent that responses are elicited by some 
fractional component of the stimulus 
trigram, a nonrandom distribution of con- 
fusion errors results. More precisely, 
the probability of failing to discriminate 
between two stimuli will vary directly 
with the degree of FSS. The results 
confirm the prediction that the strength 
of this functional relationship increases 
as SM decreases. 

With CVC words, FSS emerges as 
an irrelevant variable as far as the dis- 
tribution of response errors is concerned. 
The probability of failing to discriminate 
two words of high FSS is no greater than 
that for stimuli that have no letters in 
common. This result accords with the 
findings of conditioning studies (e.g.,
Abbott, 1966) which show that with 
meaningful words as stimuli, the amount 
of generalization to words of high formal 
similarity is very low. 

In contrast to this result, with CVC 
nonwords of high and low m’, the rela-
tionship between FSS and the distribu- 
tion of errors is highly significant. The 
significant interaction FSS X SM is 
accounted for largely by this change in 
the distribution of errors in moving from 
words to nonwords. In fact, for the two 
nonword conditions alone, the interaction 
FSS x SM is not significant, F (2, 
48) = 2.23. It is clear therefore that 
the degree of stimulus selection does not 
increase linearly with increases in m’.
As far as the m’ scale is concerned CVC
nonwords of high m’ differ from non-
words of low m’ by twice the amount they
differ from CVC words; yet the signif- 
icant shift in the distribution of errors 
occurs as a function of this smaller dif- 
ference. Since previous studies which 
have tested the hypothesis of an FSS X 
SM interaction have not included CVC 
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words as a stimulus condition, the pres- 
ent results support the conclusion that 
if SM is restricted to within the range 
of CVC nonwords, then such an inter- 
action is not obtained with any con- 
sistency. 

The large effect attributable to the 
CVC elements being words suggests the 
importance for this condition of mediated 
distinctiveness as a factor, in addition to 
the effect of SM in facilitating integra- 
tion of the stimulus elements. There is 
an important qualitative difference be- 
tween the associates elicited by CVC 
words and CVC nonwords, a difference 
which is largely independent of the level 
of meaningfulness of the latter. Unlike 
words, a CVC nonword typically elicits 
a primary associate which is essentially a 
reading of the stimulus, in the sense that 
it is based on, and incorporates, struc- 
tural and phonetic properties of the CVC. 
For example, 73 of the 100 CVCs used 
by Mandler (1955) elicit primary asso- 
ciates which have the same first letter 
as the stimulus CVC. It is possible 
therefore that the large difference be- 
tween CVC words and nonwords is to 
be accounted for by the failure of the 
nonwords to elicit an association or read- 
ing which mediates distinctiveness. It 
may even be the case that, under certain 
circumstances, associations to the non- 
word stimuli may interfere with discrim- 
ination learning, the strength of this 
interference effect increasing with SM. 
If this were so, it would account for the 
finding of Goss et al. (1962) that the dif- 
ference between high and low FSS was 
greater for high than for low SM. 
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MOTION AND TEXTURE AS SOURCES OF SLANT 
INFORMATION 1
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Computer-generated motion pictures of random dot patterns on slanted 
planes translating along a horizontal axis were displayed to 40 Ss. 
The displays included combinations of 4 levels of texture gradients, 
corresponding to 0°, 20°, 40°, and 60° slants, and 5 levels of velocity 
gradients, 0°, 20°, 40°, 60°, and no motion. Ss were instructed to tilt
a clear plastic plane to the same angle as the displayed plane. Slant
judgments were predictable from a combination of the gradients, with
greater weight given to velocity information. The relationship of the
findings to gradient and perspective interpretations of slant perception
was discussed.


In a recent study of the perception 
of rigid transformations of visual pat- 
terns (Braunstein, 1966b) and in ear- 
lier research on motion perspective (re- 
viewed by Braunstein, 1962b; Flock,
1965), investigators attempted to select 
stimuli minimizing depth information 
from static sources. In all cases, how- 
ever, a texture gradient necessarily was 
present as the carrier of the motion, 
and depth information present in the 
velocity gradient was also present in 
the texture gradient. The purpose of 
the present study was to explore the 
relative effects of variations in texture 
and velocity gradients on slant judg- 
ments for surfaces in motion, when the 
correlation between these gradients is 
artificially reduced. 

One method of separating the effects 
of texture and velocity gradients in- 
volves the display of translations, along 
an axis perpendicular to the line of 
sight, of three-dimensional patterns. 


1 This research was supported by National 
Science Foundation Grant GB 5545. Com- 
puter facilities were provided by the West- 
ern Data Processing Center, University of 
California, Los Angeles. 

The author is grateful to Orel F. Cole- 
man, who prepared the stimuli, and John W. 
Payne, who assisted in all phases of this 
study.


If the pattern is close to the nodal point
of the observer’s eye, both texture 
and velocity gradients provide indi- 
cations of relative depth within the pat- 
tern. For a sufficiently distant pattern, 
the velocity differences may become 
imperceptible. The texture would also 
be difficult to discern unless an optical 
system, such as a telescope, is used for 
magnification. This situation may be 
simulated in the laboratory through the 
use of a shadow projection system with 
a relatively distant light source, or by 
means of computer-generated parallel 
projections. Using the latter tech- 
nique, Braunstein (1966b) found that 
Ss could reliably identify a dot pat- 
tern as two-dimensional when both 
texture and velocity gradients were 
uniform and as three-dimensional when 
both indicated curvature, but could 
not reliably select either dimensionality 
when the texture gradient indicated 
curvature although the velocity grad- 
ient was uniform. 

The present study also used com- 
puter-generated displays of transforma- 
tions of dot patterns. The dots were 
randomly distributed on a plane slanted 
with the top (or bottom) away from S. 
The plane was moved from left to 
right (or right to left) across S’s field 
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of view. Size, shape, and brightness 
of the dots did not vary with slant. 
Information concerning the degree of 
slant was available to S only through 
the retinal gradients of velocity and 
texture generated by the projection of 
the slanted plane. Normally, the same 
slant angle would be implied by both 
these gradients. Using computer-gen- 
erated displays, however, it was pos- 
sible to independently vary the texture 
and velocity gradients. This was 
accomplished by using one angle of 
rotation in computing the position of 
each dot on the projection plane, the 
two-dimensional display that S actually
observed, and using another angle of 
rotation in computing the relative mo- 
tion of each dot on the projection plane 
(across S’s field of view). These two 
angles may or may not be the same. 
While perspective would normally be 
identical for both velocity and texture 
gradients, the independent variation of 
these gradients results in a reduced cor- 
relation between perspective informa- 
tion available from these two sources. 
It is therefore necessary to define text- 
ure and motion perspective separately. 

The definition of perspective used 
in the present study, previously dis- 
cussed by Braunstein (1962a), dif- 
fers from Freeman’s second definition 
(1966, p. 26) principally in that a ratio 
between distances rather than a dif- 
ference between visual angles is used. 
The choice of the ratio definition in 
the present line of research is due to 
the interest in relative perspective 
comparisons, rather than in the meas- 
urement of detection thresholds for 
perspective variations. 

Consider two pairs of points in 
three-dimensional space, $(x_1, y_1, z_{t1})$,
$(x_2, y_2, z_{t1})$ and $(x_3, y_3, z_{t2})$, $(x_4, y_4, z_{t2})$,
such that the distance $d_{12}$ between the
first and second points is equal to the
distance $d_{34}$ between the third and
fourth points and $z_{t1}$ is the minimum z
value of any point displayed while $z_{t2}$
is the maximum z value. Let $d'_{12}$ be
the projection of $d_{12}$ onto any frontal
parallel (X-Y) plane and $d'_{34}$ be the
projection of $d_{34}$ onto the same plane.
Texture perspective is defined as
$d'_{12}/d'_{34}$.
Consider next the two points
$(x_5, y_5, z_{m1})$ and $(x_6, y_6, z_{m2})$, where $z_{m1}$
is the minimum z value of any point
displayed and $z_{m2}$ is the maximum z
value. The first point is moved within
its X-Y plane by an amount $d_{m1}$ and
the second point is moved, in the same
time interval, within its X-Y plane
by $d_{m2}$, where $d_{m1} = d_{m2}$. Let $d'_{m1}$ and
$d'_{m2}$ be the projections of $d_{m1}$ and $d_{m2}$,
respectively, onto any frontal parallel
plane. Velocity perspective is defined
as $d'_{m1}/d'_{m2}$. These definitions are ap-
plicable to projections of any three-di- 
mensional pattern. In the case of a 
plane rotated about a horizontal axis, 
the points having maximum or mini- 
mum z values, used in the computation 
of these ratios, would be any points 
having y coordinates corresponding to 
the upper or lower vertical extremes of 
the display. 

In normal visual environments, the 
z coordinates of the points which are 
closest or furthest away for the texture 
computation ($z_{t1}$ and $z_{t2}$) would be the
same as for the motion computation
($z_{m1}$ and $z_{m2}$). In the present study,
after computing the positions of the
points with respect to a slant angle for
texture, new z values were assigned on
the basis of the slant angle for motion. 
If the slant angles are not the same, 
all z values, except for points on the 
axis of rotation, will be different and 
the texture and motion perspective ra- 
tios will therefore be different. 


METHOD 


Subjects—The Ss were 22 male and 18 
female undergraduates, 36 of whom served 
as part of an introductory psychology course 
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requirement and 4 of whom were volun- 
teers from a psychology elective course. 
Stimuli—The stimuli were 240-frame, 
16-mm. motion picture sequences with each 
frame containing approximately 750 small 
white dots against a dark background. The 
sequences were computer generated and rep- 
resented translations along the X axis of a 
plane rotated about that axis, where X and 
Y are the horizontal and vertical axes per- 
pendicular to the line of sight, the Z axis. 
In the computation of the coordinates of 
the displayed dots, points were first selected 
on an X-Y plane by a pseudorandom process 
yielding a rectangular distribution of co- 
ordinates and thus a uniform texture. The 
plane was then tilted (rotated about the X 
axis) by the texture angle, α, producing a
new Y coordinate and a Z coordinate for 
each point. The points on the rotated plane 
were then projected onto a frontal parallel 
plane through the origin. This was accom- 
plished by multiplying the X and Y co- 
ordinates of each point by a projection factor, 
E/(E-z), where (0,0,E) is the projection 
point and z is the z coordinate of the point 
being projected. As E was always greater 
than z for the points displayed, the projec- 
tion factor increased with increasing z
values. This resulted in increased distance 
between the projections of points closer to 
the projection point and decreased distance 
between the projections of points more dis- 
tant from the projection point. This process 
generated a texture gradient based on α.
The computation of the velocity gradient 
began with the original coordinates of the 
points on the X-Y plane. The plane was 
rotated about the X axis by a slant angle
β. A Z coordinate and a new projection
factor, E/(E-z), based on this rotation was
computed for each point. This factor was
multiplied by ΔX, the change in position of
the plane along the X axis per unit time, to
obtain the projected change in position for
each point. These computations yielded a
velocity gradient based on β.
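
The two computations just described may be sketched in a few lines. This is a minimal illustration under stated assumptions, not the program used to prepare the stimuli: the function names, the random seed, the horizontal half-width, and the step size are hypothetical, while the projection factor E/(E-z) and the projection point (0, 0, 3) follow the text.

```python
import math
import random

E = 3.0  # projection point is (0, 0, E), as stated in the text


def projection_factor(z, e=E):
    """Projection factor E/(E - z); larger for points nearer the projection point."""
    return e / (e - z)


def make_dots(n, half_width, y_lo, y_hi, seed=0):
    """Pseudorandom dots with a rectangular distribution on the X-Y plane."""
    rng = random.Random(seed)
    return [(rng.uniform(-half_width, half_width), rng.uniform(y_lo, y_hi))
            for _ in range(n)]


def texture_positions(dots, alpha_deg):
    """Displayed positions: rotate about the X axis by alpha, then project."""
    a = math.radians(alpha_deg)
    projected = []
    for x, y in dots:
        y_rot, z = y * math.cos(a), y * math.sin(a)
        f = projection_factor(z)
        projected.append((x * f, y_rot * f))
    return projected


def projected_steps(dots, beta_deg, delta_x):
    """Projected horizontal step per unit time, using a separate slant beta."""
    b = math.radians(beta_deg)
    return [projection_factor(y * math.sin(b)) * delta_x for _, y in dots]


# Hypothetical values: 750 dots, texture slant 20 deg, velocity slant 60 deg.
dots = make_dots(750, half_width=1.0, y_lo=-1.0, y_hi=1.0)
positions = texture_positions(dots, alpha_deg=20)
steps = projected_steps(dots, beta_deg=60, delta_x=0.01)
```

Because the positions depend only on α and the steps only on β, the two gradients can be set to imply different slants, which is the manipulation the study relies on.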

While these computational procedures 
yield independent texture and velocity grad- 
ients, it is not possible to maintain the inde- 
pendence of all other display variables. If 
S has an unrestricted vertical field of view, 
the extent of the projected plane will vary 
with the texture gradient, yielding an un- 
wanted additional source of slant informa- 
tion. If S’s field of view is restricted to an 
area within the boundaries of the plane, he 
will see the projection of more of the vertical 
extent of the plane with greater texture 
angles and will consequently see a wider 
range of velocities. As a restricted field 
of view was used, the vertical extent of the 
original X-Y plane visible to S varied with 
texture angle. The dimensions of the X-Y 
planes used in generating the displays are 
given in Table 1. They are adjusted to 
avoid visible edges during observation of 
the film sequences. The projection point 
was (0,0,3) in all cases. Texture gradients 
representing slants of 0° and 60° are illu- 
strated in Fig. 1. 
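
The dependence of the visible vertical extent on texture angle, and the resulting perspective ratios, can be illustrated numerically. The sketch below rests on assumptions not stated in this form in the text: the visible field is taken to extend one theoretical unit above and below center, positive y is taken to tilt toward the projection point, and each ratio is taken as the larger projection factor over the smaller. The function names are hypothetical.

```python
import math

E = 3.0  # distance of the projection point from the origin, from the text


def vertical_limits(alpha_deg, half_field=1.0, e=E):
    """Plane coordinates whose projections fall on the field edges.

    Solves y*cos(a) * e/(e - y*sin(a)) = +/- half_field for y.
    """
    a = math.radians(alpha_deg)
    near = half_field * e / (e * math.cos(a) + half_field * math.sin(a))
    far = -half_field * e / (e * math.cos(a) - half_field * math.sin(a))
    return far, near


def perspective_ratio(texture_deg, slant_deg, e=E):
    """Ratio of projection factors at the two visible vertical extremes.

    The extremes are fixed by the texture angle; the z values are computed
    from slant_deg, which is the texture angle for texture perspective and
    the velocity angle for velocity perspective.
    """
    far, near = vertical_limits(texture_deg)
    s = math.sin(math.radians(slant_deg))
    return (e / (e - near * s)) / (e / (e - far * s))


for texture in (0, 20, 40, 60):
    far, near = vertical_limits(texture)
    ratios = [round(perspective_ratio(texture, v), 2) for v in (0, 20, 40, 60)]
    print(texture, round(far, 2), round(near, 2), ratios)
```

Under these assumptions the computed limits and ratios agree with the values in Table 1 to within rounding, apart from the sign convention chosen for the vertical limits.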

The stimulus film contained two random 
arrangements of 20 sequences representing 
the combinations of α = 0°, 20°, 40°, and
60° with β = 0°, 20°, 40°, and 60° and the
four texture gradients displayed with no mo- 
tion of the dots. In the first 20 sequences 
the top of the plane was tilted toward S; in 


Fig. 1. Texture gradients representing slants of 0° (left) and 60°.

TABLE 1
DESCRIPTION OF THE STIMULUS SEQUENCES

                    Texture                                  Velocity
Sequence   ----------------------    Vertical       ------------------------
No. a      Slant   Perspective       Limits b       Slant        Perspective
                   Ratio                                         Ratio

 1         0°      1.00              −1.00, 1.00    Stationary
 2                                                  0°           1.00
 3                                                  20°          1.26
 4                                                  40°          1.55
 5                                                  60°          1.81
 6         20°     1.28              −.95, 1.21     Stationary
 7                                                  0°           1.00
 8                                                  20°          1.28
 9                                                  40°          1.58
10                                                  60°          1.86
11         40°     1.78              −1.02, 1.81    Stationary
12                                                  0°           1.00
13                                                  20°          1.37
14                                                  40°          1.78
15                                                  60°          2.16
16         60°     3.72              −1.27, 4.73    Stationary
17                                                  0°           1.00
18                                                  20°          1.80
19                                                  40°          2.76
20                                                  60°          3.72

Note.—Each sequence was produced once with the top of the plane slanted toward S and once with the top
slanted away from S.
a Not related to order of presentation.
b Upper and lower values of the vertical extent of the plane within which dots are visible to S, in theoretical
distance units.


the second half of the film the top of the 
plane was tilted away from S. 

Apparatus—A 16-mm. motion picture 
projector (L-W Model 224-A), operated 
at 24 frames per second, was used to display 
the stimulus film. Two of the shutter 
blades were covered to avoid the perception 
of multiple images noted by Braunstein 
(1966a). The film was projected through a 
glass window onto a translucent screen 
(Polacoat) in a separate room. The S
viewed the screen monocularly from the 
opposite side at a distance of 61.3 in., through 
a tube restricting his field of view to a 20.5- 
in. radius circle. One unit of the theoretical 
display space was projected onto 20.5 in. of 
the screen. The viewing distance corre- 
sponded to the distance of the projection 
point, three units from the origin. Back- 
ground and dot luminances were approxi- 
mately .01 and 1 ftl., respectively. 

A clear Plexiglas plane, 8 in. square and 
.375 in. thick, mounted in a 14-in. square 
frame served as the response device. The 
center of the plane was 10 in. below and 
15 in. to the right of the viewing tube. 


The border of the frame facing S was .5 
in. wide and painted white. All other appara- 
tus visible to S was painted flat black. The 
motion of the plane operated one of a pair of 
synchronous motors. The second motor 
moved a needle on a protractor visible to E. 
A number 47 lamp controlled by E was
located behind S, 33 in. from the plane. 

Procedure—The Ss were told they would 
see dots on a plane, the top of which was 
either tilted toward or away from them. 
These positions were demonstrated by E 
using the Plexiglas plane. The S was fur-
ther instructed to “move the [Plexiglas]
plane so that its tilt, if any, matches the tilt
of the plane which you see through the
tube . . .” and to return the plane to the
upright position after each response. 

The film was started at a randomly se- 
lected sequence, run through to the end, 
and then started at the beginning and run 
three sequences past the initial starting
point. The first three sequences, which were 
repeated at the end of the session, were 
treated as practice trials. The starting 
point of the film was different for each S; 
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half of the Ss were shown the film in re- 
verse to counterbalance remaining order 
effects. 

After each sequence was displayed for 
10 sec. the response device was illuminated 
and S moved the plastic plane to indicate 
his response. The light was then ex- 
tinguished, and the next sequence was dis- 
played. 


RESULTS 


The deviation of the angle to which 
S set the response plane from the ver- 
tical was treated as the dependent vari- 
able. A negative sign was applied to 
any deviations in the direction opposite 
to that displayed. Table 2 shows the 
mean angle to which the response 
plane was tilted, for each of the 40 
stimulus sequences. A four-way analy- 
sis of variance (velocity gradient, text- 
ure gradient, direction of tilt, and Ss), 
omitting stationary displays, showed 
significant (p <.05) main effects for 
motion, F (3, 117) = 241.1, and text- 
ure, F (3, 117) = 59.3, but not for di- 
rection, F (1, 39) =.002. The Velo- 
city X Texture interaction was signif- 
icant, F (9, 351) = 8.27, p < .05, as 
was the triple interaction, F (9, 351) = 
4.14, p<.05, but the interactions 
Velocity x Direction, F (3, 117) = 
2.68, and Texture X Direction, F (3, 
117) = 1.90, were not. All tests used 
the interaction of the effects tested with 
Ss as the error term. 

The stationary displays, omitted 
from the above analysis, were com- 
pared to those produced with uniform 
velocity (a tilt angle for velocity of 0°) 
for the four levels of texture and two 
levels of direction of tilt. Significant 
(p< .05) main effects were found for 
motion, F (1, 39) = 7.8, and texture, 
F (3, 117) =7.7, and for the inter- 
actions Motion x Texture, F (3, 117) 
=6.5, and Texture X Direction, F 
(3, 117) = 4.9. 

Product-moment correlation coeffi-

TABLE 2

MEAN SLANT JUDGMENTS FOR THE 20 COM-
BINATIONS OF TEXTURE AND VELOCITY

                              Velocity
Texture    Stationary     0°      20°     40°     60°

0°            −0.3        1.7    11.6    18.6    20.7
20°            2.1       −0.2    13.9    18.8    24.3
40°            1.8        0.2    15.5    25.2    28.7
60°           11.6        2.4    27.8    34.8    41.5



cients were computed between the 
mean response for each display and 
both the texture and velocity angles. 
The response-texture correlation was 
.39; the response-motion correlation 
was .82. The multiple regression of 
slant judgments on texture and velo- 
city angles was J=.21T +.45V — 
2.1, where J is the mean judged re- 
sponse angle in degrees, T is the dis- 
played texture angle and V is the dis- 
played velocity angle. The multiple 
R was .91. 
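
As a rough check on the regression reported above, the slopes can be re-estimated from the 16 cell means for the moving displays in Table 2. The sketch below is illustrative only. It assumes the cell means as printed, and it relies on the fact that texture and velocity angles are uncorrelated in the full 4 × 4 design, so that each multiple-regression slope equals the corresponding simple slope and the squared multiple R is the sum of the two squared correlations. The function and variable names are hypothetical. Since the published analysis may have been based on unrounded data, only approximate agreement with J = .21T + .45V − 2.1 should be expected.

```python
from math import sqrt

ANGLES = [0, 20, 40, 60]  # displayed texture (rows) and velocity (columns) angles

# Mean judged slant for the moving displays of Table 2
# (rows: texture angle, columns: velocity angle).
MEAN_SLANT = [
    [1.7, 11.6, 18.6, 20.7],
    [-0.2, 13.9, 18.8, 24.3],
    [0.2, 15.5, 25.2, 28.7],
    [2.4, 27.8, 34.8, 41.5],
]


def fit_orthogonal_regression(angles, table):
    """Fit J = b_t*T + b_v*V + a to a full factorial table of cell means.

    Valid only because T and V are uncorrelated across the cells.
    """
    cells = [(t, v, table[row][col])
             for row, t in enumerate(angles)
             for col, v in enumerate(angles)]
    n = len(cells)
    mean_t = sum(t for t, _, _ in cells) / n
    mean_v = sum(v for _, v, _ in cells) / n
    mean_j = sum(j for _, _, j in cells) / n

    # Sums of squares and cross-products about the means.
    ss_t = sum((t - mean_t) ** 2 for t, _, _ in cells)
    ss_v = sum((v - mean_v) ** 2 for _, v, _ in cells)
    ss_j = sum((j - mean_j) ** 2 for _, _, j in cells)
    sp_tj = sum((t - mean_t) * (j - mean_j) for t, _, j in cells)
    sp_vj = sum((v - mean_v) * (j - mean_j) for _, v, j in cells)

    b_t = sp_tj / ss_t
    b_v = sp_vj / ss_v
    r_t = sp_tj / sqrt(ss_t * ss_j)
    r_v = sp_vj / sqrt(ss_v * ss_j)
    return {
        "b_texture": b_t,
        "b_velocity": b_v,
        "intercept": mean_j - b_t * mean_t - b_v * mean_v,
        "r_texture": r_t,
        "r_velocity": r_v,
        "multiple_r": sqrt(r_t ** 2 + r_v ** 2),
    }


fit = fit_orthogonal_regression(ANGLES, MEAN_SLANT)
```

Run on the printed means, the refit gives slopes of about .22 for texture and .45 for velocity with an intercept near −2.3, close to the reported equation; the correlations come out slightly higher than the reported .39, .82, and .91.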

Judged slant was also compared to 
the perspective values of the moving 
displays. The product-moment cor- 
relation of the response with motion 
perspective was .90. The correlation 
between the response and texture per- 
spective was .42. The independent 
variables in this analysis, motion and 
texture perspective, correlated .53. 
The regression equation was J =
−.91T + 16.3V − 7.9, with a multiple
R of .90. As the vertical limits of the
portion of the original plane visible to
S were fully determined by the texture
angle, the coefficients .42 and .53 also
describe the relationship between the 
displayed vertical extent and the re- 
sponse and between this extent and 
motion perspective, respectively. The 
partial correlation of the response with 
motion perspective, holding the effect 
of texture perspective (or vertical ex- 
tent) constant, was .89. 
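
The partial correlation and the multiple R quoted above can be approximated from the three zero-order correlations given in this paragraph. The following sketch applies the standard first-order partial-correlation formula and the two-predictor formula for multiple R; the function names are hypothetical, and the inputs are the rounded values printed in the text, so agreement only to about two decimals should be expected.

```python
from math import sqrt


def partial_correlation(r_xy, r_xz, r_yz):
    """Correlation between x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


def multiple_r(r_y1, r_y2, r_12):
    """Multiple R for y regressed on two (possibly correlated) predictors."""
    r_squared = (r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12 ** 2)
    return sqrt(r_squared)


# Zero-order correlations as printed in the text.
R_RESPONSE_MOTION = 0.90
R_RESPONSE_TEXTURE = 0.42
R_MOTION_TEXTURE = 0.53

partial = partial_correlation(R_RESPONSE_MOTION, R_RESPONSE_TEXTURE, R_MOTION_TEXTURE)
multiple = multiple_r(R_RESPONSE_MOTION, R_RESPONSE_TEXTURE, R_MOTION_TEXTURE)
```

With the rounded inputs the partial correlation comes to about .88 and the multiple R to about .90, in line with the reported .89 and .90.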

Discussion


When the usual correlations between 
texture and velocity gradients are arti- 
ficially reduced, slant judgments remain 
predictable from a combination of these 
sources of information, with greater 
weight given to the displayed velocity 
gradient. The comparison of displays 
varying in texture gradient but produced 
with uniform velocity, with stationary 
displays having the same texture gradi- 
ents, is of particular interest. If the 
perception of slant were due to the pres- 
ence of a nonuniform velocity gradient, 
the absence of such a gradient might be 
expected to result in the elimination of 
velocity as a relevant cue. A given text- 
ure gradient produced with uniform 
velocity should then provide the same 
slant information as a stationary view of 
that texture. If, on the other hand, a 
uniform velocity serves as a basis for the 
perception of a surface perpendicular to 
the line of sight, judged slant should be 
reduced under the uniform velocity con- 
dition. The latter explanation was sup- 
ported. When no motion was displayed, 
slant judgments varied with displayed 
texture. When a uniform velocity grad- 
ient was displayed, judged slant dropped 
to near zero, regardless of texture. This 
demonstrates the inappropriateness of the 
line of reasoning holding that depth per- 
ception requires special sources of in- 
formation (cues) and that flatness is 
perceived when information from these 
sources is lacking. Instead, information 
in the retinal image, such as a velocity 
gradient, may indicate either extent in 
depth (slant) or flatness. When informa- 
tion from a given source is eliminated, 
judgments are made on the basis of other 
available information. 

Since the present study compares the 
effects of texture and velocity informa- 
tion in terms of gradients, recent criticism 
of gradient theories, e.g., Eriksson
(1964) and Freeman (1965), must be 
considered. Both of these writers pro- 
pose alternative explanations in terms 
of cues, but only Freeman is specific 
about the nature of these cues. Freeman
emphasizes contour perspective as
a source of slant information. As the 
principal objectives of the present study 
required a continuous texture, eliminat- 
ing visible contours, a direct interpreta- 
tion of the results in Freeman’s terms is 
not appropriate. Perspective ratios may 
be considered, however, both in terms 
of texture and velocity variations. Judged 
slant was found to be predicted about 
as well from motion perspective alone as 
from a combination of the slant angles 
displayed through velocity and texture 
gradients. 

There is no question about slant angle 
being derivable mathematically from in- 
formation in the proximal stimulus. Any 
projection, even a parallel projection, of 
a slanted plane with randomly distributed 
texture elements contains sufficient in- 
formation to mathematically derive a 
slant angle.² Consider a plane rotated
about the horizontal axis perpendicular 
to the line of sight. Start at the inter- 
section of the line of sight with the axis 
of rotation and count some arbitrary num- 
ber of texture units, going up (or down) 
the vertical axis. Then proceed left (or 
right) for the same number of units. The 
ratio of the visual angles subtended by 
the vertical excursion to that subtended 
by the horizontal excursion is the cosine 
of the slant angle. (This is a somewhat
simpler way to obtain optical θ
than that proposed by Flock, 1964.)
There is unfortunately little convincing 
evidence that the human observer is par- 
ticularly sensitive to the variables in- 
volved in making slant judgments from 
this information, or capable of the re- 
quired transformations. 
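
The derivation just described can be written out as a short computation. The sketch below is only an illustration of the stated relation, that the ratio of the vertical to the horizontal visual angle for equal counts of texture units is the cosine of the slant angle. It assumes a parallel projection in which the vertical excursion is foreshortened by the cosine of the slant while the horizontal excursion is not; the function names are hypothetical.

```python
import math


def projected_extents(n_units, unit_size, slant_deg):
    """Projected vertical and horizontal extents of n texture units.

    Assumes parallel projection of a plane rotated about a horizontal axis:
    the vertical excursion is foreshortened by cos(slant), the horizontal
    excursion is unchanged.
    """
    extent = n_units * unit_size
    return extent * math.cos(math.radians(slant_deg)), extent


def slant_from_extent_ratio(vertical, horizontal):
    """Recover the slant angle (degrees) from the two subtended extents."""
    if horizontal <= 0:
        raise ValueError("horizontal extent must be positive")
    ratio = vertical / horizontal
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("ratio must lie between 0 and 1")
    return math.degrees(math.acos(ratio))


vertical, horizontal = projected_extents(n_units=10, unit_size=0.5, slant_deg=40.0)
recovered = slant_from_extent_ratio(vertical, horizontal)
```

As the footnote to this paragraph notes, the cosine gives the degree of slant but not its direction, which is why the function returns an unsigned angle.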

Perspective, on the other hand, is well 
established as a relevant variable in depth 
perception, even when studied in isolation 
(Braunstein, 1962b). However, as Free- 
man (1965) points out, perspective de- 
pends not only upon the slant angle, but 
upon the projection (viewing) distance 
and the dimensions of the surface. This 
dependency is found with the present 
ratio definition of perspective, as well as
with Freeman’s difference definition.
One is left with the situation that optical
θ, uniquely derivable from slant, may not
be a perceptual variable and perspective,
a perceptual variable, is not uniquely
derivable from slant.

2 Degree of slant, but not direction, can be
determined from parallel projections.

In the stationary sequences in the pres- 
ent study, texture gradients appeared 
quite insufficient as sources of slant in- 
formation, with a judged slant of only 
11° corresponding to a displayed 60° 
texture. There was no indication of an 
ability on the part of Ss to accurately 
perceive slant angle on the basis of the 
displayed texture. In order for slant 
angle to be derived from a velocity grad- 
ient, rate of change in position along the 
X axis must be considered with respect 
to distance above that axis, on the origi- 
nal plane. On the slanted plane, this 
judgment would have to be made in terms 
of surface density units. If Ss could 
perceive units of equal texture density 
for these stimuli, slant judgments based 
on texture gradients alone should have 
proved more accurate. It is therefore 
proposed, in agreement with Freeman 
(1965), that Ss cannot make the dis- 
criminations required by gradient theory 
and are unable to accurately judge slant 
either from textured surfaces or perspec- 
tive rectangles, in the abstract. Relative 
slant judgments may be made from per- 
spective variations, and high correlations 
of these judgments with displayed slant 
are obtained when the viewing distance 
or projection point is kept constant. 
Slants may be reproduced accurately 
when the comparison and standard stimuli 
are located at the same distance from S. 
This is the general indication of results 
in the literature (Flock, 1965). If the 
viewing distance of the standard stimulus 
were randomly varied without indications 
to S of these variations, it is unlikely that the
slant judgments made with a comparison 
stimulus at a constant distance would be 
very accurate. This prediction is based 
on an hypothesis that perspective is the 
principal source of information used by 
Ss to judge slant, and slant judgments
can be accurate only when other variables 
influencing perspective are controlled. 
This hypothesis, while clearly differing 
from Flock’s approach, is also not identi- 
cal to Freeman’s perspective theory, 
which tends to restrict the study of slant 
perception to figures that are rectangular 
in the frontal parallel plane. Certainly, 
deviations of contour lines from right 
angles with the base of a figure appear 
to be highly detectable. Deviations from 
uniform velocity affect slant judgments in 
a similar manner. Deviations from uni- 
form texture carry similar information, 
but have a lesser effect on judged slant. 


This study was designed to test the hypothesis that associatively related
nouns embedded in connected discourse—at least in the case of
items that appear in the same or in contiguous sentences—are stored 
more efficiently than associatively unrelated nouns. A 2 X 2 factorial 
design was employed (with 20 Ss in each group) in which Ss were 
asked to memorize a passage that contained associatively related nouns 
or a passage that contained associatively unrelated nouns. After a 
single exposure to a passage, retrieval was tested by a written recall 
test or by a paced successive binary recognition-memory test that 
was designed to greatly reduce opportunities for construction. The 
context of the high-association (HA) passage was the same as the 
context of the low-association (LA) passage. The hypothesis was 
supported; the facilitating effect of association was independent of
retrieval method.


Previous research (Rosenberg, 1966, 
in press) involving auditory and visual 
presentation and a variety of passages 
has reliably confirmed the expectation 
that associatively related words em- 
bedded in connected discourse would 
be recalled better than associatively un- 
related words. This expectation was 
based upon some assumptions about 
the factors that operate during the 
storage and retrieval of connected dis- 
course, most important of which are 
the chunking (during storage) of asso- 
ciatively related items that occur in the 
same sentence or in contiguous sen- 
tences, and the construction (during 
retrieval) of associates on the basis of 
items already recalled. 

1 The research reported herein was per-
formed pursuant to Contract OEC-3-6-
0617840508 with the United States Depart-
ment of Health, Education and Welfare,
Office of Education, under the provisions of
P.L. 83-531, Cooperative Research, and the
provisions of Title VI, P.L. 85-864, as
amended.

The importance of construction in
the recall of structured verbal materials
has been emphasized by several writers
(e.g., Deese, 1961; Postman, 1965).
In regard to the present problem, the 
results of a number of cloze studies 
(Rosenberg, in press) in which stimu- 
lus or response words (high- and low- 
strength associates from free-associa- 
tion norms) were deleted from pass- 
ages of connected discourse and Ss 
were required to “guess” the deleted 
items on the basis of the remaining con- 
text, indicated that construction could 
have contributed appreciably to the re- 
call scores of Ss in the learning studies 
who were exposed to high-association 
passages. The question raised by 
these cloze data is whether associative 
facilitation in the recall of items embedded
in connected discourse could be eliminated
if opportunities for construction were
eliminated or greatly reduced. If this did
not turn out to be
the case, one would have to conclude 
that associatively related words appear- 
ing in connected discourse are stored 
more efficiently than associatively un- 
related words. 

In the present study one group of
Ss was exposed to a passage that con- 
tained associatively related nouns, and 
another group to a passage that con- 
tained associatively unrelated nouns. 
These groups were further subdivided 
into groups that received a written re- 
call task and groups that received a 
paced successive binary recognition- 
memory task similar to one used by 
Lachman and Tuttle (1965) in a study 
of the recall and recognition of items 
from passages that varied in approxi- 
mation to English. The recognition- 
memory task was designed to elimi- 
nate or at least greatly reduce oppor- 
tunities for construction and had the 
following characteristics: (a) key 
items (the high- and low-strength asso- 
ciates) were randomized within a long 
list of distracter items; (b) key items 
did not appear in the order in which 
they appeared in the passages; (c) key 
items rarely appeared contiguously; 
(d) key items were presented one at a 
time, aurally, so that S could not 
search through the entire list for re- 
lated items or for items that appeared 
contiguously in the passages; (e) the 
list was presented at a rapid rate so 
that S would have little time to think 
of the passage as a whole; (f) no 
words appeared on Ss’ response sheets ; 
and (g) none of the context items 
from the passages appeared in the 
recognition list. 


METHOD 


Subjects.—Eighty paid volunteers from
introductory undergraduate psychology 
courses were assigned at random to four 
groups of 20 Ss each. 

Materials.—A high-association (HA) and
a low-association (LA) passage were pre- 
pared. Each contained the same context 
materials, the same stimulus words (14 
nouns) selected from free-association norms 
(Palermo & Jenkins, 1964), but different 
response words. Three high-strength response
words accompanied each stimulus
word in the HA passage, and three
low-strength items accompanied each stimulus
word in the LA passage. The HA and LA 
response words for each stimulus were com- 
parable in Thorndike-Lorge (1944) fre- 
quency (most of the key items were AA and 
A words), and in length, identical in gram- 
matical class (nouns), and meaningful in the 
context of the passage. The words within 
each group of four HA items tended to elicit 
each other in free association. The index of 
interitem associative strength for the 56 HA 
items (stimulus words plus response words) 
was 21.44%, and for the 56 LA items it was 
.02%. Twenty-four of the words that were 
designated LA responses did not occur at 
all as responses in the Palermo and Jen- 
kins (1964) college norms, 17 occurred as 
idiosyncratic responses in the norms as a 
whole, and 1 occurred with a frequency of 
three in the college norms. 

The two passages are reproduced below, 
and for the reader’s convenience, the stimulus 
words have been placed in parentheses and 
the response words have been underlined. 
The three response words for each stimulus 
word are the underlined items that follow 
the item in parentheses. 


High-association passage—It was a
Sunday afternoon in March and the
air was clear and brisk. The (MAN)
walked to the corner to meet the _woman_
who was looking at a _boy_ and a _girl_
across the way. After exchanging a
few words, they went to a restaurant,
and as they entered, they passed someone
who was examining a (TABLE), a
_chair_ and a _desk_, and someone with a
_cloth_. After they ordered, there was
time to look at a newspaper, where there
was a story about a (KING) and
a _queen_. It concerned a _ruler_ who had
just been given a new _crown_. There
was also a story about a (PRIEST)
whose _church_ was in difficulty, which
went on to discuss _God_ and _religion_.
The front page contained an item about
a (LION) that had a fight with a _tiger_
in a _zoo_, and had its _mane_ pulled. It
also contained an article on (JUSTICE),
_law_ and _peace_ written by a former _judge_.
Their order arrived, and everything was
good: the (CHEESE), the _bread_, the
_crackers_ and the _milk_. After they
finished, they headed toward a (CITY)
which was located near a _town_ on the
other side of the _state_. The _country_
they passed through on the way was
nice. When they arrived, they went
directly to a (HOUSE) where a _home_
used to be. They had to check up on
the _roof_ and the _garage_. There was a
(RIVER) nearby, and the _water_ was
visible on a clear day. A _lake_ and a
_stream_ were nearby also. The caretaker
was outside with a (HAMMER) and
a _nail_. A _tool_ of some sort and a _saw_
were on the ground. There were some
(SHOES) on the grass, and there was
dirt on the fellow’s _feet_. His _socks_
and _laces_ were discolored. Someone had
left a (NEEDLE), some _thread_, a _pin_
and his _sewing_ about, and there were
some (KITTENS), _cats_, _dogs_ and _mice_
outside, as well. It grew late and they
decided to leave.

Low-association passage—It was a
Sunday afternoon in March, and the air
was clear and brisk. The (MAN)
walked to the corner to meet the _guest_
who was looking at a _horse_ and a
_bird_ across the way. After exchanging
a few words, they went to a restaurant,
and as they entered, they passed someone
who was examining a (TABLE), a
_door_ and a _post_, and someone with a
_glass_. After they ordered, there was
time to look at a newspaper, where
there was a story about a (KING) and
a _nurse_. It concerned a _leader_ who
had just been given a new _plane_. There
was also a story about a (PRIEST)
whose _friend_ was in difficulty, which
went on to discuss _life_ and _industry_.
The front page contained an item about
a (LION) that had a fight with a _hound_
in a _pit_ and had its _fuzz_ pulled. It also
contained an article on (JUSTICE), _age_
and _doubt_, written by a former _sailor_.
Their order arrived, and everything
was good: the (CHEESE), the _corn_, the
_lettuce_ and the _fish_. After they finished,
they headed toward a (CITY) which
was located near a _road_ on the other
side of the _valley_. The _region_ they
passed through on the way was nice.
When they arrived, they went directly
to a (HOUSE) where a _store_ used to be.
They had to check up on the _gate_ and
the _plaster_. There was a (RIVER) nearby,
and the _forest_ was visible on a
clear day. A _grave_ and a _station_ were
nearby also. The caretaker was outside
with a (HAMMER) and a _bell_. A
_stem_ of some sort and a _pan_ were on the
ground. There were some (SHOES) on
the grass, and there was dirt on the fellow’s
_porch_. His _wrists_ and _knees_ were
discolored. Someone had left a
(NEEDLE), some _metal_, a _hat_ and his
_violin_ about, and there were some
(KITTENS), _guns_, _rocks_ and _oars_ outside
as well. It grew late and they
decided to leave.


To determine beforehand whether there 
were any strong preexperimental constraints 
between context material and key items 
that might facilitate recall of HA or LA 
items, cloze data were collected on the 
context of the HA and LA passages in 
printed form. For this purpose, all the 
stimulus and response words were deleted 
and replaced by underlined blank spaces of 
uniform length. The Ss (N = 21) for
this study were told that their task was to 
guess the words that were deleted from the 
passage on the basis of the remaining con- 
text. Each S’s response sheet was scored 
for verbatim construction of HA and LA 
items, and to increase the sensitivity of the 
cloze test to contextual constraints, an item 
from a given group of HA or LA words 
was counted correct if it appeared in any 
one of the four positions assigned to that 
group in the original passage. 

The proportion of cloze responses that 
were found to be HA items was .06, and 
LA items .03. While there is a difference 
in favor of HA items, it is slight, and the 
result primarily of constraints between the 
context and one group of HA items, man, 
woman, boy, girl. If this group and its 
LA counterpart are eliminated, the propor- 
tions in question become, for HA and LA 
items, respectively, .03 and .02. Since these 
groups of items appear at the beginning of 
their respective passages, they should both 
be subject to a primacy effect and not con- 
tribute to any difference that might occur 
in the recall of HA and LA items. 

The recognition-memory test for each 
level of association consisted of a list of 
168 nouns (56 key items and 112 distrac- 
ters). The distracter items were the same 
for the HA and LA conditions, and were 
selected in a nonsystematic manner from 
among the AA and A nouns of the Thorn- 
dike-Lorge (1944) norms. Four different 
random orders of items were constructed for 
the recognition lists to control for possible
serial effects, with the following limitations: 
(a) no more than two key items were 
allowed to occur contiguously, (b) the order 
in which the key items appeared among the 
distracters was different from the order in 
which they appeared in the passage, and 
(c) each successive group of 21 words 
contained seven key items (thus the key 
items were distributed throughout the list 
of distracter items). None of the nouns 
from the context of the passages appeared 
within the recognition lists. 
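
The list-construction limits (a) and (c) above lend themselves to a mechanical check. The sketch below is a hypothetical reconstruction, not the procedure actually used: it marks each of the 168 list positions as key item or distracter, draws random orders by simple rejection sampling, and keeps the first order that has seven key items in every successive block of 21 and no run of more than two contiguous key items. Limit (b), that the key items appear in a different order from that of the passage, concerns which word fills each key position and is not modeled here. All names are illustrative.

```python
import random

BLOCK_SIZE = 21       # words per lettered list
KEYS_PER_BLOCK = 7    # key items required in each block of 21
N_BLOCKS = 8          # 8 x 21 = 168 words in all
MAX_KEY_RUN = 2       # no more than two key items contiguously


def satisfies_constraints(is_key):
    """Check limits (a) and (c) on a sequence of key/distracter flags."""
    # (c) every successive block of 21 holds exactly seven key items
    for start in range(0, len(is_key), BLOCK_SIZE):
        if sum(is_key[start:start + BLOCK_SIZE]) != KEYS_PER_BLOCK:
            return False
    # (a) no run of key items longer than two, including across blocks
    run = 0
    for flag in is_key:
        run = run + 1 if flag else 0
        if run > MAX_KEY_RUN:
            return False
    return True


def make_key_pattern(rng):
    """Draw random block orders until one satisfies both limits."""
    while True:
        pattern = []
        for _ in range(N_BLOCKS):
            block = [True] * KEYS_PER_BLOCK + [False] * (BLOCK_SIZE - KEYS_PER_BLOCK)
            rng.shuffle(block)
            pattern.extend(block)
        if satisfies_constraints(pattern):
            return pattern


pattern = make_key_pattern(random.Random(0))
```

Rejection sampling is chosen here only for brevity; shuffling within blocks guarantees limit (c) by construction, so the loop in effect screens for limit (a).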

The recognition Ss were each given two
sheets of paper, each of which contained four
lists of underlined blanks numbered from 1
through 21. These eight lists were lettered from A
through H. The Ss in the recall groups
were given sheets of lined paper. The pass- 
ages and the recognition lists were recorded 
on magnetic tape. 

Procedure.—The conditions of the experiment
were presented in rotation to groups of
from two to five Ss. All Ss listened to a 
passage played once on a tape recorder fol- 
lowed by either a written recall test or a 
recognition-memory test. The presentation 
time for each passage was 1 min, 55 sec., 
and the study-test interval was 4 sec. The 
passages had been recorded at a moderate 
speaking rate with normal intonation. The 
learning and test instructions were de- 
livered by E prior to presentation of a 
passage. All Ss were told that their task
was to listen to a passage and to try to 
remember, verbatim, as much of it as they 
could. The Ss in the recall condition were 
asked to write down as much of their 
passage as they could remember, and to
guess at items they could not remember.
The signal to begin writing was the words
“start writing,” delivered by E.
The recall Ss were told that they would 
be given ample time to complete their writ- 
ing task. 

The recognition Ss were told that for the 
recognition-memory task, they would be 
presented aurally with a long series of words 
numbered consecutively in groups of 21, 
some of which were in the passage and some 
of which were not, and that they were to 
respond to each word by putting down in the 
appropriate underlined space on the response 
sheets, a plus mark if they felt it was in 
the passage or a minus mark if they felt 
it was not. The recognition Ss were in- 
structed further to respond quickly and to 
guess on all items they were not sure of. 

The time interval from one numbered
item in the recognition lists to another was
3 sec., during which S heard a number and 
a word, and then recorded a plus or a minus 
on his response sheet. The interval between 
each series of 21 items was 3 sec. 


RESULTS 


The main dependent variable was the 
total number of key words recalled or 
recognized correctly. The written re- 
call protocols were scored for verbatim 
recall, and a key item was scored as 
being correct regardless of its location 
on the recall sheets. A key item that 
occurred more than once in S’s writ- 
ten recall was always scored as having 
occurred once. The recognition test 
sheets were also scored for the tend- 
ency to give false positives, and a sec- 
ond score was computed—a corrected 
score—using the formula 

        C − 56E/D,

where C is number of key words cor- 
rect, E is number of false positives, 
and D is number of distracters. The 
constant 56 refers to number of key 
items in the list. With this correction, 
S’s score would reduce to zero if he 
recorded a plus for every word in the 
recognition list. None of the Ss in 
the recognition condition failed to com- 
plete their task. 
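
The correction for false positives described above can be expressed as a one-line function. The printed formula is partly illegible in this copy, so the sketch assumes the reading C − 56E/D, which is the reading that makes the score fall to zero when S records a plus for every word, as the text requires. The function name and the example values are hypothetical.

```python
N_KEY_ITEMS = 56      # key items in each recognition list
N_DISTRACTERS = 112   # distracter items in each recognition list


def corrected_score(n_correct, n_false_positives,
                    n_key=N_KEY_ITEMS, n_distracters=N_DISTRACTERS):
    """Key words correct, less key items times the false-positive rate."""
    return n_correct - n_key * n_false_positives / n_distracters


# An S who marks every word with a plus gets all 56 key items "correct"
# but also 112 false positives, so the corrected score is zero.
all_plus = corrected_score(56, 112)

# A hypothetical S with 40 key words correct and 28 false positives.
example = corrected_score(40, 28)
```

Under this reading each false positive costs half a point, since there are twice as many distracters as key items.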

TABLE 1

MEANS AND SDs FOR THE NUMBER OF KEY
WORDS RECALLED AND RECOGNIZED
(UNCORRECTED)

                          Association
Retrieval Method         HA        LA

Recall
    M                  24.40     16.55
    SD                  6.57      7.37
Recognition
    M                  49.95     41.10
    SD                  5.32      8.12

Table 1 contains the means and
standard deviations for the uncorrected
scores. It can be seen there that the
HA groups were superior to the LA 
groups, and that the recognition condition
was superior to the recall condition.
A 2 × 2 factorial analysis of
variance revealed an F (1, 76) of 
19.54, p < .001, for Association, and 
an F (1, 76) of 231.08, p < .001, for 
Recall vs. Recognition. The inter- 
action was not significant. 

Group LA-Recognition recorded
more false positives (M = 23.90, SD
= 10.26) than Group HA-Recognition
(M = 21.05, SD = 15.66), but the difference
was not significant, t (38) =
.68, p > .05. The means for the cor- 
rected scores for the HA and LA 
recognition groups were, respectively, 
36.20 (SD = 8.11) and 28.70 (SD = 
9.83). The 2 × 2 factorial analysis of
variance was repeated, using the corrected
scores for the recognition groups, with the
same pattern of results: F (1, 76) = 18.14,
p < .001, for Association, and F (1, 76) =
44.12, p < .001, for Recall vs. Recognition.
The interaction was not significant. Since the
false-positive rate did not differ reliably
between the recognition groups, and since the
uncorrected and corrected scores led to the
same conclusions, only uncorrected scores
were used in subsequent analyses.
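
The comparison of false-positive counts reported in this paragraph can be reproduced from the summary statistics alone. The sketch below assumes an independent-groups t test with pooled variance and 20 Ss per recognition group, which is consistent with the 38 degrees of freedom quoted; the function name is hypothetical.

```python
from math import sqrt


def pooled_t(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Independent-groups t with pooled variance, from summary statistics."""
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * sd_a ** 2 + (n_b - 1) * sd_b ** 2) / df
    standard_error = sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean_a - mean_b) / standard_error, df


# False positives: Group LA-Recognition vs. Group HA-Recognition.
t_value, df = pooled_t(23.90, 10.26, 20, 21.05, 15.66, 20)
```

The result is a t of about .68 on 38 degrees of freedom, matching the value given in the text.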

The effect of association appeared 
strikingly in the case of the 14 key 
words—the stimulus words—which 
were the same for the HA and LA 
conditions. Table 2 contains the de- 
scriptive statistics for this measure. 
The trends revealed in Table 2 are the 
same as the trends revealed in Table 1. 
The analysis of variance resulted in 
F (1, 76) = 28.87, p < .001, for Asso- 
ciation, and F (1, 76) = 105.65, p< 
.001, for Recall vs. Recognition. The 
interaction was not significant. 
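
For a balanced 2 × 2 design the F ratios can be approximated from cell means and standard deviations. The sketch below does this for the stimulus-word scores of Table 2, assuming 20 Ss per cell, an error mean square equal to the average of the four cell variances, and 76 error degrees of freedom. The function name and data layout are hypothetical, and because the printed means and SDs are rounded, only approximate agreement with the reported values (28.87 and 105.65) should be expected.

```python
# Cell means and SDs from Table 2, keyed by (association, retrieval method).
MEANS = {("HA", "recall"): 8.55, ("LA", "recall"): 5.45,
         ("HA", "recognition"): 13.10, ("LA", "recognition"): 10.95}
SDS = {("HA", "recall"): 2.16, ("LA", "recall"): 2.70,
       ("HA", "recognition"): 1.07, ("LA", "recognition"): 2.44}
N_PER_CELL = 20


def factorial_2x2_f_ratios(means, sds, n):
    """F ratios (each on 1 and 4(n - 1) df) for a balanced 2 x 2 design."""
    ms_error = sum(sd ** 2 for sd in sds.values()) / len(sds)

    def f_for(sign):
        # Single-df contrast over the four cell means with +1/-1 weights.
        contrast = sum(sign(assoc, method) * mean
                       for (assoc, method), mean in means.items())
        ss_effect = n * contrast ** 2 / 4
        return ss_effect / ms_error

    a = lambda assoc, method: 1 if assoc == "HA" else -1
    r = lambda assoc, method: 1 if method == "recognition" else -1
    return {
        "association": f_for(a),
        "retrieval": f_for(r),
        "interaction": f_for(lambda assoc, method: a(assoc, method) * r(assoc, method)),
    }


f_ratios = factorial_2x2_f_ratios(MEANS, SDS, N_PER_CELL)
```

On the printed values this gives F of roughly 28.9 for Association and 106.0 for Recall vs. Recognition, with an interaction F below 1, in agreement with the analysis reported above.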

TABLE 2

MEANS AND SDs FOR THE NUMBER OF
STIMULUS WORDS RECALLED AND
RECOGNIZED (UNCORRECTED)

                          Association
Retrieval Method         HA        LA

Recall
    M                   8.55      5.45
    SD                  2.16      2.70
Recognition
    M                  13.10     10.95
    SD                  1.07      2.44

If S recalls one word from a group
of four nouns which had appeared in
the same or in contiguous sentences in
connected discourse, the probability of 
recalling all of the other words in the 
group should be greater for HA items 
than for LA items. This expectation is 
predicated on the assumption that the 
location of associatively related items in 
the HA passage makes associative 
chunking possible during learning and 
associative construction possible during 
recall. A similar prediction is made for 
recognition-memory; however, in this 
case, such a finding would have to be 
attributed primarily to the storage fac- 
tor. A recall-dependency score was 
computed for each S by dividing the 
number of whole groups of four nouns 
recalled or recognized by the number of 
opportunities to recall or recognize a 
whole group, i.e., the number of in- 
stances in which at least one of the 
words from a group had been recalled 
or recognized. The means for the 
HA-Recall and LA-Recall conditions 
were .19 and .05, respectively, and for 
the HA-Recognition and LA-Recogni- 
tion conditions, .55 and .39. Thus, 
the HA group was superior to the LA 
group for both recognition and recall. 
An analysis of variance revealed F (1,
76) = 16.55, p < .001, for Associa- 
tion, and F (1, 76) = 86.55, p < .001, 
for Recall vs. Recognition. The inter- 
action was not significant. 
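
The recall-dependency score defined above can be computed for a single S as follows. The sketch is illustrative: the three word groups are taken from the HA passage, but the set of retrieved words is invented for the example, and the function name is hypothetical.

```python
def recall_dependency_score(groups, retrieved):
    """Whole groups retrieved divided by groups with at least one word retrieved.

    Returns None when S retrieved no word from any group, since there
    was then no opportunity to retrieve a whole group.
    """
    retrieved = set(retrieved)
    opportunities = 0
    whole_groups = 0
    for group in groups:
        hits = sum(word in retrieved for word in group)
        if hits >= 1:
            opportunities += 1
        if hits == len(group):
            whole_groups += 1
    if opportunities == 0:
        return None
    return whole_groups / opportunities


# Three of the 14 groups of four nouns from the HA passage.
GROUPS = [
    ("man", "woman", "boy", "girl"),
    ("table", "chair", "desk", "cloth"),
    ("king", "queen", "ruler", "crown"),
]

# Hypothetical protocol: one whole group, part of a second, none of a third.
score = recall_dependency_score(GROUPS, {"man", "woman", "boy", "girl", "table", "desk"})
```

In this example there are two opportunities and one whole group, so the score is .50; the third group contributes nothing because no word from it was retrieved.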

Discussion


While these results do not rule out 
the possibility that construction contrib- 
utes to the recall of associatively related 
nouns embedded in connected discourse, it 
is evident (within the limitations of the 
design and the materials of the present 
study) that associatively related nouns 
are retrieved better than associatively 
unrelated nouns even when opportunities 
for construction during retrieval are 
greatly reduced. However, it is possible 
that in the present study the contribution 
of construction to recall scores was mini- 
mal, because, for all measures of per- 
formance the difference between HA and 
LA for the recall condition (where, sup- 
posedly, both storage factors and con- 
struction during retrieval were operat- 
ing) was not greater than the difference 
between HA and LA for the recognition 
condition. 

While opportunities for construction 
may be severely limited in the present 
recognition task, the likelihood of re- 
sponding positively to a key item in the 
HA-Recognition lists may be increased 
by another factor. According to tradi- 
tional associative theory, whenever a 
word occurs, it should tend to elicit im- 
plicitly its high-strength associates. In 
terms of the present HA-Recognition lists, 
it is possible that since the words in the 
various groups of four are, in the main, 
associatively interrelated, the implicit 
occurrence of key items during retrieval 
(occasioned by the overt occurrence of 
other key items) may act as prompts 
to the recognition of associates that 
occur subsequently. This expectation is 
strengthened by observations of Storms 
(1961) on associative priming in word- 
association tests and of Underwood 
(1965) on the role of implicit associative 
responses in false recognition. If such 
a factor was operating in the present 
study, one would expect its effects to be 
cumulative; i.e., the likelihood of responding
positively to a key HA item
should increase as a function of the num- 
ber of associatively related items that 
have already occurred during recognition. 
No such effect should be forthcoming in 
the case of the LA-Recognition condition.
To test this hypothesis, a rank of one, 
two, three, or four was assigned to each 
item in the 14 HA and LA word-groups 
(for all list orders) in terms of the order 
of its occurrence in its recognition list. 
While the items assigned each rank 
varied from list order to list order, the 
position within the list of distracter items 
of all items assigned the same rank was 
the same. The dependent variable was 
the proportion of Ss who responded posi- 
tively to the items assigned each rank. 
The values for this variable for Ranks 
1, 2, 3, and 4, respectively, were, for 
Group HA-Recognition, .91, .86, .79, and 
.80, while for Group LA-Recognition
they were .86, .75, .70, and .62. It can 
be seen that although HA Ss generally 
surpassed LA Ss, recognition accuracy 
decreased in both groups as a function of 
rank order, with HA Ss leveling off at 
Rank 3. 

Thus, there appears to be no evidence 
for a prompting effect in the case of HA 
items, but rather a general reduction in 
recognition accuracy due most likely to 
an accumulation of retroactive interfer- 
ence from the distracter items. 
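
The direction of the rank effect can be summarized by the least-squares slope of the proportion of positive responses on rank. The sketch below is a descriptive illustration only, using the eight proportions given above (with the fourth HA value taken to be .80); the function name is hypothetical, and no significance test is implied.

```python
RANKS = [1, 2, 3, 4]
HA_PROPORTIONS = [0.91, 0.86, 0.79, 0.80]
LA_PROPORTIONS = [0.86, 0.75, 0.70, 0.62]


def linear_slope(xs, ys):
    """Least-squares slope of ys on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sp_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    return sp_xy / ss_x


ha_slope = linear_slope(RANKS, HA_PROPORTIONS)
la_slope = linear_slope(RANKS, LA_PROPORTIONS)
```

Both slopes are negative (about −.04 per rank for HA and −.08 for LA), whereas a prompting effect would call for a positive slope in the HA group.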

One other factor should be considered 
in interpreting the present results. The 
author has spoken throughout of the oc- 
currence of construction during retrieval. 
It is possible, however, that in a case in 
which some of the key HA and LA 
items are unintelligible during original 
auditory exposure to the passages, S 
may attempt to construct (“fill in”) these 
items, and as the results of the cloze 
studies (Rosenberg, in press) suggest, 
the constructed items are more likely to 
be HA responses of the intelligible key 
words than LA responses. The result of 
such construction, of course, might be to 
enhance the recall and recognition scores 
of HA Ss. However, since, upon post- 
experimental questioning, all of the pres- 
ent Ss indicated no difficulty in under- 
standing the materials presented, it is 
unlikely that this factor contributed to 
the superiority of the HA condition. 

Taken together, the results of the pres- 
ent study offer strong support for the 
hypothesis that associatively related words
embedded in connected discourse (at least 
in the case of associatively related nouns 
that occur in the same or in contiguous 
sentences) are stored more efficiently 
than associatively unrelated words.


SECOND-ORDER DEPENDENCY IN PROBABILITY
LEARNING 1


MICHAEL H. STRUB AND JAMES R. ERICKSON
Ohio State University 


44 Ss served in an 800-trial, 2-choice probability-learning experiment 
concerned with Ss’ ability to use 2nd-order conditional probabilities. 
Marginal and 1st-order conditional probabilities were controlled at p =
.50, as were 2 of the 4 2nd-order conditionals. For the other 2 2nd-
order conditionals, the form (event on Trial n either the same as or dif-
ferent from Trial n-2) and amount (72% vs. 92%) of dependency
were varied factorially. High-strength dependencies were learned, but
performance in low-strength groups was strongly affected by the form
of the dependency. The data could be accounted for either by a modi-
fied version of a stimulus-sampling theory or by run-structure theories
of probability learning.


Within probability learning, condi- 
tional probability is defined as the prob- 
ability of an event (or a response) on 
Trial n given that a specified set of 
events has occurred on trials which pre- 
cede n. Several investigators, notably 
Hake and Hyman (1953) and Ander- 
son (1960), have demonstrated that S’s 
response probabilities are close to first- 
order conditional event probabilities 
as well as to overall event frequency. 
It has also been suggested, by apply- 
ing informational analyses to probabil- 
ity-learning data, that Ss normally 
make very little use of events on trials 
more remote than n-1, in that the in- 
formation transmitted by considering 
trials beyond n-1 is quite small in com-
parison to the information transmitted
by the event on Trial n-1. This has
been shown in situations where first-
order conditional probabilities were
manipulated (Hake & Hyman, 1953)
and in situations where they were not
(Edwards, 1961).


1 The research reported in this paper was
carried out at the Human Performance Cen-
ter and was sponsored by the Aeromedical
Research Laboratories, Aerospace Medical
Division, Air Force Systems Command,
Wright-Patterson Air Force Base, Ohio,
under Contract No. AF 33(615)-2248 with
the Ohio State University Research Founda-
tion. Further reproduction is authorized
to satisfy the needs of the United States
Government. Portions of these data were
presented in a paper at the Midwestern Psy-
chological Association meeting in Chicago,
May 1967.

The fact that Ss normally make little 
use of information from trials beyond 
the last one does not mean that they 
cannot do so. They can learn double- 
alternation and other simple repeating 
sequences (Schoonard & Restle, 1961; 
Vitz & Todd, 1967) and undoubtedly 
can respond to much more complex 
sequences involving second- or higher- 
order sequential dependencies in deter- 
ministic situations, but there is almost 
no evidence on whether they can do 
the same in probabilistic situations. 
One of the purposes of this study was 
to provide information on the follow- 
ing questions: Can Ss learn to use 
second-order event dependencies, and 
if so, how does the strength or the 
form of the dependency influence their 
responses?

A second purpose of this study was
to examine whether manipulating one
type of second-order event probability
produces generalization effects on pre-
dictions involving other second-order
probabilities. There is evidence that
manipulation of the first-order condi- 
tional probability of one event affects 
the first-order response probability of 
the other event (Rubinstein, 1961), 
but there is no evidence on this point 
regarding second-order dependencies. 

It should be emphasized that when- 
ever the conditional event structure is 
manipulated, the run structure is also 
changed. Many studies have shown 
that Ss’ responses in probability-learn- 
ing situations are affected by the run 
structure of the sequence (e.g., Gam- 
bino & Myers, 1966; Jones & Myers, 
1966; Nicks, 1959; Restle, 1966; Rose 
& Vitz, 1966) ; in fact, even when Ss 
are told to ignore run length, they do 
not do so (Beach & Swensson, 1967). 
The question of whether Ss respond to 
second-order event patterns or to run 
structures cannot be answered with 
certainty, or even with very much con- 
fidence, since not only is it impossible 
to manipulate these factors ortho- 
gonally, but the correlation between 
them is very high. 

Second-order conditional event prob-
ability is defined as $p(E_{i,n} \mid E_{j,n-1}, E_{k,n-2})$.
In a two-choice situation there are four 
such parameters to be independently 
manipulated. It is possible to construct 
sequences containing both random and 
dependent second-order events and at 
the same time control both marginal 
and first-order probabilities at p = .50. 
For example, consider the following as 
the first 10 events in a sequence: 1, 2,
2, 1, 2, 2, 1, 1, 1, 2. Observe Events
3-10. Note that each event occurs 
equally often (zero-order or mar- 
ginal control) and each event follows 
itself and the other event equally often 
(first-order control). Event pairs 1,1 
and 2,1 are followed by 1 and 2 equally 
often (partial second-order control), 
but pairs 1,2 and 2,2 are always fol- 
lowed by 2 and 1, respectively (partial 
second-order predictability). 
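The counts claimed for the 10-event example can be checked mechanically. The sketch below is only an illustration: the helper name `conditional_counts` is hypothetical, and contexts are keyed in temporal order (oldest event first), as in the event pairs of the example rather than the i|j,k notation used elsewhere.

```python
from collections import Counter, defaultdict

# The 10-event example sequence from the text (1 = E1, 2 = E2).
events = [1, 2, 2, 1, 2, 2, 1, 1, 1, 2]

def conditional_counts(seq, order, start=2):
    """Tally outcomes on trials start..end (0-based), keyed by the
    `order` immediately preceding events in temporal order."""
    counts = defaultdict(Counter)
    for n in range(start, len(seq)):
        context = tuple(seq[n - order:n])  # empty tuple when order == 0
        counts[context][seq[n]] += 1
    return counts

# start=2 restricts the tallies to Events 3-10, as in the text.
marginal = conditional_counts(events, 0)
first = conditional_counts(events, 1)
second = conditional_counts(events, 2)

for label, table in (("marginal", marginal), ("first", first), ("second", second)):
    for context, tally in sorted(table.items()):
        print(label, context, dict(tally))
```

Pairs (1, 2) and (2, 2) each have a single possible successor in the tallies, while every other context is split evenly between the two events.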


Table 1 shows the event probabilities
used in this study. In each of the four
groups, zero-order, first-order, and two
of the second-order probabilities were
“controlled” at p = .50, while the other
two second-orders were “manipulated”
at the desired probability. The manip-
ulated second orders were set at p =
.92 for the high-strength groups (H)
and at p = .72 for the low-strength
groups (L). For the two groups
where 1|1,1 and 2|1,2 are the manip-
ulated second orders,² the predictable
trials are signaled by an $E_1$ event on
the prior trial and the “correct” or
most frequently reinforced response is
to predict the same event that occurred
two trials back; these groups are
labeled E for equal, since the predict-
able events are equal to the event two
trials back. For the two groups where
2|2,1 and 1|2,2 are the manipulated
second orders, the predictable trials fol-
low trials on which $E_2$ appears and
the most frequently reinforced response
is to predict the opposite of the event
that occurred two trials back; these
groups are labeled U for unequal; the
predictable events are unequal to the
event two trials back.

To illustrate the necessary correla-
tion between event dependency and run
structure, consider the run structure
generated by the above second-order
dependencies. The run length distri-
bution of $E_1$ events in the E conditions
is characterized by a large number of
runs of Length 1 (caused by the 2|1,2
dependency) with the remaining runs
tending to be long (caused by the 1|1,1
dependency). The distribution of $E_2$
runs in the U conditions is sharply
peaked at Length 2, with a few runs
of Length 1, 3, or longer (since after
1,2 a 2 is likely and after 2,2 a 1 is
likely). Run-length distributions for
$E_2$ in E conditions and for $E_1$ in U
conditions closely approximate ex-
pected distributions where p = .50
(Nicks, 1959).


2 It has been common to refer to the more
frequently occurring event as $E_1$ and the
less frequent event as $E_2$ in two-choice
situations, and to label predictions of their
occurrence as responses $A_1$ and $A_2$, re-
spectively. Since the events occurred with
equal frequency in the present study, the
designation of the $E_1$ and $E_2$ events was
arbitrary, based only on position. An
alternative would have been to label the
more frequently occurring second-order
event $E_1$ in all conditions. It was decided
not to do this, since in studies where all
four second-order conditionals are manip-
ulated within each schedule, the distinction
between $E_1$ and $E_2$ positions is necessary,
and failure to maintain the distinction here
might have caused future confusion.


TABLE 1

MARGINAL, FIRST-ORDER, AND SECOND-
ORDER PROBABILITIES USED IN
THE PRESENT STUDY

Event Probabilities        Groups
                           HE   LE   HU   LU

Note.—Event probabilities were realized exactly
within each 100-trial block.

To summarize briefly, the purposes 
of this study were to examine (a) how 
Ss’ second-order response probabilities 
are affected by manipulation of the 
strength and form of event second- 
order dependencies, (b) the extent to 
which response dependencies generalize 
from manipulated to controlled second- 
order event dependencies, and (c) 
the effects on response run structure of 
the event run structure generated by 
controlling the second-order event de- 
pendencies. 


METHOD 


Subjects—Forty-four male college stud-
ents were recruited by advertisement and
paid $1.25/hr for participating in the ex-
periment. In addition to their base pay,
they received 4¢ for each correct prediction 
and lost 4¢ for each incorrect prediction. 
Eleven Ss were assigned randomly to each 
of the four groups shown in Table 1. 

Apparatus—An Estes-Straughan (1954) 
type two-choice guessing apparatus was lo- 
cated in S’s sound-treated booth. It con- 
sisted of a white signal light, two red feed- 
back lights, two response keys, and a “home” 
button which S continuously pressed when 
not predicting. On the right side of the 
apparatus were two counters displaying to 
S his current total of correct and incorrect 
predictions. Stimuli were prepunched on 
paper tape and presented via a paper-tape 
input device located outside S’s booth. Re- 
sponses, reinforcement events, and response 
latencies were recorded on a paper-tape out- 
put device. Low-level white noise was pre- 
sented through a set of earphones to mask 
equipment noise. 

Procedure—Each S was instructed that
his task was to decide which of two red 
lights would come on. He began each trial 
by holding down the home key with the 
index finger of his preferred hand. At the 
start of a session the white light appeared 
as a signal for S to make his first prediction.
After his prediction, he returned his finger 
to the home key. Two and one-half seconds 
after the prediction response, one of the 
two event lights lit for .4 sec. and the cor- 
rect or error counter advanced. After re- 
ceiving feedback, S was free to make another 
prediction; trials were thus partially self- 
paced. 

The response timer was activated by the 
onset of the event light and stopped when 
S lifted his finger from the home key to 
make a prediction. It was hoped that this 
would provide a relatively good measure of 
decision latency uncontaminated by move-
ment time, forced-paced intertrial intervals,
etc.

After 102 predictions, S rested while a 
second S took his place; they alternated in 
this manner for a total of eight 102-trial
blocks, four per day for two successive 
days. Daily sessions lasted for approxi-
mately 2 hr.

Design—The principal variables investi-
gated were the form and strength of the
second-order dependency as shown in Table
1. Stimulus schedules were constructed by
selecting events from a table of random
numbers (Rand Corp., 1955) according to
the appropriate second-order probabilities.
The various marginal and conditional prob-
abilities were realized exactly in the last
100 trials of each 102-trial block. Each S
received a random ordering of the eight
schedules. For purposes of analysis and inter-
pretation of generalization and other effects,
two within-Ss variables were identified and
treated as factors. The first was called
second-order discrimination, testing differ-
ences between the controlled (p = .50) and
the manipulated (p = .72 or .92) second-
order conditionals. The second was based
on the first-order patterns within the second
orders; one level was called first-order repe-
tition, based on the relationship $E_{i,n} = E_{j,n-1}$,
and the second level was called first-order
alternation since its defining relationship
was $E_{i,n} \neq E_{j,n-1}$.
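As a rough illustration of how a schedule with second-order dependencies can be produced, the sketch below samples each event from a probability that depends on the two preceding events. It is a sketch under stated assumptions, not the authors' procedure: they drew from a printed random-number table and realized the probabilities exactly within blocks, whereas independent sampling only approximates them. The names `HE`, `generate`, and `prop_one` are hypothetical.

```python
import random

# P(event 1 on Trial n | event j on Trial n-1, event k on Trial n-2),
# keyed (j, k) as in the i|j,k notation. Assumed encoding of Cond. HE:
# 1|1,1 and 2|1,2 manipulated at .92, the other two controlled at .50.
HE = {(1, 1): 0.92, (1, 2): 0.08, (2, 1): 0.50, (2, 2): 0.50}

def generate(p_one, n_trials, rng):
    """Sample an event sequence from second-order conditional probabilities."""
    seq = [rng.choice([1, 2]), rng.choice([1, 2])]  # arbitrary first two events
    while len(seq) < n_trials:
        j, k = seq[-1], seq[-2]
        seq.append(1 if rng.random() < p_one[(j, k)] else 2)
    return seq

def prop_one(seq, context):
    """Proportion of event 1 given `context`, a tuple of preceding events
    listed most recent first: () marginal, (j,) first order, (j, k) second."""
    m = len(context)
    hits = total = 0
    for n in range(m, len(seq)):
        if tuple(seq[n - d] for d in range(1, m + 1)) == context:
            total += 1
            hits += seq[n] == 1
    return hits / total if total else float("nan")

schedule = generate(HE, 20000, random.Random(1))
print(prop_one(schedule, ()), prop_one(schedule, (1,)), prop_one(schedule, (1, 1)))
```

With these settings the marginal and first-order proportions should hover near .50 while the proportion after two $E_1$ events approaches .92, which is the pattern of control and manipulation described above.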


RESULTS 


Response data: Conditional prob-
abilities.—Figure 1 shows the second-
order prediction data for the various
groups. The main analysis of these
data was a $2^4$ factorial analysis of vari-
ance (Form × Strength × Second-
Order Discrimination × First-Order
Pattern), with the mean second-order
response probabilities, i.e.,
$p(A_{i,n} \mid E_{j,n-1}, E_{k,n-2})$, on the last two 100-trial
blocks as the dependent variable.

The main effects of Form, F (1,
40) = 8.82, p < .01, Strength, F (1,
40) = 8.90, p < .01, Second-Order
Discrimination, F (1, 40) = 85.93, p
< .001, and First-Order Pattern, F (1,
40) = 21.73, p < .001, show that,
overall, Ss responded with higher sec-
ond-order response probabilities to E
conditionals, to H conditionals, to ma-
nipulated conditionals, and to repeti-
tion pattern first orders within the sec-
ond-order conditionals.

The Strength × Second-Order Dis-
crimination interaction was highly sig-
nificant, F (1, 40) = 23.79, p < .001,
because the difference between con- 
trolled and manipulated second orders 
was greater for H conditionals. For 
example, in Cond. HE the response 
probability for the controlled 2|2,2 pat- 
tern was .63 on the last two blocks, 
while it was .83 in Cond. LE; corre- 
sponding probabilities for the manip- 
ulated 1|1,1 dependency were .92 for 
HE and .89 for LE. 

Interpretation of the significant
Strength × First-Order Pattern inter-
action is similar, F (1, 40) = 7.37, p < .01.
For the high-strength groups, the re-
sponse probabilities to first-order alter-
nation and repetition patterns were
fairly close by the end of learning,
while for the low-strength groups re-
sponse probabilities were
much higher for patterns with first-
order repetitions (e.g., 1|1,1 and 2|2,1)
than for patterns with first-order alter-
nations (e.g., 2|1,2 and 1|2,2) and,
again, in LU there was no evidence of
use of the 1|2,2 pattern at all.

In addition, the Form × Second-
Order Discrimination interaction ap-
proached significance, F (1, 40) =
3.49, p < .08; there was a tendency
for the controlled and manipulated
second-order dependencies to be more
widely separated for the E than the
U groups. No other interactions ap-
proached significance.

An additional analysis of variance 
was performed on the manipulated 
second-order dependencies only, in- 
cluding trial blocks as a repeated-meas- 
ures variable, to compare learning rates 
for the various conditions. Significant 
effects included Form, Strength, First-
Order Pattern, and Strength × First-
Order Pattern as noted above, a Trial
Block main effect, and a Form ×
Strength × Blocks interaction, F (7,
280) = 2.41, p < .05. The latter
interaction showed that learning trends 
were different for the four groups. 
HE reached asymptote very quickly, 
and HU improved steadily throughout 
the 800 trials but did not reach an 
asymptote. LE improved steadily but 
reached a lower asymptote, and LU 
showed relatively little learning. In 
addition, in the H conditions both first- 
order patterns reached about the same 
level by the last 200 trials, while in the 
L conditions the repetition patterns were
learned to a higher level. In LE the 
repetition pattern overshot the condi- 
tional event probability by a consider- 
able amount, while the alternation pat- 
tern was at a probability-matching 
level; in LU the repetition pattern 
reached a probability-matching level, 
while the alternation pattern remained 
at about p = .50.

Table 2 presents asymptotic data on 
the controlled marginal and first-order 
dependencies. The marginal prob- 
abilities were all approximately .50, 
and the statistical analysis revealed no
significant differences among them.
The first-order probabilities showed
consistent tendencies toward repetition
of the previous event, a bias appearing
also in the first-order pattern effects
found in the analyses of the second-
order data. Analyses of these data
revealed a significant main effect of
Strength, F (1, 40) = 6.94, p < .025,
and a significant Strength × Form
interaction, F (1, 40) = 4.69, p < .05,
for $p(A_{2,n} \mid E_{2,n-1})$, but no significant
effects for $p(A_{1,n} \mid E_{1,n-1})$. The repe-
tition biases were stronger in the L
conditions; the interaction was caused
by the fact that the repetition bias was
strongest in Cond. LE and weakest
in Cond. HE (this was true for both
$E_1$ and $E_2$ events, but reached signif-
icance only for the prediction of $E_2$s).
The interpretation of this is clear from
the run analysis (below), but, in gen-
eral, the conditions for which the sec-
ond-order dependencies were well
learned showed little first-order bias.

Response data: Runs.—Figure 2
shows the probability of making an $A_i$
response as a function of k, the number
of preceding $E_i$ events in the current
run, for the various conditions in the
present study. The largest value of
k varies from condition to condition
and varies for the $E_1$ and $E_2$ events
within conditions, equaling the longest
run for which the frequency is large 
enough for the response probability to 
be stable. Curves are shown for re- 
sponse probabilities in early (Blocks 1 


TABLE 2

MARGINAL AND FIRST-ORDER CONDITIONAL
RESPONSE PROBABILITIES AVERAGED OVER
THE LAST TWO 100-TRIAL BLOCKS

Cond.   $p(A_{1,n})$   $p(A_{1,n} \mid E_{1,n-1})$   $p(A_{2,n} \mid E_{2,n-1})$
HE      .497           .501                          .511
HU      .532           .577                          .523
LE      .467           .592                          .664
LU      .491           .578                          .601

Note.—Corresponding event probabilities = .50.


and 2), middle (Blocks 4 and 5), and
late (Blocks 7 and 8) stages of learn- 
ing along with the corresponding event 
probabilities. Such curves are com-
monly called recency curves. A trend
for $p(A_i \mid kE_i)$ to increase as k in-
creases is called positive recency, and a
trend for $p(A_i \mid kE_i)$ to decrease as k
increases is called negative recency (or
“the gambler’s fallacy,” which may or 
may not be a fallacy, depending on the 
run structure). 
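A recency curve of this kind can be tabulated directly from paired event and response sequences. The following is a minimal sketch, assuming that `responses[n]` is the prediction made on trial n before `events[n]` is shown; the function name `recency_curve` and the toy data are hypothetical and not taken from the study.

```python
from collections import defaultdict

def recency_curve(events, responses, event):
    """Return {k: proportion of predictions of `event` made after a
    current run of exactly k consecutive `event` outcomes}."""
    made = defaultdict(int)
    total = defaultdict(int)
    run = 0  # length of the run of `event` ending on the previous trial
    for n in range(len(events)):
        if run > 0:
            total[run] += 1
            made[run] += responses[n] == event
        run = run + 1 if events[n] == event else 0
    return {k: made[k] / total[k] for k in sorted(total)}

# Toy data only: 1 = E1/A1, 2 = E2/A2.
toy_events = [1, 1, 2, 1, 1, 1, 2, 2]
toy_responses = [1, 2, 1, 2, 2, 1, 2, 2]
curve = recency_curve(toy_events, toy_responses, 1)
print(curve)
```

Values that rise with k correspond to positive recency, and values that fall with k to negative recency. In real data the longer runs are rare, so the estimates for large k rest on few observations.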

In the E conditions, the recency
curves for $p(A_1 \mid kE_1)$ reflect the manip-
ulated second-order dependencies,
while the $p(A_2 \mid kE_2)$ curves reflect the
controlled dependencies. The opposite
is true for the U conditions. All of
the response curves for the E condi-
tions show strong positive recency. In
Cond. HE, Ss were clearly following
a strategy of predicting an alternation
following a run of one $E_1$, but when k
≥ 2, they were predicting that the $E_1$
run would continue. The positive re-
cency is very strong for these Ss by the
middle of learning; in fact, the recency
curve for this condition continues at p
= 1.0 until k was over 20 during the
last two blocks. In Cond. LE there
was also strong positive recency in the
$E_1$ curves, especially during Blocks 7
and 8, though the above “strategy” is
not quite so appropriate. The recency
curves for $E_2$ events also show positive
recency although the event recency
curves, $p(E_2 \mid kE_2)$, are much flatter
than those for $E_1$ events. It seems that
the positive recency generated by the
manipulated E dependencies general-
ized from the $E_1$ runs to the $E_2$ runs.
This is clear whether one looks at the
general form of the recency curves or
looks at the difference between the re-
sponse and event recency curves as an
index of generalization.

Fig. 2. Response and event recency curves, $p(A_i \mid kE_i)$ and $p(E_i \mid kE_i)$,
for various conditions. (Panels: High Strength and Low Strength; separate
curves for Blocks 1 & 2, Blocks 4 & 5, and Blocks 7 & 8.)

Learning was poorer to the U de-
pendencies, both in terms of learning
rate and asymptotic response prob-
abilities; this is also reflected in the
recency curves. In Cond. HU, Ss
showed strong tendencies to predict
runs of Length 2 for the $E_2$ events by
Blocks 7 and 8, but the response re-
cency curve was not as extreme as the
event recency curve, and even after 800
trials Ss did not learn that the longest
run for $E_2$ events was 3! In Cond.
LU, Ss clearly did not learn the tend-
ency for runs to be of Length 2; the
recency curves for this condition show
only a general negative recency trend,
with no sign of a sharp drop following
runs of Length 2. The recency curves
for $E_1$ runs in the U conditions show
none of the positive recency found in
$E_2$ runs from the E conditions; instead
negative recency is shown, with little or
no decrease in the amount of negative
recency, even after 800 trials. Lind-
man and Edwards (1961) and Jones
and Myers (1966) have shown that
negative recency decreases and turns
to positive recency as early as Trial
100 when the event run structure is
quite similar to that of the $E_2$ runs
from the E conditions and the $E_1$ runs
from the U conditions. Again, it is
obvious that the sharp negative re-
cency, which is appropriate for $E_2$
runs in U conditions, generalizes to the
$E_1$ runs. The amount of generalization
can perhaps most clearly be seen by
comparing the recency curves for $E_2$
runs in Cond. LE with $E_1$ runs in
Cond. LU. Here the event run struc-
tures are quite similar, but the response
recency curves are quite different.

Latency data—Latency data were
analyzed for the various conditions
above, but will be presented only
briefly here.³ In general, response
speed was faster under the H than the
L conditions and faster to repetition
first orders than to alternations. The
latency learning curves were quite dis-
similar for the various conditions. In
Cond. HE, responses were fairly fast
from the start and reached asymptote
quickly (by Block 3 or 4), while in
other conditions responses were quite
slow during Block 1, dropped sharply
over blocks, but had not reached an
asymptote by the end of Block 8.


3 Complete data are available from the
authors.


Discussion 


The data indicate that Ss can learn to 
use information from trials beyond n — 1 
in probabilistic situations. They seem to 
be able to use any strong second-order 
structure in event sequences in making 
predictions, but use of weak second-order 
dependencies depends a good deal on the 
form and (first-order) pattern of the de- 
pendency. It seems also that there is a 
good deal of generalization from manip- 
ulated to controlled dependencies (or run 
structures). Whether generalization of 
responses to manipulated second-order de- 
pendencies caused the generalization ob- 
served in the recency curves (Fig. 2) 
or generalization of responses to the 
dominant event recency curves caused 
the differences observed in the second- 
order response conditional curves (Fig. 
1) is, of course, impossible to say; the 
data can be interpreted either way. 

A theory of probability learning could 
be constructed to assume that Ss respond 
to second-order event probabilities, using 
the two prior events as cues in making 
each response. Such a theory would be 
similar to the Burke and Estes (1957) 
k-span model discussed by Restle (1961) 
and Rose and Vitz (1966). In order 
to account for the present data, the k- 
span theory would have to be revised to 
account for first-order repetition biases, 
possibly by positing a weighting function 
on the prior events such that the event 
on Trial n— 1 would be more likely to 
be sampled than the one on n — 2, etc. 
In addition, this weighting function
would have to be inversely related to
$p(E_{i,n} \mid E_{j,n-1}, E_{k,n-2})$ to account for the
fact that first-order biases were much
weaker when second-order dependencies
were strong.

On the other hand, present theories of 
probability learning assuming that event 
runs are the relevant stimuli for S in 
making predictions can also account for 
the data in this study. The best de- 
veloped of these are the relatively deter- 
ministic model of Restle (1961) and the 
generalization model of Gambino and 
Myers (1967). Of these, the present 
data most strongly support the Gambino 
and Myers theory, which unlike Restle’s 
can account for the form of the recency 
curves obtained, for the perseverative 
errors (which were most prominent in
Cond. HU but also occurred in other 
conditions), and for the strong general- 
ization effects from runs of the manip- 
ulated event to the controlled event. The 
fact that the generalization is primarily 
one way, however, is not predicted by the 
theory as it now stands. The necessary
modification is slight, involving replace-
ment of their parameter $\alpha$ (Gambino &
Myers, 1967, p. 418) with two parameters
$\alpha_1$ and $\alpha_2$, yielding different amounts of
generalization from $E_1$ to $E_2$ runs than
from $E_2$ to $E_1$.


Previous experiments have shown that people are conservative
processors of fallible information when data are sampled from 1 of a 
set of binomial populations. In the experiments reported here, Ss
observed samples of data and revised odds estimates about which of 2 
normally distributed populations was being sampled. Revisions were 
nearly optimal when based on an individual datum, but became con- 
servative when based on a sequence of data. This result points to 
misaggregation of the diagnostic impacts of 2 or more data as a 
major source of conservatism in information processing. Additionally,
Ss were more conservative when a datum favored the population that 
previous data had favored; overall conservatism was only about 4 as 
great as predicted from experiments that used binomial populations. 


Man’s environment is largely proba- 
bilistic and, as such, can be modeled 
by probability theory. Observable 
events may be generated by sources 
that are inherently unobservable. 
Knowledge of such sources is available 
through processes analogous to statis- 
tical inference, the set of procedures 
by which a statistician makes infer- 
ences about populations on the basis of 
observing samples of data. Accord- 
ingly, many experiments have used 
probability theory as a framework 
within which to investigate human in- 
ference (Peterson & Beach, 1967). 

Much of this research has used the 
following experimental paradigm: Two 
different hypotheses specify two dif- 
ferent binomial populations and data 
are sampled from one of them. On 
observing each datum, Ss become more 
or less sure of which hypothesis is cor-
rect and reflect this change of opinion
by revising probabilities they have as- 
signed to the two hypotheses. These 
experiments have shown that such 
probability revisions bear orderly rela- 
tions to optimal revisions prescribed 
by formal Bayesian models of statis- 
tical inference (e.g., Edwards, Lind- 
man, & Phillips, 1965). 

Binomial data generating processes, 
because of their extreme simplicity, 
have provided a useful tool in this re- 
search. But this simplicity poses prob- 
lems with respect to the generalization 
of results. Since a binomial datum can 
take on only two different values, there 
are only two possible revisions that 
can be made when a datum is sampled. 
Inferential tasks in real life, however, 
are not always so simple; information 
often varies along a continuum rather 
than being restricted to only two 
values. The present experiments in- 
vestigated inference with continuously 
variable data sources.

Statisticians have found the model 
of a normal distribution to have tre- 
mendous generality when random 
events vary along a continuum. Sir 
Francis Galton expressed this general-
ity with particular enthusiasm:

I know of scarcely anything so apt to
impress the imagination as the wonderful 
form of cosmic order expressed by the “Law 
of Frequency of Error.” The law would 
have been personified by the Greeks and 
deified, if they had known it. It reigns 
with serenity and in complete self-effacement 
amidst the wildest confusion. The huger 
the mob and the greater the apparent an- 
archy, the more perfect its sway. It is the 
supreme law of Unreason. Whenever a 
large sample of chaotic elements are taken 
in hand and marshalled in the order of their 
magnitude, an unsuspected and most beau- 
tiful form of regularity proves to have been 
latent all along [Walker, 1943]. 


There is a reason for such general- 
ity: events tend to result from multi- 
ple causes. The central limit theorem 
asserts that, under quite general con- 
ditions, when several effects combine in 
an additive fashion, the results will be 
approximately normally distributed, 
even though the individual effects may 
not be. The generality of normal dis- 
tributions in a person’s natural ecology 
provides the motive for the present 
experiment, to investigate how people 
use fallible data to make inferences 
about normally distributed populations. 
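The additive-effects argument can be illustrated with a small simulation. The sketch below is an illustration only, under the assumption that each effect is an independent uniform(0, 1) draw, which is clearly non-normal on its own; the function name `summed_effects` and the choice of 12 effects are hypothetical.

```python
import random
import statistics

def summed_effects(n_effects, n_samples, rng):
    """Each sample is the sum of n_effects independent uniform(0, 1) effects."""
    return [sum(rng.random() for _ in range(n_effects)) for _ in range(n_samples)]

rng = random.Random(0)
sums = summed_effects(12, 20000, rng)

mean = statistics.fmean(sums)
sd = statistics.stdev(sums)
# For a normal distribution, roughly 68% of values lie within 1 SD of the mean.
within_1sd = sum(abs(x - mean) <= sd for x in sums) / len(sums)
print(round(mean, 2), round(sd, 2), round(within_1sd, 3))
```

A sum of 12 such effects has an expected mean of 6 and a standard deviation of 1, and the share of sums within one standard deviation of the mean should be near the normal-distribution value even though each component is flat.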


Bayes’ Theorem 


Just as with binomial populations,
Bayesian statistics provide the theory
of optimal inference against which one
can evaluate performance with nor-
mally distributed populations. Bayes’
theorem is a formal model for the revi-
sion of probability in the light of new
information.
Given a pair of hypotheses, $H_a$ and $H_b$,
and the occurrence of a datum, D, Bayes’
theorem may be written as


$\log \Omega_1 = \log L + \log \Omega_0$, [1]


where $\Omega_0$ is the odds in favor of $H_a$
prior to observing D, L is the likeli-
hood ratio, the ratio of $P(D \mid H_a)$ to
$P(D \mid H_b)$, and $\Omega_1$ is the posterior
odds, after observing D. The log of
the likelihood ratio is an additive mea-
sure of the amount of revision from
prior to posterior opinion as the result
of observing D. The Bayesian form 
of the measure (BLLR) is calculated 
by subtracting the log of Bayesian 
prior odds from the log of Bayesian 
posterior odds; the corresponding sub- 
jective measure (SLLR) is inferred 
by subtracting the log of estimated 
prior odds from the log of estimated 
posterior odds. 
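The two measures can be made concrete with a short sketch. Everything numeric here is hypothetical: the text does not state the base of the logarithm (base 10 is assumed), and the prior odds, likelihood ratio, and estimated odds are invented for illustration. The function names are likewise hypothetical.

```python
import math

def posterior_log_odds(prior_log_odds, likelihood_ratio):
    """Equation 1: log posterior odds = log L + log prior odds (base 10 assumed)."""
    return math.log10(likelihood_ratio) + prior_log_odds

def log_likelihood_ratio(prior_odds, posterior_odds):
    """Revision from prior to posterior odds, expressed as a log likelihood ratio."""
    return math.log10(posterior_odds) - math.log10(prior_odds)

# Hypothetical case: even prior odds and a datum with likelihood ratio 4.
prior_odds = 1.0
bayes_posterior = 10 ** posterior_log_odds(math.log10(prior_odds), 4.0)
bllr = log_likelihood_ratio(prior_odds, bayes_posterior)

# Hypothetical subject who revises only to 2:1 after the same datum.
sllr = log_likelihood_ratio(prior_odds, 2.0)
print(round(bllr, 3), round(sllr, 3))
```

An SLLR smaller than the BLLR for the same datum, as in this invented case, is what the text describes as a revision in the right direction but of less than optimal magnitude.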


Previous Findings 


The present experiments used nor- 
mally distributed populations to in- 
vestigate several important findings 
that have emerged from research using 
binomial populations. In that research, Ss were
conservative; they revised their proba-
bility estimates in the right direction,
but the magnitude of revision was less 
than the optimal amount as calculated 
by using Bayes’ theorem. The Ss 
seemed unable to extract all the infor- 
mation latent in a datum (Edwards et 
al., 1965). The amount of conserva- 
tism, however, was not constant but 
increased as data became more diag- 
nostic. The value of SLLR increased 
with BLLR, but the rate of increase 
was less than the optimal amount 
(Peterson, DuCharme, & Edwards, 
1968; Peterson & Miller, 1965). Part 
of the cause of conservatism seemed to 
reside in the task of aggregating 
evidence over a sequence of data. The 
performance of Ss was more Bayesian 
when the inference was based on only a 
single datum than when it was based 
on a sequence of data (Peterson & 
Swensson, 1968). One reason for the 
deterioration of performance when the 
task required aggregating evidence was 
that the sequence in which the data oc- 
curred influenced the amount by which 
S was conservative. He was more 
conservative when a datum infirmed 
than when it confirmed his currently 
favored hypothesis; there was an
inertia effect (Pitz, Downing, & Rein-
hold, 1967).


EXPERIMENT I 


Experiment I used two different 
normally distributed populations. One 
distribution consisted of heights of men 
and the other of heights of women. 
The Ss observed a sample from one 
of the two populations and used the 
observations as a basis for revising 
their odds estimates about which of the 
two populations generated the data. 


Method 


Construction of stimuli—An empirical 
sample of the heights of 110,000 United 
States men and women between the ages of 
18 and 79 served as the basis for the means 
and standard deviations of the populations 
used in the experiment (Stoudt, Damon, Mc- 
Farland, & Roberts, 1965). Under the 
plausible assumptions of normal distribu- 
tions, this sample yielded for men a mean 
height of 68.2 in. with a standard deviation 
of 2.8; for women the mean was 63.0 in. 
with a standard deviation of 2.5. A table 
of random normal deviates was used to 
generate the heights used as stimuli in the 
experiment. 
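
A minimal sketch of how such stimuli and their diagnostic value could be computed from the stated means and standard deviations. The function names are hypothetical, and a pseudorandom generator stands in for the table of random normal deviates used in the experiment.

```python
import math
import random

# (mean, standard deviation) in inches, as given in the text.
MEN = (68.2, 2.8)
WOMEN = (63.0, 2.5)


def normal_pdf(x: float, mean: float, sd: float) -> float:
    """Ordinate of the normal density at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def log10_lr_men_vs_women(height: float) -> float:
    """Log likelihood ratio (base 10 assumed) favoring men over women."""
    return math.log10(normal_pdf(height, *MEN) / normal_pdf(height, *WOMEN))


def sample_height(population, rng: random.Random) -> float:
    """Draw one height; rng.gauss replaces the printed table of deviates."""
    mean, sd = population
    return rng.gauss(mean, sd)
```

A positive value of `log10_lr_men_vs_women` means the height is more probable under the male distribution, a negative value under the female distribution.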

Apparatus—A vertical board was marked 
like a long ruler, in feet and inches, in units 
of ł in. This board contained a removable 
graph showing two overlapping normal dis- 
tributions, representing the heights of men 
and women, respectively. A metal strip 
along one side of the ruler permitted the 
attachment of magnetic pointers; a pointer 
represented a height selected at random 
from one of the two populations. 

Subjects —Twenty-three undergraduate 
men at the University of Michigan served as 
Ss in groups of approximately eight
each.

Procedure.—The first portion of the ex- 
periment consisted of a presentation of de- 
fining samples of heights designed to inform 
Ss about each of the two populations. In 
the second portion of the experiment, Ss ob- 
served randomly sampled heights and revised 
their probability estimates about which
population was being sampled.

The defining samples in the first portion 
consisted of 48 male heights and 48 female 
heights. The Ss estimated the mean height 
of each population before observing each 
sample, after observing one half of each
defining sample, and again after the end of 
each sample. The mean height of 68.2 in. 
for men elicited a terminal mean estimate of 
68.75 in.; the mean height of 63.0 in. for 
women elicited a terminal mean estimate of 
63.0 in. The average S came to know the 
mean of each of the two populations by the 
end of the defining samples. 

In the second portion of the procedure, Ss 
were given single-datum trials requiring no 
aggregation of evidence and trials containing 
sequences of four data which did require 
aggregation. In both cases, E instructed Ss
that one of the two populations had been 
chosen by the toss of a fair coin. The Ss 
were therefore to set their prior odds about 
the two populations at 1:1. For the single-datum
trials, E sampled a height from the
chosen population and displayed it to Ss. 
On the basis of observing a height Ss 
decided which population was more likely 
the one being sampled and also how many 
times more likely, writing their estimates in 
the form of odds ratios. The Ss made 48
such single-datum estimates with their prior 
odds reset each time to 1:1. 

Following the single-datum trials, Ss ag- 
gregated evidence over trials consisting of 
sequences of four data each. Again E 
chose a population by the toss of a fair 
coin, but this time he sampled a sequence 
of four heights from that population. He 
displayed the heights to Ss cumulatively, one 
at a time, and Ss reestimated the odds about 
the two populations after each datum. 
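
For reference, the optimal (Bayesian) revision over such a sequence adds each datum's log likelihood ratio to the running log odds, starting from log odds of zero (prior odds of 1:1). The sketch below assumes base-10 logs; the function names and the example heights are hypothetical.

```python
import math

# (mean, standard deviation) in inches, as given in the text.
MEN = (68.2, 2.8)
WOMEN = (63.0, 2.5)


def log10_normal_pdf(x: float, mean: float, sd: float) -> float:
    """Base-10 log of the normal density at x."""
    z = (x - mean) / sd
    natural_log = -0.5 * z * z - math.log(sd * math.sqrt(2.0 * math.pi))
    return natural_log / math.log(10.0)


def cumulative_log_odds(heights, prior_log_odds: float = 0.0):
    """Bayesian log odds (men vs. women) after each datum in the sequence."""
    log_odds = prior_log_odds
    trajectory = []
    for height in heights:
        log_odds += log10_normal_pdf(height, *MEN) - log10_normal_pdf(height, *WOMEN)
        trajectory.append(log_odds)
    return trajectory


# Hypothetical four-datum sequence, for illustration only.
example_heights = [67.0, 70.5, 66.0, 69.0]
trajectory = cumulative_log_odds(example_heights)
```

Because the data are conditionally independent, the final value equals the sum of the four single-datum log likelihood ratios.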


Results 


Figure 1 displays the mean log esti- 
mated posterior odds as a function of 
the corresponding Bayesian posterior 
odds for each of the four trials in each 
of the 12 sequences. The 45° line indicates
perfect correspondence; as data
accumulate a Bayesian S would re- 
vise his odds so that his estimate would 
move up and down that line. Binomial 
experiments lead to the expectation 
that Ss would revise their odds along 
a straight line with a slope less than 
1.0. Specifically, extrapolation from 
binomial experiments suggests that the 
slope should be about .33 (Peterson
et al., 1968; Phillips & Edwards,
1966).
Fig. 1. Mean log estimated posterior odds
as a function of log Bayesian posterior odds. 
(The line represents Bayesian performance.) 


Figure 1 shows that Ss move along 
the Bayesian line only in the midrange,
between log Bayesian odds of
-1.0 and +1.0 (odds of 1:10 to 10:1);
thereafter, Ss move along a more 
horizontal line. A product-moment 
correlation of .98 indicates that a sub- 
stantial portion of the variance in esti- 
mated odds can be accounted for by 
a linear function of Bayesian odds. 
The slope of the best fitting straight 
line is .67, much closer to the Bayesian 
slope of 1.0 than the extrapolated slope 
of .33. The average S extracted about 
2/3 rather than only 1/3 of the available
information from the data when mak- 
ing his inferences in this sequential 
task. 
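
The slope and product-moment correlation reported above are ordinary least-squares quantities. A minimal sketch of their computation follows; the function name is hypothetical, and the input points are placeholders lying exactly on a line of slope .5, not data from the experiment.

```python
import math


def slope_and_correlation(bayesian_log_odds, estimated_log_odds):
    """Least-squares slope of estimated on Bayesian log odds, and Pearson r."""
    n = len(bayesian_log_odds)
    mean_x = sum(bayesian_log_odds) / n
    mean_y = sum(estimated_log_odds) / n
    sxx = sum((x - mean_x) ** 2 for x in bayesian_log_odds)
    syy = sum((y - mean_y) ** 2 for y in estimated_log_odds)
    sxy = sum(
        (x - mean_x) * (y - mean_y)
        for x, y in zip(bayesian_log_odds, estimated_log_odds)
    )
    return sxy / sxx, sxy / math.sqrt(sxx * syy)


# Placeholder points, not experimental data.
xs = [-2.0, -1.0, 0.0, 1.0, 2.0]
ys = [0.5 * x for x in xs]
slope, r = slope_and_correlation(xs, ys)
```

A slope of 1.0 corresponds to Bayesian performance; slopes below 1.0 indicate conservatism.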
The results displayed in Fig. 1 can
be examined in more detail by means of 
a trial-by-trial analysis (Peterson & 
Swensson, 1968). Trial-by-trial log 
likelihood ratios were calculated by using
the posterior odds of Trial n - 1 as
the prior odds for Trial n. Equation 1
implies the relevant subtraction. The 
three graphs in Fig. 2 present this 
analysis for single-datum trials, for 
Trial 1 of each sequence, and for Trials 
2, 3, and 4 of each sequence. The data 
points are almost on the Bayesian line 
for single-datum trials and for Trial 1 
of the sequences. But performance de- 
teriorates after Trial 1; responses be- 
come both more conservative and more 
variable. This combination of results 
means that essentially all of the con- 
servatism evident in the cumulative re- 
sponses of Fig. 1 is due only to re- 
visions made after Trial 1. 

The variability in responses after 
Trial 1 suggests the existence of se- 
quence effects. But the inertia effect 
reported by Pitz et al. (1967) did not 
seem to be in operation. The appropriate
analysis required segregating the
trials into either confirming trials
(where the datum on a trial favors the
hypothesis with the higher Bayesian
probability prior to the trial) or infirming
trials (where the datum favors the
alternative hypothesis). The slope of
the best fitting straight line relating
SLLRs to corresponding BLLRs was
.81 for infirming trials but only .50 for
confirming trials. In support of a
linear regression analysis, the correlations
were .97 and .93, respectively.
Both of these correlations are higher
than the correlation of .91 for Trials
2, 3, and 4, indicating that segregation
of the trials into confirming and infirming
decreases the noisiness of the
data. The trials are segregated in the
right-hand portion of Fig. 2, showing
graphically that Ss were more, rather
than less, conservative on confirming
trials. This is the opposite of an inertia
effect.

Fig. 2. Log likelihood ratio as a function of height. (The lines represent Bayesian
performance; the data points are means of Ss’ estimates. In the right-hand graph dots
represent confirming trials and triangles infirming trials.)
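
The segregation of trials can be stated as a sign comparison between the Bayesian log odds held before the trial and the log likelihood ratio of the new datum. The sketch below is one possible formalization; the function name and the "neutral" label for exactly even odds or a nondiagnostic datum are assumptions, not part of the original analysis.

```python
def classify_trial(prior_log_odds: float, datum_llr: float) -> str:
    """Label a trial by whether the datum favors the currently leading hypothesis."""
    if prior_log_odds == 0.0 or datum_llr == 0.0:
        # Assumed convention: no leading hypothesis, or a nondiagnostic datum.
        return "neutral"
    same_direction = (prior_log_odds > 0.0) == (datum_llr > 0.0)
    return "confirming" if same_direction else "infirming"
```

Under this convention Trial 1 of every sequence is neutral, since prior odds are 1:1, which is consistent with the analysis being applied only to Trials 2, 3, and 4.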


Discussion 


Why does the increased complexity of 
the normal distribution lead to less con- 
servatism than was extrapolated from bi- 
nomial experiments? It is tempting to 
leap to the conclusion that Bayes’ theorem 
describes inferential behavior in real life 
situations, and that stimuli used in the 
present experiment are more representa- 
tive of those situations than are abstract 
stimuli, such as balls and urns, used in 
binomial experiments. That conclusion, 
however, is a long step from the data. 
The problem is that the likelihood ratio 
associated with any datum is equal to 
the ratio of the ordinates of the two 
displayed probability distributions at that 
datum. Whenever prior odds are one,
Bayesian posterior odds are equal
to the likelihood ratio. Prior odds were
always one for single-datum trials and
for the first datum in each sequence,
precisely where the performance of Ss
was Bayesian, suggesting the possibility 
that the experiment was measuring graph 
reading rather than inferential perform- 
ance. In order to control for this possi- 
bility, a second experiment deleted the 
display of the graphs. 


EXPERIMENT II


The design of Exp. II was essentially
a replication of Exp. I, except
for deletion of the display of the normal
distributions. The new display
defined only the mean of each popula- 
tion and the indifference point, the 
point at which a height was equally 
likely to be sampled from either popu- 
lation. The display contained no in- 
formation about the ordinates of the 
distributions. 
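
The indifference point is the height at which the two normal densities have equal ordinates. With unequal standard deviations this is a root of a quadratic, and the root lying between the two means is the relevant one. The sketch below uses the population values from Exp. I; the function name is hypothetical.

```python
import math


def indifference_points(m1: float, s1: float, m2: float, s2: float):
    """Heights at which two normal densities are equal.

    Setting the log densities equal gives a*x**2 + b*x + c = 0.
    """
    a = 1.0 / s1**2 - 1.0 / s2**2
    b = -2.0 * (m1 / s1**2 - m2 / s2**2)
    c = m1**2 / s1**2 - m2**2 / s2**2 + 2.0 * math.log(s1 / s2)
    if a == 0.0:
        # Equal standard deviations: a single crossing midway between the means.
        return [-c / b]
    root = math.sqrt(b * b - 4.0 * a * c)
    return sorted([(-b - root) / (2.0 * a), (-b + root) / (2.0 * a)])


points = indifference_points(68.2, 2.8, 63.0, 2.5)
between_means = [x for x in points if 63.0 < x < 68.2]
```

For these parameters the crossing between the means falls at about 65.6 in.; the second root lies far outside the range of plausible heights.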


Results 


Analysis of the performance of the 
25 Ss in Exp. II paralleled the analysis 
used in Exp. I. Figure 3 displays 
mean log estimated odds as a function 
of corresponding Bayesian values. 
Once again, Ss were nearly optimal in 
the midrange and then became increas- 
ingly conservative as their estimates 
became more extreme. The correla- 
tion between estimated and Bayesian 
log odds was .95, with the slope of the 
regression line equal to .47. The Ss 
extracted an average of about 1/2 of the
information latent in the data, as contrasted
with 2/3 when the display was
available in Exp. I. 

The trial-by-trial analysis of Fig. 4 
corresponds to the same analysis in 
the first experiment. The wavy lines 
added to each graph in Fig. 4 repre- 
sent the smoothed, running average of 
the data points for the single-datum
trials. This average shows that Ss
overreacted to data moderately diagnostic
in favor of women and underreacted
to data strongly diagnostic in
favor of men. This bias, measured on
single-datum trials, also occurred on
Trial 1, but the responses became both
more conservative and more variable
on later trials.

Fig. 3. Mean log estimated posterior odds
as a function of log Bayesian posterior odds.
(The line represents Bayesian performance.)

Fig. 4. Log likelihood ratio as a function of height. (The lines represent Bayesian
performance; the data points are means of Ss’ estimates. The wavy lines represent the smoothed
running average calculated from single-datum trials. In the right-hand graph dots represent
confirming trials and triangles infirming trials.)

As in Exp. I, the results show the 
opposite of an inertia effect. Again
the correlation between SLLR and
BLLR supports a linear regression
analysis: .92 for confirming trials and
.95 for disconfirming trials. The respective
slopes are .26 and .64, indicating
considerably more conservatism
for confirming trials. As in Exp. I, 
segregating the trials into confirming 
and disconfirming decreased the noisi- 
ness of the data; the correlation for 
Trials 2, 3, and 4 was only .83. 


Discussion 


Deletion of the display of normal dis- 
tributions led to a slightly biased function 
for single-datum trials and for Trial 1. 
The similarity of the results of the two 
experiments, however, indicates that Ss 
did not simply turn Exp. I into a graph- 
reading task. Since the compositions of 
the populations being sampled are displayed
in most binomial experiments, the
predictions from binomial experiments
are more relevant to Exp. I.

Why was performance with normal dis- 
tributions more nearly optimal than ex- 
pected on the basis of extrapolation from 
previous experiments? This increase in 
optimality occurred in spite of the com- 
plexity of normal data generating pro- 
cesses, where data are not restricted to 
only two values of likelihood ratios but 
may, in theory, take on an infinite num- 
ber of values (for other instances in 
which an increase in complexity led to 
an increase in optimality see Peterson & 
Beach, 1967). One reason for the in- 
crease in optimality in the present experi- 
ment may be that Ss were more familiar 
with the data generating processes; both 
more familiar with the abstract nature 
of normal distributions and also more 
familiar with the particular distributions 
used, the heights of men and women. 

The reversal of the inertia effect in 
these experiments is even more difficult 
to explain. Why should Ss be more con- 
servative in response to confirming data? 
A possible explanation is suggested by 
the curvilinearity of the cumulative func- 
tions displayed in Fig. 1 and 3. This 
explanation assumes that Ss move along 
the curvilinear function when revising 
estimates. In response to infirming data 
they tend to move back through the mid- 
range of the function where their esti- 
mates are nearly optimal; in response to 
confirming data they move on into the
more extreme portion of the function 
where their estimates are conservative. 

The most striking result of these ex- 
periments is the combination of relatively 
good performance on initial trials and 
pronounced conservatism on later trials. 
This result is in agreement with that of 
Peterson and Swensson (1968) and 
shows that the locus of conservatism re- 
sides in the process of combining the 
diagnostic impacts of two or more data. 
Locating conservatism in the aggrega- 
tion process does not, of course, explain 
why people are conservative information 
processors, but it does go a long way 
toward restricting the domain of poten- 
tial explanations. 


SECOND INTERVAL DISCRIMINATION CONDITIONING OF
THE GSR AS A FUNCTION OF UCS INTENSITY AND
TRACE AND DELAY CONDITIONING PARADIGMS 1


ARTHUR R. ZEINER 2
University of Southern California


Unconditional stimulus intensity and trace-delay (.5-sec. and 10-sec. 
CSs) paradigms were varied among 4 groups of college Ss in a GSR 
discrimination conditioning experiment. Interstimulus interval (ISI) 
was 5 sec. for all groups. First interval GSRs (FIRs) occurring 
during the ISI and second interval GSRs (SIRs) occurring in a 
5-sec. post-UCS interval were measured on nonreinforced test and 
extinction trials. Results indicated that strong US groups discrimi- 
nated better than did weak US groups with both response measures; 
no overall trace-delay differences were noted. 


Multiple responses have been ob- 
served in autonomic classical condi- 
tioning experiments in cases where the 
CS-UCS interval is longer than mini- 
mal response latency. Grings, Lock- 
hart, and Dameron (1962) categorized 
responses within a 5-sec. CS-UCS 
interval as first interval responses 
(FIRs) and responses occurring in a
5-sec. post-UCS interval on nonreinforced
test trials and extinction trials
as second interval responses (SIRs).3
They found significant discrimination 
conditioning in mental defectives at an 
earlier acquisition test trial with SIR 
than with FIR measures of learning. 
In a later study, with a college sopho- 
more subject population, Lockhart and 


1 This study is based upon a thesis presented
to the Department of Psychology at
the University of Southern California in 
partial fulfillment of the requirements for 
the MA degree (Zeiner, 1965). Financial 
support from the National Institute of 
Mental Health Grant (M-3916) is acknowl- 
edged. Some of the results were presented 
at the 1964 American Psychological Associa- 
tion Meetings in Los Angeles, California. 

2 Now at the University of California
School of Medicine, San Francisco. 

3 Since this experiment was performed, 
additional experimental evidence and classi- 
fication schemes for SIR have been offered 
by Lockhart (1966) and Prokasy and 
Ebel (1967). 


Grings (1964) found that SIR dis- 
crimination was superior to first re- 
sponse discrimination. Second inter- 
val response was discussed in terms of 
an “offset CR,” response substitution, 
an inhibition of delay phenomenon, 
temporal conditioning, and perceptual 
disparity. None of the potential ex- 
planations discussed was felt to be 
adequate to account for SIRs or SIR 
conditioning. The authors believed 
that “the explanation of second inter- 
val responding will require a reinter- 
pretation of the nature of autonomic 
classical conditioning.” 

The UCS intensity and trace and de- 
lay conditioning paradigms are com- 
monly held to influence both rate and 
amount of classical conditioning. Evi- 
dence from eyelid conditioning experi- 
ments supports the hypothesis that 
amount of conditioning is a function 
of UCS intensity (Spence & Platt, 
1966). The phenomenon, however, is
by no means general. Grings and 
Lockhart (1963) and Wickens, Allen, 
and Hill (1963) found no supportive 
evidence that magnitude of GSR con- 
ditioning is a function of UCS inten- 
sity even though the first requirement, 
that UCR magnitude varies with UCS 
intensity, had been demonstrated by 
Hovland and Riesen (1940).4 Similarly,
studies comparing trace and delay
variation on rate and amount of
classical conditioning are scarce. The 
hypothesis that trace conditioned re- 
sponses are more difficult to establish 
than are delay conditioned responses 
rests mainly on the findings of Rodnick 
(1937) and Pavlov (1927) and their 
co-workers. 

It was the purpose of the present ex- 
periment, therefore, to test with SIR, 
a potentially sensitive measure of con- 
ditioning, that (a) discrimination con- 
ditioning is a function of UCS inten- 
sity, (b) discrimination conditioning 
occurs at an earlier test trial and to 
a greater extent with delay than with 
trace paradigms, and (c) SIR is an 
offset phenomenon or an artifact of 
conditioning to CS offset. 


METHOD 


Subjects —Forty undergraduate students 
from an introductory psychology course at 
the University of Southern California served 
as Ss. Participation in an experiment was 
a class requirement. Each student was paid 
$1.00 for serving in the experiment which 
lasted for less than 1 hr. 

Experimental design—Unconditioned stim- 
ulus intensity and trace and delay discrimina- 
tion conditioning paradigms comprised the 
experimental variation in a 2 X 2 factorial
design. Eighteen women and 22 men were
assigned to form four independent subgroups
of 10 Ss. Conditioning, a within-Ss variable,
was inferred from differential responding
to a CS followed by shock and to a CS
not followed by shock on nonreinforced ac- 
quisition test trials and extinction trials. 

Apparatus—The GSR was obtained as a 
dc conductance change between two 4 X § in.
contoured sheet silver electrodes taped to the 
first and third fingers of S’s left hand. The 
GSR leads were connected by a Darrow-type 
Wheatstone bridge input to an Offner Model 
542 Dynograph. Current through S’s


4 It should be noted that Dykman and
Gantt (1959) and Dykman, Gantt, and 
Whitehorn (1956), using heart rate in dogs 
as the index, demonstrated that both CR 
and UCR magnitude varied with UCS in- 
tensity. 


fingers was 45 µa. Paper speed was 5
mm/sec. The UCS was a .5-sec. dc shock
from a Grass S-5 stimulator. Shocks were 
delivered to the volar surface of the right 
forearm by two 4X Ẹ in. silver electrodes 
14 in. apart, center to center. Both skin and 
electrodes were cleaned with rubbing alcohol. 
Conditioned stimuli were red and green §-in. 
diameter lights matched on brightness and 
image size. Trace CS duration was .5 sec. 
and delay CS duration was 10 sec. For 
both trace and delay conditions CS-UCS 
interval was 5 sec. Thus the CS went off
4.5 sec. before UCS onset for the trace
group and stayed on 5 sec. beyond UCS 
onset for the delay group. The overlapping 
delay paradigm was specifically chosen to 
test a hypothesis about second interval re- 
sponding. Temporal relations between CS 
and UCS were controlled by three Hunter 
111A timers. The timers also controlled 
two event markers on the recorder which
marked CS and UCS onsets and offsets. In- 
tertrial intervals of 30, 40, and 50 sec. 
were randomized by a table of random 
numbers. 

Procedure—The experiment was con- 
ducted in a soundproof room of the Univer- 
sity of Southern California laboratory. The 
Ss were seated at a table 30 in. in front of 
a milkglass screen which was inserted into 
an enclosure separating E and recording 
equipment from S. Timing and auxiliary 
equipment were located in a separate room. 
Ishihara color charts were administered to 
determine whether or not Ss had normal 
color vision. Two Ss had to be replaced 
because of color blindness. 

The Ss read instructions about the gen- 
eral nature of the GSR and the shock work 
up while E calibrated the equipment. Shock 
levels for each S were individually deter- 
mined by a shock work-up procedure. Un- 
conditioned stimulus intensity was brought 
up in 2-3-v. steps. The S indicated when he 
first felt the shock and when the shock 
reached a level of discomfort. Weak UCS 
was defined as a shock which was 5 v. above 
threshold of awareness. Strong UCS was 
defined as a shock which was 5 v. above 
level of discomfort. Mean voltage levels 
for weak UCS trace and delay groups were 
17.6 v. and 21.2 v., respectively. The difference
in mean voltage level between trace
and delay groups was not significant. How- 
ever, the difference in mean voltage level 
between weak and strong UCS groups was 
significant, t (39) = 3.91, p < .01.

After the shock work-up all Ss received
10 adaptation trials (5 trials on each CS).
The acquisition series consisted of 40 stimu- 
lus presentations (20 presentations of each 
of two CSs). Of 20 CS+ trials, 6 were 
nonreinforced test trials which occurred 
after 2, 5, 7, 10, 11, and 13 CS-UCS pairings. 
Six out of 20 CS— presentations were desig- 
nated as control trials. Control trials oc- 
curred an equal number of times before and 
after test trials. Each S also received 10 
extinction trials (5 trials on each CS). 


RESULTS 


Although the experiment was spe- 
cifically concerned with UCS intensity 
and trace and delay variation on SIRs, 
the experimental hypotheses were eval- 
uated in terms of both FIR and SIR. 
The first interval GSR was defined as 
a decrease in resistance of 100 ohms 
or more within a period of 1-5 sec.
from CS onset. Second interval GSR 
was defined as a decrease in resistance 
of 100 ohms or more within a 5-sec. 
period measured from the point where 
UCS onset would have occurred if it 
had been presented, i.e., within the 
time interval 5-10 sec. after CS onset. 
For both FIR and SIR the base level 
was measured from respective response 
onsets. Unconditioned response mag- 
nitudes on six acquisition trials pre- 
ceding test trials were also measured 
from response onset within a 5-sec. 
interval from UCS onset. 
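
The two scoring windows can be summarized as a simple classification rule. The sketch below is an illustration, not the original scoring procedure: the function name is hypothetical, it assumes that a response is assigned to a window by the latency of its onset from CS onset, and it assumes that a response beginning exactly at 5 sec. counts as a second interval response.

```python
def score_response(onset_latency_sec: float, resistance_drop_ohms: float) -> str:
    """Classify a GSR on a nonreinforced trial as FIR, SIR, or neither."""
    if resistance_drop_ohms < 100.0:
        return "none"  # below the 100-ohm criterion stated in the text
    if 1.0 <= onset_latency_sec < 5.0:
        return "FIR"   # 1-5 sec. from CS onset
    if 5.0 <= onset_latency_sec <= 10.0:
        return "SIR"   # 5-10 sec. from CS onset (boundary at 5 sec. is assumed)
    return "none"
```

Because the two windows do not overlap, a single trial can contribute both an FIR and an SIR if two separate responses meet the criterion.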

Unconditioned response—Table 1 
presents mean UCR magnitudes based 
on six acquisition trials preceding test 
trials for strong and weak UCS intensity
groups. An analysis of variance
performed on these data indicates that
the mean difference in magnitude of
UCR between strong and weak UCS
intensity groups was highly significant,
F (1, 36) = 19.18, p < .001. Neither
the trace-delay variable, nor trials, nor any
other effect approached significance.


TABLE 1

MEAN UCR MAGNITUDE ON SIX ACQUISITION
TRIALS PRECEDING TEST TRIALS FOR
STRONG AND WEAK UCS INTENSITY GROUPS

                              Trials
UCS Intensity     1      2      3      4      5      6
Weak            1.25   1.36   1.11   1.43   1.63   1.04
Strong          2.13   2.23   2.30   2.21   2.22   2.21

First interval response—Figure 1 
shows mean magnitude of first interval 
GSRs to test and control CSs over 
adaptation, acquisition, and extinction 
phases for strong and weak UCS in- 
tensity groups. Trials along the ab- 
scissa are in blocks of two except for 
the first trials on adaptation and ex- 
tinction. Response magnitude along 
the ordinate is expressed as square root 
of conductance change in micromhos. 
Trace and delay groups did not differ 
significantly and were therefore com- 
bined. 
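
As an illustration of the response measure, the sketch below converts a pair of resistance readings to conductance in micromhos and takes the square root of the change. The function name and the example readings are hypothetical, and treating a conductance decrease as zero response is an assumption made here rather than a rule stated in the text.

```python
import math


def sqrt_conductance_change(r_before_ohms: float, r_after_ohms: float) -> float:
    """Square root of the conductance increase, in micromhos."""
    g_before = 1e6 / r_before_ohms  # micromhos = 1,000,000 / ohms
    g_after = 1e6 / r_after_ohms
    change = g_after - g_before
    # Assumed convention: a decrease in conductance is scored as no response.
    return math.sqrt(max(change, 0.0))


# Hypothetical readings: 100,000 ohms falling to 50,000 ohms.
example = sqrt_conductance_change(100_000.0, 50_000.0)
```

The square-root transformation compresses large responses, which is the usual reason for applying it before an analysis of variance.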

As is evident from Fig. 1, a signifi- 
cant UCS intensity effect was found 
during acquisition, F (1, 36) = 7.17, 
p <.05, with the strong UCS group 
responding to CSs with larger GSRs 
than the weak UCS group. Overall 
discrimination conditioning, based on 
the mean difference in magnitude of 
GSR between six test-control trials, 
was highly significant, F (1, 36) = 
29.59, p<.001. A significant UCS 
Intensity X Discrimination interaction 
indicated that the strong UCS group 
discriminated better than did the weak 
UCS group, F (1, 36) = 4.50, p < .05.
Neither the hypothesis that delay
conditioning leads to better discrimination
than does trace conditioning nor
the hypothesis that delay paradigms
lead to earlier evidence of discrimination
than do trace paradigms was supported.
The same effects were significant
with extinction as with acquisition
data.

Fig. 1. Mean magnitude of first interval
CRs over adaptation, acquisition, and extinction
phases of experiment to test and control
CSs for strong and weak UCS intensity
groups (N = 20/group).

Second interval response.—Figure 2 
presents the absence, development, and 
extinction of discrimination between 
test and control CSs over adaptation, 
acquisition, and extinction phases of 
the experiment. Trials along the ab- 
scissa are in blocks of two except for 
the first trials on adaptation and extinction.
Square root of conductance
change in micromhos is represented 
along the ordinate. Strong and weak 
UCS data are plotted separately. 
Trace and delay variation did not dem- 
onstrate differential effects and the 
data were therefore combined. 

The UCS intensity effect during ac- 
quisition was significant, F (1, 36) = 
8.24, p < .01, with larger CRs being 
obtained to both test and control CSs 
from the strong than from the weak 
UCS intensity group. Discrimination 
conditioning, as measured by the mean 
difference in magnitude of SIR based 
on six test-control CSs, was highly sig- 
nificant, F (1, 36) = 25.66, p < .001. 
The UCS Intensity X Discrimination
interaction, indicating that the strong
UCS group discriminated better than
did the weak UCS intensity group,
was also significant, F (1, 36) = 5.41,
p < .05. The t tests, for each trial
separately, supported the hypothesis
that evidence of discrimination conditioning
occurs at an earlier acquisition
test trial with delay than with trace
paradigms. Both the UCS intensity
effect and the discrimination conditioning
effect remained significant in
the extinction phase of the experiment.
The UCS Intensity X Discrimination
interaction, however, failed to reach
significance.

Fig. 2. Mean magnitude of second interval
CRs over adaptation, acquisition, and
extinction phases of experiment to test-control
CSs for strong and weak UCS intensity
groups (N = 20/group).


Discussion 


Experimental evidence was presented 
that strong UCSs produce significantly 
larger UCRs than do weak UCSs. Fur- 
ther, it was clearly demonstrated with 
both first and second interval conditioned 
response measures that CR magnitude is 
a function of UCS intensity. 

Trace and delay conditioning param- 
eters did not have differential effects 
on overall discrimination conditioning 
with either FIR or SIR measures of con- 
ditioning. Although there was some in- 
dication of earlier discrimination with de- 
lay than with trace paradigms, the effect 
was not very strong and was only evident 
with the SIR measure of conditioning. 
The difference with respect to trace and 
delay paradigms between these results 
and those of Pavlov (1927) and Rodnick 
(1937) might, in part, be due to the 
fact that Pavlov used dogs and Rodnick 
used a latency criterion, whereas college 
students and a magnitude criterion were 
used in the present experiment. 

Results of the present experiment 
demonstrate that contiguity between CS 
offset and UCS onset is not a necessary 
condition for SIR. Second interval dis- 
crimination conditioning was demon- 
strated both in the presence of an ongoing 
CS (delay paradigm CS overlapping by 
5 sec. the UCS presentation point) and 
in the absence of an external CS (trace 
paradigm CS terminating 4.5 sec. before 
the UCS presentation point). Thus, no 
external stimulus change was provided 
which would aid S in locating SIR in
time. 

Although SIR meets the criteria of a 
CR, present theories of classical conditioning
appear inadequate to explain SIR or
SIR conditioning. Pavlov (1927) held 
that stimulus substitution takes place as 
a result of CS-UCS pairings. The CS 
takes on properties of the UCS and comes 
to elicit the UCR or some part of it. 
Such a view may be adequate to explain 
the first interval response in cases where 
the original response to the CS is similar 
to the response that is to be conditioned, 
as is the case with GSR. It still does 
not explain why an SIR should occur. 
An “inhibition of delay” explanation is 
inadequate because FIR and SIR can 
both occur together or they can occur 
singly on any given test trial. Therefore, 
it cannot be the case that FIR moves 
back to the point of reinforcement as is 
required by an inhibition of delay ex- 
planation. Time estimation by Ss as an 
explanation for SIR is not ruled out by 
the present experiment. 

Present data fit a combination of pre- 
paratory notions and a modified stimulus 
substitution view. By this view, FIR(s) 
is mediated by learned fear, whereas SIR 
comes about through stimulus substitu- 
tion. A strict interpretation of stimulus 
substitution requires not only that the CS 
should elicit the UCR or some part of 
it, but also that it should do so at the 
right time. That is, CS onset plus a
temporal interval equal to the CS-UCS 
interval should elicit the CR. Such a 
state of affairs is found with the SIR. 

The author felt that results illustrated 
in this experiment may be specific to the 
GSR. However, since this experiment 
was performed, SIRs for a variety of 
autonomic responses (salivation, heart 
rate, pupil dilation) have been reported. 
The new findings support earlier GSR 
evidence and results of the present experi- 
ment. 


REFERENCES 


Dyxman, R. A, & Gantt, W. H. Para- 
sympathetic component of unlearned and 
acquired cardiac responses. Journal of 


Comparative and Physiological Psychol- 
ogy, 1959, 52, 163-167. 

Dyxmany, R. A. Gantt, W. H., & Wur- 
HORN, J. C. Conditioning as emotional 
sensitization and differentiation. Psycho- 
logical Monographs, 1956, 70(15, Whole 
No. 422). 

Grincs, W. W., & Locknart, R. A. Ef- 
fects of “anxiety lessening” instructions 
and differential set development on the 
extinction of GSR. Journal of Experi- 
mental Psychology, 1963, 66, 292-299. 

Grincs, W. W., LocKHART, R. A., & DAM- 
ERON, L. R. Conditioning autonomic re- 
sponses of mentally subnormal individuals. 
Psychological Monographs, 1962, 76(39, 
Whole No. 558). 

Hovrann, C. I„ & Riesen, A. H. Magni- 
tude of galvanic and vasomotor response 
as a function of stimulus intensity. Jour- 
nal of General Psychology, 1940, 23, 103- 
121. 

LocKHaRrT, R. `A. Comments regarding 
multiple response phenomena in long in- 
terstimulus interval conditioning. Psycho- 
physiology, 1966, 3, 108-114. 

Locxnart, R. A., & Grincs, W. W. Inter- 
stimulus effects in GSR discrimination 
conditioning. Journal of Experimental 
Psychology, 1964, 67, 209-214. 

Pavrov, I. P. Conditioned reflexes. New 
York: Dover Publications, 1927. 

Prokasy, W. F., & Ebel, H. C. Three components
of the classically conditioned
GSR in human subjects. Journal of
Experimental Psychology, 1967, 73, 247-256.

Rodnick, E. H. Characteristics of delayed
and trace conditioned responses. Journal
of Experimental Psychology, 1937, 20,
409-425.

Spence, K. W., & Platt, J. R. UCS intensity
and performance in eyelid conditioning.
Psychological Bulletin, 1966,
65, 1-10.

Wickens, D. D., Allen, C. K., & Hill,
F. A. Effects of instructions and UCS
strength on extinction of the conditioned
GSR. Journal of Experimental Psychology,
1963, 66, 235-240.

Zeiner, A. R. Second interval discrimination
conditioning of the GSR as a function
of US intensity and trace and delay
conditioning paradigms. Unpublished
master’s thesis, University of Southern
California, 1965.


(Received September 13, 1967) 




Journal of Experimental Psychology 
1968, Vol. 78, No. 2, 281-284 


RUNWAY PERFORMANCE AND COMPETING RESPONSES 
AS FUNCTIONS OF DRIVE LEVEL AND METHOD 
OF DRIVE MEASUREMENT¹


JOHN J. PORTER, HARRY L. MADISON, AND PETER C. SENKOWSKI


University of Wisconsin, Milwaukee 


Estes’ (1958) S-R drive theory was tested in a segmented runway 
which permitted objective competing response measures. Rats were 
run on a 3-day-on, 2-day-off schedule. One group remained on 0-, 6-, 
or 20-hr. food deprivation for 3 consecutive days; another was 
changed to a new 0-, 6-, or 20-hr. deprivation on each of 3 days; a 3rd 
group remained at 100%, 95%, or 90% body weight for 3 days. For 
all groups, competing responses decreased with increased drive, as 
predicted. However, start and running speeds increased with drive 
whether or not competing response trials or competing response times 
were subtracted, failing to confirm the theory. The distribution of 
competing responses over the runway length was a U-shaped function. 


Estes’ (1958)  stimulus-response 
theory of drive implies that increases 
in asymptotic instrumental perform- 
ance as a function of increases in drive 
are due to decreased effects of the ex- 
traneous cues present on higher drive 
trials. The extraneous cues should 
cause competing responses and there 
should be fewer competing responses 
at high drive. Cotton (1953), using 
daily changes in hours of food depriva- 
tion, found that differences in running 
speed as a function of drive level were 
greatly reduced when trials on which 
competing responses were made were 
removed from his data, thus supporting 
Estes’ theory. Cicala (1961) found a 
similar tendency for convergence of 
running speeds at different drive levels 
when competing response trials were 
removed, using daily changes in hours 
of food deprivation, but not when the 
same drive level was maintained over 
several days. 


1 Supported by grants from the University 
of Wisconsin Graduate School, and Na- 
tional Science Foundation Undergraduate 
Research Grant GE-6131. Part of this 
paper was read at the American Psychological
Association meeting, Washington, D. C.,


The present study compared per- 
formance at different drive levels, with 
and without competing response trials, 
as well as with and without the running 
time occupied by competing responses. 
The latter measure was made possible 
by recording competing responses via 
a segmented runway which permitted 
an objective measure of pauses, retrac- 
ing, and other competing responses. 
In light of the different results ob- 
tained by Cicala when using varied 
and constant drive, three procedures 
for determining drive level were used: 
daily changes in hours of deprivation, 
hours of deprivation maintained over 
several days, and percentage of ad-lib 
body weight controlled by constant- 
weight cages. 


METHOD 


Subjects and apparatus—Three groups of 
eight male albino rats, 90 days old, were 
used. The runway was 7½ ft. long, including
a 1-ft. start box and a 1½-ft. goal box, and
was 6 in. high throughout. The start box
and runway proper were 3 in. wide, and the
goal box was 4½ in. wide. The runway
proper consisted of 20 3-in. square segments,
each pivoted so that S’s weight closed a 
microswitch, actuating one of the pens of 
an Esterline-Angus event recorder, thus 
recording the time spent on each runway
segment. A photocell mounted 1 ft. from 
the dropping start door and another located 
14 in. from the rear wall of the goal box 
above the food receptacle measured start 
time and running time, respectively. One 
group of Ss (Group 3) was housed in individual
constant-weight cages² which permitted
maintenance of Ss at predetermined
weights within ±2 gm., about .5% of their
ad-lib weights. The other groups were also 
housed in such cages, but were fed on a con- 
ventional hours-of-deprivation schedule. 

Design—All Ss were run in alternating 
cycles consisting of 3 days running followed 
by 2 days off, in order to permit changes in 
weight or deprivation levels during the off 
time. Group 1 Ss were run on 3 consecutive 
days at either 0-, 6-, or 20-hr. food depri- 
vation. The Group 2 Ss’ deprivation levels 
of 0, 6, or 20 hr. were changed daily at random,
with the restriction that each level
occur once in a running cycle, a procedure 
similar to that of Cotton (1953). Group 3 
Ss were run for 3 consecutive days at 100%, 
95%, or 90% of ad-lib body weight. In 
order to control for the effect of growth 
upon body weight, percentage of body weight 
was based upon S’s body weight during the 
most recent 100% running cycle. Body 
weight after 2 days of ad-lib feeding defined 
100% body weight. Thus, comparison of 
drive levels was within Ss, and comparison 
of drive measures was between Ss.

Procedure—For the first 9 days all Ss 
were food deprived, during which levels of 
deprivation were varied according to the 
schedule above. On Days 10-13 all Ss were 
placed on their greatest deprivation level. 
On Days 10-11 each S was given five goal
box placements daily of 5 min. each, or until 
S ate two 97-mg. Noyes pellets. On Days 
12-13 each S was given five rewarded trials 
daily through the entire runway with the 
start-box door open. On Days 14-15 Ss in 
Group 1 were held at the deprivation level 
on which they would run during the next 3- 
day running cycle; Group 2 Ss were given 
2 different hours of deprivation on the 2 
days; and Group 3 Ss were stabilized at a 
new percentage of body weight. 

On Day 16 the experiment began. Six 
trials were given on Day 16, 8 on Day 17, 
and 10 on Day 18 and every running day 
thereafter. If S had not reached the goal 
box photocell within 3 min. after release 
from the start box, he was removed and his 
daily mean running time was recorded for
that trial. This had to be done only five 
times throughout the entire experiment. On 
the tenth day of acquisition and thereafter, 
competing responses were measured. A 
competing response was defined as a retrace 
in the runway proper, or more than 1 sec. 
spent on any one runway segment. The 
cycle of 3 days running followed by 2 days 
off was repeated until each S had received 
294 training trials, 98 at each of three drive 
levels. 
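
The competing-response criterion described above can be sketched in code. This is a minimal illustration, not the authors' scoring procedure: it assumes each trial is available as an ordered list of (segment index, dwell time) visits, which is a hypothetical representation of the event-recorder trace, and it assumes a retrace is scored on the segment that S moved back onto. All function and variable names are invented for the example.

```python
from typing import List, Tuple

# Hypothetical representation (an assumption): one (segment_index,
# dwell_seconds) pair per visit, in the order S entered the segments,
# with index 0 nearest the start box.
Visit = Tuple[int, float]

PAUSE_CRITERION_SEC = 1.0  # "more than 1 sec." on any one segment (from the text)


def competing_responses(visits: List[Visit]) -> List[int]:
    """Return the segment indices on which a competing response is scored.

    A competing response is a retrace (entering a segment with a lower
    index than the one just left) or a dwell of more than 1 sec.
    """
    flagged = []
    previous = -1
    for segment, dwell in visits:
        retrace = segment < previous
        pause = dwell > PAUSE_CRITERION_SEC
        if retrace or pause:
            flagged.append(segment)
        previous = segment
    return flagged


def speed(time_sec: float) -> float:
    """Reciprocal time, the speed measure used for start and running times."""
    return 1.0 / time_sec
```

A trial with an empty result would count as free of competing responses; how the time occupied by competing responses was subtracted is not specified in the text, so it is left out of the sketch.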


RESULTS AND DISCUSSION


Performance data over the last 60 
trials under each drive level, during 
which there was little change in per- 
formance, were analyzed. There was 
a significant reduction in number of 
competing responses with increased 
drive, F (2, 42) = 16.22, p < .001, as 
implied by Estes’ theory. As shown 
in Fig. 1, mean reciprocal running time 
(running speed) for the three groups 
increased as a function of drive level, 
F (2, 42) = 42.7, p< .001. When 
competing response trials were re- 
moved from the data the effect of drive 
level remained significant, F (2, 42) = 
31.2, p < .001, as it also did when competing
response time was removed, F
(2, 42) = 39.0, p < .001. Removal of
neither competing response trials nor
competing response time appreciably
reduced running speed differences as
a function of drive level.

FIG. 1. Running speed as a function of
drive level, with and without competing
response (CR) trials and competing response
time, for two hours-of-deprivation groups
and one percentage ad-lib body weight group.

Analysis of start speed (1/start 
time) yielded similar results. Start 
speed increased significantly with in- 
creases in drive, F (2,42) = 40.8, p< 
.001. When competing response trials 
were removed the effect remained sig- 
nificant, F (2, 42) = 34.1, p < .001. 
Similarly, removing competing re- 
sponse time did not affect the positive 
relationship between drive and starting 
speed, F (2,42) = 39.6, p < .001. Re- 
moval of neither competing response 
trials nor competing response time ap- 
preciably reduced start speed differ- 
ences at the various drive levels. 

Champion (1967), by eliciting con- 
trolled competing responses with an 
additional alley in a choice situation, 
reported a larger depressing effect of 
competing responses upon running 
speed at high than at low deprivation. 
His Ss were given preliminary train- 
ing on the competing response at both 
high and low deprivation, and thus his 
competing responses were quite differ- 
ent from the uncontrolled or “spon- 
taneous” ones observed by Cotton, 
Cicala, and the present study. It may 
be that competing responses which have 
been learned at both high and low 
drive levels are more effective in de- 
pressing performance at the higher 
levels, whether they are controlled or 
spontaneous. This will require fur- 
ther investigation, with the interesting 
problem of trying to insure that spon- 
taneous competing responses, involv- 
ing behaviors practiced throughout the 
life of the organism, are equally well 
learned at all drive levels. 

Comparison of the three methods of 
measuring drive in the present study 
showed that the groups differed in running
speed, F (2, 21) = 3.48, p < .05,
with competing response trials and 
competing response time included, and 
a little less when either competing re- 
sponse trials or competing response 
time were removed, F (2, 21) = 3.36 
and 3.27, respectively, p slightly 
greater than .05 in both cases. These 
differences were due mainly to the dif- 
ferences between the body weight 
group (Group 3) and the hours-of- 
deprivation groups (Groups 1 and 2), 
as is apparent from Fig. 1. Of par- 
ticular interest is the large difference 
in running speed between the 100% 
body weight condition for Group 3 and 
the 0 hr. of deprivation condition for 
Groups 1 and 2. While there is no 
reason to suppose that 6 and 20 hr. 
of food deprivation should be equiva- 
lent to 95% and 90% of ad-lib body 
weight, clearly a minimum drive state, 
as produced by both 0-hr. deprivation 
and 100% body weight, should pro- 
duce similar performance levels. How- 
ever, Ss at 100% body weight were 
significantly faster for both start speed, 
F (2, 21) = 24.5, p < .001, and run- 
ning speed, F (2,21) = 37.7, p < .001, 
than Ss on 0-hr. deprivation. Appar- 
ently, the 2 days allowed for the body 
weight Ss for ad-lib feeding were not 
sufficient to return them to predepriva- 
tion drive levels. 

Also of interest is the pattern of 
competing responses over successive 
runway segments shown in Fig. 2. 
The 20 runway segments were divided 
into four successive blocks of 5 seg- 
ments for this analysis. Block 1 is 
nearest the start box, and Block 4 
nearest the goal box. For ease of pre- 
sentation, the three methods of drive 
measurement were combined in Fig. 
2, but the same pattern of competing 
responses over runway blocks was ob- 
served regardless of the method of 
drive measurement, with the Method ×
Drive Level × Runway Block interaction
quite small, F (12, 126) = .65,
p > .2, and the effect of method upon
the number of competing responses
also not significant, F (2, 21) = 1.54,
p > .2. Overall, the progressive flattening
of the curves of Fig. 2 as drive
level increased was reflected in a significant
Runway Block × Drive Level
interaction, F (6, 126) = 13.61, p <
.001. The change in competing responses
over successive blocks was significant,
F (3, 63) = 27.77, p < .001,
and led to the use of the Newman-Keuls
test for repeated measures (cf.
Winer, 1962, pp. 309 and 319) to analyze
the frequency of competing responses
in successive blocks. These
decreased significantly from Blocks 1
to 2, and from 2 to 3, and increased
significantly from Blocks 3 to 4, p <
.001 in all cases. Unfortunately, the
runway was not segmented in the goal
area; thus it was not possible to tell
if the upward trend in competing responses
apparent in Fig. 2 for the
fourth runway block continued into the
goal box. These results are in agreement
with past studies of the goal gradient
in the runway (cf. Weiss, 1960),
with the characteristic slowing down
near the goal area evidenced in the
form of an increase in competing responses
in the present study.

FIG. 2. Competing responses as a function
of blocks of five runway segments, with
drive level as the parameter. (0, 6, and
20 are hours of deprivation; 100, 95, and 90
are percentage of body weights.)
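
The grouping of the 20 runway segments into four successive blocks of 5 can be illustrated with a short sketch. It assumes, hypothetically, that competing responses have already been reduced to a list of zero-based segment indices (0 nearest the start box); the function name is invented for the example.

```python
from collections import Counter
from typing import Iterable, List

SEGMENTS_PER_BLOCK = 5  # 20 segments divided into 4 blocks (from the text)
N_BLOCKS = 4


def block_counts(flagged_segments: Iterable[int]) -> List[int]:
    """Count competing responses per runway block.

    Segment indices are assumed to run 0-19, so integer division by 5
    maps each segment to Block 1 (index 0) through Block 4 (index 3).
    """
    counts = Counter(seg // SEGMENTS_PER_BLOCK for seg in flagged_segments)
    return [counts.get(block, 0) for block in range(N_BLOCKS)]
```

A U-shaped distribution of the kind reported would show up as high counts in the first and last entries of the returned list and lower counts in the middle two.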

In summary, these results provide 
no evidence that removal from the data 
of trials on which competing responses 
occur, or removal of the time occupied 
by competing responses, has any ap- 
preciable tendency to equalize differ- 
ences in asymptotic performance under 
different drive levels, whether drive is 
measured by hours of food deprivation 
or by percentage of ad-lib body weight. 
Clearly, the authors’ criteria for com- 
peting responses, while objective, were 
of necessity somewhat arbitrary and 
gross. To the extent that removal of 
such experimentally defined competing 
responses from the data is equivalent 
to removing the effects of extraneous 
cues, these results argue against Estes’ 
stimulus-response theory of drive. 


60 undergraduate students learned a list of 8 paired associates in a
factorial design with 4 pairs composed of short-latency associative
reaction time (RT) response terms and 4 pairs with long-latency RT
terms. 2 of the 4 response terms of each group were low-meaningfulness
trigrams and 2 were high-meaningfulness trigrams. The list was
learned by 3 groups of Ss under 2-, 4-, or 8-sec. presentation rate.
Consonant with the predictions of the study, the trigrams of short RT
were learned faster than the trigrams of long RT and the effect of RT
was most pronounced under the shortest presentation rate. Fewer
trials were required to learn high-meaningfulness trigrams than
low-meaningfulness trigrams and the learning rate was faster the longer
the presentation rate but total time to learn did not vary with
presentation rate. Significant interactions suggested that the effect of RT was
limited to low-meaningfulness trigrams.


Three general approaches have been 
used in the measurement of meaningfulness
of verbal units: (a) measures
based on associations produced by Ss 
in response to verbal units as stimuli 
(Glaze, 1928; Hull, 1933; Mandler, 
1955; Noble, 1952; Witmer, 1935), 
(b) measures based on rating scale 
scores of verbal units (Noble, Stock- 
well, & Pryer, 1957), and (c) mea- 
sures based on counts of frequency of 
occurrence of words, syllables, and let- 
ters (Thorndike & Lorge, 1944). The 
various scales of meaningfulness gen- 
erated by these different methods have 
been found to be highly correlated 
(Underwood & Schulz, 1960) and 
paired-associate learning experiments 
have shown a marked positive relation- 
ship between learning rate and mean- 
ingfulness as determined by several of 
these different methods (Carroll &
Burke, 1961; Kimble & Dufort, 1955;
Mandler & Huttenlocher, 1956).

1 This research was supported in part by a 
State University of New York Research 
Foundation Faculty Fellowship. 

A variable which has received little
attention in studies designed to evalu- 
ate the meaningfulness of verbal units 
is associative reaction time (RT), the 
time elapsing between the presentation 
of a verbal unit and the first associa- 
tion produced. Beck, Phillips, and 
Bloodsworth (1962) measured the 
RTs for each of Hull’s 320 CVCs and 
found significant correlations between 
RT and two indexes of meaningfulness:
Archer’s association values (r = -.19)
and Trapp and Kausler’s association
values (r = -.23). Although
these correlations are small it would 
seem that RT should be an important 
variable since in PA learning studies 
presentation rate is of a relatively brief 
duration—typically 2-8 sec. On the 
assumption that the positive relation- 
ship between meaningfulness and 
learning rate is to some extent a func- 
tion of the frequency of associations 
elicited by the verbal units, stimulus 
and response terms which produce as- 
sociations rapidly (short-latency RT 
terms) should be learned faster than 
terms which produce associations 
slowly (long-latency RT terms),
regardless of the number of associations
the terms might produce if latency 
were not a factor. If learning rate is 
a function of RT, it follows that the 
effect of RT should be more marked 
under learning conditions which employ
a brief presentation rate (e.g.,
2 sec.) since a relatively long presen- 
tation rate (e.g., 8 sec.) might provide 
adequate time for units with long-la- 
tency RTs to produce associations and 
thus increase the probability of their 
being learned. 

It was the purpose of the present 
study to determine the effect of RT on 
the rate of learning paired associates 
under three different presentation rates 
and two levels of meaningfulness.
Specifically, it was predicted that in a 
PA-learning task, in which response 
terms (CVC trigrams) are equated on 
the basis of at least one of each of the 
three different measures of meaning- 
fulness, learning rate will be a func- 
tion of RT, i.e., trigrams with short- 
latency RTs will be learned faster than 
trigrams with long-latency RTs, and 
the difference between the learning 
rates will be greater, the shorter the 
presentation rate.


METHOD 


Phase 1—In this phase of the experi- 
ment a random sample of 60 CVC trigrams 
was selected from the 840 trigrams that 
constitute the lower and upper 20% of 
Noble’s (1961) m’ scale, 30 trigrams from 
each end of the distribution; and the RT of 
each trigram was determined. Each trigram
was typed in capital letters on a 3 × 5
in. white card. A deck composed of these 
cards was constructed with the order of the 
cards randomly determined. Thirty under- 
graduate volunteers (16 females and 14 
males) from the State University of New 
York at Albany served as Ss in the scaling 
procedure. 

In the RT scaling procedure, each S
was individually given a standard set of 
instructions which informed him that he 
would be presented with a series of cards 
each containing a CVC trigram (some
words and some nonwords); that he should
attend to these trigrams and respond ver- 
bally as quickly as possible with the first 
word or words which come to his mind; 
and that he should continue responding until 
instructed to stop and prepare for the next 
card to be presented. He was further in- 
formed that the signals to stop and prepare 
for the next card would be presented on a 
tape recording. The tape was programmed 
to instruct S to respond at the presenta- 
tion of each card, 30 sec. after which the 
tape instructed S to terminate his associa- 
tions to the trigrams and to await the 
presentation of the next card, 3 sec. after 
which the next card was presented. The 
E, who was seated opposite S, manually 
presented each card and simultaneously 
started a stop clock at the direction of a 
signal from the tape. The clock was 
stopped at the first verbalized association 
to the trigram made by S and the associa- 
tive reaction time (RT) to the nearest .01 
of a sec. was recorded by E. The E also 
recorded the first association per se and 
tallied the number of additional associations 
made by S during the 30-sec. period. 

Means and variances of RT latencies for 
all Ss combined, on each of the 60 tri- 
grams, were calculated and entered in a 
table of either low-m’ trigrams or high-m’
trigrams together with each trigram’s cor- 
responding mean number of associations pro- 
duced, scores on Noble’s m’, a’, and a scales, 
mean rated pronunciability, and total log- 
frequency values. Since a complete table of 
pronunciability ratings of CVC trigrams 
was not available, the ratings were obtained 
from a separate sample of 30 male and fe- 
male undergraduates at the State University 
of New York at Albany following those 
scaling procedures employed by Underwood 
and Schulz (1960), i.e., Ss were required to
rate the ease of pronunciability of each tri- 
gram along a continuum from 1 (greatest 
ease of pronunciability) to 9 (hardest pro- 
nunciability). 

Using mean RT as the primary criterion, 
four trigrams were selected from the high- 
meaningfulness group—two from the short- 
latency end of the RT distribution (GEN 
and HEL) and two from the long-latency 
end of the RT distribution (RUG and 
TON). An attempt was made to equate all 
four trigrams as closely as possible on 
the mean number of associations produced, 
the Noble values, and the mean rated pronunciability.
The same procedure was employed
in obtaining two short-latency RT
trigrams (LIW and YAZ) and two long-latency
RT trigrams (GUH and KOJ) from
the low-meaningfulness sample. This
method produced four groups of two trigrams
each: (a) short RT high meaningfulness
(SL-HM): HEL and GEN, (b) long
RT high meaningfulness (LL-HM): RUG
and TON, (c) short RT low meaningfulness
(SL-LM): LIW and YAZ, and (d) long
RT low meaningfulness (LL-LM): GUH
and KOJ. Although it was not possible to
precisely equate the four trigrams at each of
the two levels of meaningfulness, it can be
seen from Table 1, which lists the trigrams
with their corresponding RT values,
meaningfulness values, association values,
pronunciability ratings, and log-frequency
values, that the means of the RT subgroups on
the control variables (the measures other
than RT) correspond quite closely. Except
for mean number of associations and
log-frequency values, in those cases where
differences exist, the differences favor the
potential learning rate of the long-latency
trigrams.


TABLE 1

RT VALUES, MEANINGFULNESS VALUES, ASSOCIATION VALUES, PRONUNCIABILITY
RATINGS, AND LOG FREQUENCY VALUES OF EIGHT TRIGRAMS USED AS
RESPONSE TERMS IN THE PAIRED-ASSOCIATE LIST

                            |    Low-Meaningfulness Trigrams    |   High-Meaningfulness Trigrams
                            |  Short Latency  |  Long Latency   |  Short Latency  |  Long Latency
Measure                     |  LIW   |  YAZ   |  GUH   |  KOJ   |  HEL   |  GEN   |  RUG   |  TON
Mean RT                     |  3.29  |  3.54  |  4.98  |  5.47  |  2.11  |  2.26  |  2.82  |  3.33
  (M)                       |     (3.46)      |     (5.22)      |     (2.18)      |     (3.08)
Mean No. Associations       |  5.43  |  4.50  |  4.86  |  4.63  |  5.47  |  5.90  |  5.73  |  5.73
  (M)                       |     (4.96)      |     (4.74)      |     (5.69)      |     (5.73)
Noble’s m’                  |  1.25  |  1.23  |  1.29  |  1.21  |  3.67  |  3.58  |  3.90  |  3.83
  (M)                       |     (1.24)      |     (1.25)      |     (3.62)      |     (3.86)
Noble’s a’                  |  1.47  |  1.46  |  1.49  |  1.44  |  3.88  |  3.80  |  4.10  |  4.04
  (M)                       |     (1.46)      |     (1.46)      |     (3.84)      |     (4.07)
Noble’s a                   |  .32   |  .40   |  .38   |  .35   |  .98   |  .98   |  1.00  |  .98
  (M)                       |     (.36)       |     (.36)       |     (.98)       |     (.99)
Mean Rated Pronunciability  |  7.05  |  3.15  |  4.45  |  4.95  |  1.75  |  2.10  |  1.85  |  1.40
  (M)                       |     (5.10)      |     (4.70)      |     (1.92)      |     (1.62)
Sum of Log of Frequency     | [illegible] | 4.991 | 6.373 | 4.612 | 7.329  | 7.230  | 6.487  | 7.735
  (M)                       |     (5.?31)     |     (5.492)     |     (7.279)     |     (7.111)

Phase 2—In this phase of the experiment 
the eight trigrams thus selected were used 
as response terms in a paired-associates learn- 


ing task for the purpose of determining the 
rate at which each trigram was learned. 

Subjects—The Ss were 60 undergraduate 
volunteers from the State University of New 
York at Albany (40 females and 20 males), 
naive with respect to previous experience in 
verbal learning experiments. Two addi- 
tional Ss were not included in the sample 
because their performance scores did not 
meet the criterion of learning of the experi- 
ment. 

Materials—The stimulus terms for the 
eight trigrams were randomly selected two- 
digit numbers (except those with “1” as the 
first digit) randomly assigned to the tri- 
grams. The eight paired associates were 
randomly arranged in four orders with re- 
strictions that each pair occupy a different 
position in each order and that no pair fol- 
low another pair more than once. The four 
orders were typed in capital letters on a 
continuous loop of paper for use in a 
Lafayette Memory Drum (Model 303B). 

A practice list of paired associates was 
constructed in the same manner as the test 
list except that the two-digit number stimu- 
lus terms were different from those of the 
test list and the response terms were CVC 
trigrams randomly selected from the 50th
percentile of Noble’s m’ list. A second
Lafayette Memory Drum was used for pre- 
senting the practice list. 

Procedure—The experimental procedure 
was designed to be identical for all Ss except 
that Ss were equally assigned, as they ap- 
peared, to learn the list under one of three 
different presentation rates: a 2-sec. presen- 
tation rate (1 sec. for stimulus term alone 
and 1 sec. for stimulus and response terms 
together) and a 2-sec. intertrial interval, a 
4-sec. presentation rate and a 4-sec. intertrial 
interval, or an 8-sec. presentation rate and 
8-sec. intertrial interval. The Ss were individually
tested. Each S was orally presented
with standard instructions in the operation
of the memory drum and in the
anticipation method learning task. The Ss 
were given five trials on the practice list at a 
4-sec. presentation rate and a 4-sec. inter- 
trial interval. Any reasonable pronunciation 
of the response terms was allowed. Since 
the eight paired associates were presented 
in four different orders on a continuous loop 
tape, an attempt to further reduce serial 
position effects was made by randomly select- 
ing the first order to be presented to S. 
Thus, the first pair to be viewed by each S 
was one of four different pairs. 

Learning trials were terminated at the 
end of one errorless trial or at the end of 
40 trials if an errorless trial had not oc- 
curred. Learning rate of a given pair for a 
given S was the number of the trial on 
which S made his first correct anticipation. 
A pair not learned at the end of the 40 
trials was assigned a score of 41. The
performance criterion required that S learn 
at least one pair from each of the four tri- 
gram groups (SL-HM, LL-HM, SL-LM, 
and LL-LM) within 40 trials. Two Ss
did not meet this criterion and
were, therefore, excluded from
the study.

Design.—The learning-rate score for a 
given S on a given trigram was the first 
trial on which S correctly anticipated the tri- 
gram. The learning-rate score for a given 
S on a trigram group (SL-HM, LL-HM, 
SL-LM, or LL-LM) was the mean of the 
scores of the two trigrams which comprised 
the group. Although Ss were tested under 
only one of the three different presentation 
rates, all Ss were required to learn the same 
eight paired associates. The design was, 
therefore, a mixed factorial in which presen- 
tation rate (2-, 4-, and 8-sec.) was a be- 
tween-Ss variable and RT (short latency 
and long latency) and meaningfulness (low
and high), within-Ss variables.
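
The scoring rules in the Procedure and Design paragraphs can be summarized in a short sketch. It is offered only as an illustration under stated assumptions: each S's record is assumed to be a list of booleans per trigram (one entry per trial, True for a correct anticipation), and the function names are hypothetical. The 40-trial limit, the score of 41 for an unlearned pair, the group composition, and the inclusion criterion are taken from the text.

```python
from statistics import mean
from typing import Dict, List

MAX_TRIALS = 40
UNLEARNED_SCORE = MAX_TRIALS + 1  # a pair not learned in 40 trials scores 41

# Trigram groups as defined in Phase 1.
GROUPS = {
    "SL-HM": ("HEL", "GEN"),
    "LL-HM": ("RUG", "TON"),
    "SL-LM": ("LIW", "YAZ"),
    "LL-LM": ("GUH", "KOJ"),
}


def trigram_score(correct_by_trial: List[bool]) -> int:
    """Trial number (1-based) of the first correct anticipation, else 41."""
    for trial, correct in enumerate(correct_by_trial[:MAX_TRIALS], start=1):
        if correct:
            return trial
    return UNLEARNED_SCORE


def group_scores(scores: Dict[str, int]) -> Dict[str, float]:
    """Mean of the two trigram scores in each RT x meaningfulness group."""
    return {
        group: mean(scores[t] for t in members)
        for group, members in GROUPS.items()
    }


def meets_criterion(scores: Dict[str, int]) -> bool:
    """S must learn at least one pair from each group within 40 trials."""
    return all(
        any(scores[t] <= MAX_TRIALS for t in members)
        for members in GROUPS.values()
    )
```

Because every S contributes one score per trigram group, the four group scores form the within-Ss cells of the mixed factorial, with presentation rate as the between-Ss factor.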


RESULTS 


The mean number of trials and mean 
total time required for the first correct 
anticipation of the four groups of tri- 
grams by presentation rates are given 
in Table 2. The significant main ef- 
fect of the RT factor, F (1, 57) = 
12.07, p < .001, is apparent in Fig. 1 
which is a plot of the mean trials scores 
on the short- and long-latency RT tri- 
grams as a function of presentation 
rate. The mean of the first correct 
anticipation trial for the four short-latency
RT trigrams over presentation
rates (M = 9.05) is less than that of 
the long-latency RT trigrams (M = 
14.96), a finding consistent with the 
primary hypothesis of the present 
study. 

The significant main effect of mean- 
ingfulness, F (1, 57) = 123.07, p< 
.001, is apparent in Fig. 2, which is a 
plot of the performance scores for the 
low- and high-meaningfulness trigrams 
as a function of presentation rate. The 
mean of the first correct anticipation 
trial for the four high-meaningfulness 
trigrams over presentation rates (M = 
6.98) is less than that of the low-mean- 
ingfulness trigrams (M = 12.57). 

The significant main effect of pres- 
entation rate can be seen in Fig. 1 and 
2, where the mean number of trials to 
the first correct anticipation varies inversely
with presentation rate, M_{2 sec.}
= 16.35, M_{4 sec.} = 8.00, M_{8 sec.} = 4.98,
F (2, 57) = 60.90, p < .001.

The significant interaction between 
RT and presentation rate, F (2, 57) = 
8.08, p < .001, can also be seen in Fig. 
1, where the difference between the 
means of the number of trials to first 
correct anticipation for short-latency 
RT trigrams and long-latency RT tri- 
grams is greater, the briefer the presen- 
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| TABLE 2 
MEAN NUMBER OF TRIALS AND MEAN TIME REQUIRED FOR THE First CORRECT 
ANTICIPATION OF SHORT- AND LONG-LATENCY RT TRIGRAMS, 
Low- AND HIGH-MEANINGFULNESS TRIGRAMS, 
BY PRESENTATION RATES 
Low-Meaningfulness Trigrams High-Meaningfulness Trigrams 
Short Li Long Lai L 
r A (LIW + YAZ) (GUR + Koy) dint GEN) UE FTON 
No. of Total No. of | Total No, of Total No. of Total 
Trials Time Trials Time Trials Time Trials Time 
2 sec. 
M 18.0 36.0 27.3 54.6 11.0 22.0 9.2 18.4 
$ SD 9.22 18.44 7.85 15.70 4.98 9.96 5.62 11.24 
sec. 
M 7.9 31.6 10.9 43.6 7.6 30.4 5.6 22.4 
k SD 3.18 12.72 4.86 19.44 4,08 16.32 2.64 10.56 
sec, 
M 4.9 39.2 6.4 51.2 5.0 40.0 3.6 28.8 
SD 1.65 13.20 2.74 21.92 1.92 15.36 1.43 11,44 
M 10.3 35.6 14.9 49.8 7.9 30.8 6.1 23,2 
SD 7.99 15.31 10,56 19.71 4.57 16.04 4.34 11.71 


tation rate, a finding consistent with 
the secondary hypothesis of the pres- 
ent study. The significant interaction 
between meaningfulness and presenta- 
tion rate, F (2, 57) = 48.90, p < .001, 
can be seen in Fig. 2, where the differ- 
ences between performance on low- 

and high-meaningfulness trigrams is 

inversely related to presentation rate, 
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Fic. 1. Mean number of trials to first 


correct anticipation for short-latency RT 
and long-latency RT trigrams as a function 
of three presentation rates. 


i.e., the shorter the presentation rate, 
the greater the difference between 
scores on the low- and high-meaning- 
fulness trigrams. From Table 2 it 
would appear that the significant inter- 
action between RT and meaningfulness, 
F (1, 57) = 23.88, p < .001, results 
from the fact that the difference be- 
tween mean performance scores on 
short- and long-latency RT trigrams of 
low meaningfulness, Mguim — Mur-im 
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Fic. 2. Mean number of trials to first 
correct anticipation for low-meaningfulness 
and high-meaningfulness trigrams as a func- 
tion of three presentation rates, 
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= 10.3 — 14.9 = —4.6, favors the faster 
learning rate of the short-latency tri- 
grams, whereas the difference between 
short- and long-latency trigrams of high 
meaningfulness, $M_{SL\text{-}HM} - M_{LL\text{-}HM}$ =
7.9 − 6.1 = 1.8, favors the long-latency
trigrams. The significant triple inter- 
action, F (2, 57) = 3.55, p < .05, sug- 
gests, however, that the double interac-
tion of RT × Meaningfulness is a
function of presentation rate, i.e., the
RT × Meaningfulness interaction is
most pronounced at the 2-sec. presenta- 
tion rate and least at the 8-sec. rate. 
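
The two simple differences discussed in this paragraph can be re-derived from the overall means of Table 2. The following is a minimal sketch in Python; the dictionary keys and function names are illustrative labels chosen here, not notation used in the study.

```python
# Overall mean trials to first correct anticipation (overall M row of Table 2),
# keyed by (meaningfulness level, RT latency class).
mean_trials = {
    ("low", "short"): 10.3,
    ("low", "long"): 14.9,
    ("high", "short"): 7.9,
    ("high", "long"): 6.1,
}


def latency_difference(meaningfulness):
    """Short-latency mean minus long-latency mean at one meaningfulness level."""
    diff = mean_trials[(meaningfulness, "short")] - mean_trials[(meaningfulness, "long")]
    return round(diff, 1)


# Negative value: short-latency trigrams needed fewer trials.
low_diff = latency_difference("low")
# Positive value: long-latency trigrams needed fewer trials.
high_diff = latency_difference("high")
# Difference of the two differences, i.e., the RT x Meaningfulness contrast.
interaction_contrast = round(low_diff - high_diff, 1)

print(low_diff, high_diff, interaction_contrast)
```

The opposite signs of the two differences are what the RT × Meaningfulness interaction reflects; the sketch only reproduces the arithmetic on means and says nothing about the F tests themselves.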

Since the total time required for the 
first correct anticipation is a constant 
multiple of the presentation rate, the 
significant effects with respect to the 
within variables (RT and meaningful- 
ness) hold true for the total time mea- 
sures as well as for the mean trials 
measures. With respect to the be- 
tween variable (presentation rate), 
however, the use of total time rather 
than number of trials yields different 
results. An F test of the presentation- 
rate scores in terms of mean total time 
required for the first correct anticipa- 
tion was not significant, F (2, 57) = 
0.54, p > .05. 
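
The relation between the two criterion measures can be checked directly against Table 2: total time is the mean number of trials multiplied by the presentation rate. The sketch below assumes only that relation; the variable and function names are hypothetical.

```python
# Mean trials to first correct anticipation at each presentation rate (sec.),
# taken from Table 2 in the column order: low-M short-latency,
# low-M long-latency, high-M short-latency, high-M long-latency.
mean_trials_by_rate = {
    2: [18.0, 27.3, 11.0, 9.2],
    4: [7.9, 10.9, 7.6, 5.6],
    8: [4.9, 6.4, 5.0, 3.6],
}


def total_time(rate_sec, trials):
    """Total time to criterion: presentation rate times number of trials."""
    return round(rate_sec * trials, 1)


total_time_by_rate = {
    rate: [total_time(rate, trials) for trials in row]
    for rate, row in mean_trials_by_rate.items()
}

for rate, row in total_time_by_rate.items():
    print(f"{rate} sec.: {row}")
```

Within a single rate every score is multiplied by the same constant, so comparisons among the within variables are unchanged. Across rates the multiplier differs (2, 4, or 8), which is why the between-groups comparison can come out differently for total time than for trials.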


Discussion 


The significant main effect of RT sup- 
ported the hypothesis that the PA learn- 
ing rate of CVC trigrams is a function 
of associative reaction time (RT), i.e.,
trigrams with short-latency RTs are 
learned faster than trigrams with long- 
latency RTs. In view of the controls 
employed, it would appear that RT is in 
some measure independent of several 
widely used measures of meaningfulness. 
The significant interaction between RT 
and presentation rate supported the hy- 
pothesis that the effect of RT would be 
most pronounced under brief presentation 
rates, but the significant interaction be- 
tween RT and meaningfulness suggests 
that the effect of RT is limited to low- 


meaningfulness trigrams. This limitation 
may be the result of unique character- 
istics of the trigrams used in the long- 
latency high-meaningfulness group, i.e., 
RUG and TON were the only words in 
the list of eight CVC trigrams. It is 
conceivable that the uniqueness of words 
presented within a group of nonsense 
syllables facilitates the learning of words, 
i.e., the von Restorff effect. An alterna- 
tive explanation may lie in the RT scal- 
ing procedure, wherein the uniqueness of 
words presented within a group of non- 
sense syllables may elicit a surprise re- 
action which in turn results in a delay 
in the verbal associations, thus arti- 
factually increasing the latency of RT. 
Some small evidence favoring this inter- 
pretation was found in the RT scaling 
data which showed that the trigram 
SEX, included in the sample of high- 
meaningfulness trigrams, produced the 
largest number of associations, but had 
one of the longest RTs of all trigrams. 

Although presentation rate was a sig- 
nificant factor when the criterion mea- 
sure was expressed in terms of the 
number of trials to the first correct 
anticipation, the use of total time to the 
first correct anticipation resulted in no 
significance, findings consistent with those 
of Bugelski (1962). The significant in- 
teraction between presentation rate and 
meaningfulness suggests, however, that 
learning rate is not completely inde- 
pendent of presentation rate. A perusal 
of the mean total time measures of 
Table 2 indicates that for low-meaning- 
fulness trigrams, the 4-sec. presentation 
rate resulted in the least total time re- 
quired, whereas for high-meaningfulness 
trigrams, presentation rate is directly re- 
lated to total time required, findings 
which are in complete agreement with 
those of Carroll and Burke (1965). 

The significant effects of meaningful- 
ness are consistent with the general find- 
ings of previous studies which have 
clearly indicated that learning rate is 
directly correlated with meaningfulness 
(Carroll & Burke, 1965; Kimble & Du- 
fort, 1955; Mandler & Huttenlocher, 
1956). 
CAN THE SUPERIOR LEARNABILITY OF MEANINGFUL
AND PLEASANT WORDS BE TRANSFERRED
TO NONSENSE SYLLABLES? 1


ALBERT SILVERSTEIN AND RICHARD A. DIENSTBIER 2
University of Rhode Island 


The Ss in 2 experiments first learned a PA list of words as responses 
to nonsense syllables. In 1 condition the words differed in meaningful- 
ness (M) and in the other condition the words differed in pleasantness 
(PL). Next, Ss learned a 2nd PA list of the same syllables as 
responses to numbers. High-M words were learned faster than low-M 
words, but did not transfer either this difference in learning or the
difference in rated M to the syllables. Pleasant words were learned 
faster than indifferent words in one experiment (men’s) but not in 
the other (women’s). In neither experiment did the words transfer 
their difference in learning to the syllables, but in the men’s experi-
ment they did transfer the difference in rated PL.


While both meaningfulness (M) and 
pleasantness (PL) of learning ma- 
terials have received much attention 
from psychologists investigating verbal 
learning, it is not known whether or 
not different values of these variables 
can be transferred to new items in such 
a way as to alter their ability to be 
learned. Yet, such information is 
needed to determine the exact locus of 
these variables’ influence on learning. 

Those theories of meaningfulness 
that emphasize the availability and in- 
tegration of identifying responses to 
verbal items (cf. Goss & Nodine, 1965; 
Underwood & Schulz, 1960) would 
predict no effect of induced M on 
learning beyond that produced by fa- 
miliarization. On the other hand, a 
theory of M that emphasizes the vari- 
ety or distinctiveness of associative 
responses to an item (cf. Noble, 1963) 
would predict a gain in rate of learning 
as a direct function of the number of 
different new responses conditioned to 


1 This research was supported by United
States Public Health Service Grant MH
08974 from the National Institute of Mental
Health.
2 Now at the University of Rochester.


items. Experiments by Bailey and 
Jeffrey (1958) and Parker and Noble 
(1963) have failed to reveal any differ- 
ence in rate of PA learning for re- 
sponse terms conditioned to varying 
numbers of new associates beyond what 
could be ascribed to the influence of 
familiarization. An alternative ap- 
proach to inducing M would be to con- 
dition single words of widely different 
M to nonsense syllables of initially 
equal and low M. Any theory propos- 
ing that M involves some dimension of 
a unitary response to a stimulus rather 
than a pattern of evoked responses 
(e.g., intensity of r_m, Osgood, Suci, &
Tannenbaum, 1957) would predict that 
such a conditioning of M will transfer 
the difference in the ability to be 
learned from words to syllables. The 
current study employs such a condi- 
tioning procedure. 

Recently, Staats (1964) has sug- 
gested that PL is an attribute of a 
meaning response that can be condi- 
tioned to neutral items, giving them 
secondary reinforcing properties. If 
so, such conditioning should lead to 
superior learning of items in direct 
proportion to the PL of words to which 
they were conditioned. Silverstein and
McCreary (1964) produced evidence 
of a positive relation between induced 
PL and learning, but they used photo- 
graphs of real scenes, rather than 
words, as conditioners of PL. 

The present study reports two ex- 
periments in which Ss first learned a 
list of paired associates with nonsense 
syllables as stimuli and nouns as re- 
sponses and then learned a second PA 
list using the same syllables as re- 
sponses to two-digit numbers. In both 
experiments there was one condition 
in which the nouns differed in M and 
another in which they differed in PL. 
The major experimental question was 
whether the syllables in List 2 would 
be learned at different rates according 
to the M or PL of the words with 
which they had been paired. 


METHOD 


Experiments I and II were identical in 
methodology except the former used women 
Ss and words previously scaled by women 
for M and PL, while the latter used men 
Ss and words previously scaled by men. 
The experiments conformed to the A-B, C-A 
transfer design with the effects of the inde- 
pendent variables assessed within Ss through 
a mixed-list procedure. In the M condi- 
tion half the nouns were high M and half 
were low M; in the PL condition half the 
nouns were highly pleasant (P) and half 
were indifferent (I). The A-B, C-A design 
was used because of the larger effects of both 
M (Underwood & Schulz, 1960) and PL 
(Anisfeld & Lambert, 1966) on responses 
than on stimuli, because conditioned mean- 
ing should be more potent when the CS is 
presented prior to the US (Staats, 1964), 
and because this design should yield less 
negative transfer than the A-B, A-C design. 

Subjects—In Exp. I there were 48 women 
Ss from basic psychology courses at the Uni- 
versity of Rhode Island. For Exp. II Ss 
were 32 male undergraduates from basic 
psychology classes at the University of 
Rhode Island. All Ss were experimentally 
naive and were equally and randomly as- 
signed to the two conditions. They were 
run individually by the anticipation method.


Stimulus materials—The response words 
in List 1 were selected from a pool of 153 
two-syllable nouns previously rated for PL 
and M by two independent samples of men 
and two samples of women students. A 
total of 101 of these nouns were common 
to the men’s and women’s lists (Silverstein 
& Dienstbier, in press). The ratings of M 
were obtained through procedures like 
Noble’s (Noble & Parker, 1960), using a 
5-point scale with a low of 1 and a maximum 
of 5. The PL ratings were made on a 7- 
point scale running from very unpleasant (1) 
to very pleasant (7), with 4 as neutral. In 
the M condition PL of the sublists was 
equated, while printed frequency covaried 
with M (Thorndike & Lorge, 1944). In the 
PL condition M of the sublists was equated 
and frequency was roughly equated. Table 1 
shows the words used in the two experiments 
along with their PL and M ratings and 
Lorge (L) frequency counts. The values 
given a word in a list are those produced by 
the appropriate-sex sample. 

The eight nonsense syllables used were 
taken from the 47% and 53% association 
values of Glaze (1928). Intralist similarity 
was low, with four different vowels and 14 
different consonants used. No first or last 
letters were repeated in the list. The two- 
digit numbers used were of low-association 
value, from .79 to 1.22 (Battig & Spera, 
1962). No first or second digits were re- 
peated, fives and zeros were not used, and 
no number was the reverse of any other 
number.

In each experiment two different pairings 
of stimuli and responses were used for both 
lists, and half the Ss in each condition were 
assigned to a pairing. The pairings were 
randomly determined with the following 
restrictions: In List 1 each syllable was 
paired with a word from a different sublist 
in the two pairings, and no syllable was 
paired with a word that began with its first 
letter. In List 2 each number was paired 
once with a syllable from each sublist (as 
determined by the word it had been paired 
with previously), and no number was paired 
with a syllable beginning with its first 
phoneme. There were four different random 
orders of presentation used for the pairs of 
both List 1 and List 2. Starting orders were 
randomly assigned to Ss. 

Apparatus.—Stimuli were mounted on 35- 
mm. slides and shown by a Kodak Carousel 
automatic slide projector. The automatic 
timer of the projector was set at 5 sec. per 
slide with 10 sec. between trials. 
TABLE 1
WORDS USED IN LIST 1

          Exp. I (Women)                         Exp. II (Men)
Words        M      PL       L        Words        M      PL            L

                         Pleasantness Condition
P Words                               P Words
Caress      3.71   6.30      75       Caress      3.54   6.34           75
Delight     3.73   6.34     353       Laughter    3.73   6.24          275
Kindness    3.71   6.37     115       Pleasure    4.10   6.43          442
Sunshine    3.93   6.41     194       Summer      4.46   6.30          783
M           3.77   6.36     184       M           3.86   6.33          393.8
I Words                               I Words
Breakfast   3.41   3.38     539       City        4.39   [illegible]   1258
Grocer      3.19   4.17      31       Highway     3.87   4.29          156
Object      3.33   3.85     394       Paper       3.81   4.06          1235
Paper       3.40   4.03    1235       Table       3.57   4.04          1325
M           3.33   3.86     549.8     M           3.91   4.21          993.5

                         Meaningfulness Condition
High-M Words                          High-M Words
City        4.36   3.74    1258       Building    4.03   3.39          1014
Morning     3.74   5.22    2015       Highway     3.87   4.29          156
Study       3.98   3.57     942       Paper       3.81   4.06          1235
Woman       4.29   5.20    2431       Story       3.81   4.93          1651
M           4.07   4.63    1661       M           3.88   4.17          1014
Low-M Words                           Low-M Words
Arbor       2.20   4.73      29       Arbor       2.25   4.53          29
Boulder     2.37   5.07      13       Dotage      1.44   3.57          1
Mallet      2.21   3.36      17       Lorry       1.74   4.27          3
Zenith      2.40   4.87       2       Sequel      2.10   3.97          3
M           2.29   4.51      15.3     M           1.88   4.09          9


Procedure—For the pairs in both List 1 
and List 2 the stimulus term appeared for 
5 sec. followed by a 5-sec. showing of its 
paired response. With List 1 Ss were in- 
structed to spell the nonsense syllables and 
pronounce the words out loud on the study 
trial, but, thereafter, they were to try to 
anticipate the words only. The Ss were run 
to a criterion of two successive errorless 
trials in order to maximize the degree of 
conditioning. Those who failed to reach 
criterion within the limit of 17 trials were 
replaced in the design. With List 2 Ss 
were instructed that the syllables would be 
the same as those they had previously en-
countered and to both say the number and
spell the syllable out loud on the study trial. 


Thereafter, they were to try to anticipate 
the syllables only. This task was continued 
to a criterion of one perfect trial or a 
minimum of 10 trials. Any S who failed 
to anticipate each pair correctly at least once 
within the limit of 20 trials was replaced in 
the design. 

After learning List 2 Ss were asked to 
rate the nonsense syllables on the dimen- 
sion appropriate to their experimental condi- 
tion. In the PL condition Ss were given 
a thermometer-type scale on a card for 
rating syllables’ PL. This was a 7-point 
scale like that used for the word ratings. In 
the M conditions Ss were given a Noble-type 
scale (Noble, 1961) ranging from “no as-
sociations” (1) to “very many associations”
(5). Noble’s instructions were used for this
task. The syllables were shown to Ss, one 
at a time, for as long as was needed for 
the ratings. 


RESULTS 


In reading the presentation of re- 
sults it should be remembered that the 
sex of Ss is confounded with different 
response terms in the first list, but not 
in the second list in which all Ss 
learned the same items. All statistical 
tests made on percentages followed 
transformation of the data to arc sines. 
The degrees of freedom associated with 
tests made of Exp. I were 47 and for 
Exp. II were 31. 
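
The arc sine transformation mentioned above is commonly applied to proportions before t tests because it tends to stabilize their variance. The report does not say which variant was used; the sketch below assumes the form 2·arcsin(√p), and the function name is hypothetical.

```python
import math


def arcsine_transform(percent_correct):
    """Return 2 * arcsin(sqrt(p)) in radians for a percentage between 0 and 100.

    Assumption: this particular variant is illustrative; the source states
    only that percentages were transformed to arc sines.
    """
    p = percent_correct / 100.0
    if not 0.0 <= p <= 1.0:
        raise ValueError("percentage must lie between 0 and 100")
    return 2.0 * math.asin(math.sqrt(p))


# Example: transform one S's percentage correct on each sublist, then
# analyze the difference between the transformed scores.
high_m_score = arcsine_transform(80.0)
low_m_score = arcsine_transform(60.0)
print(round(high_m_score - low_m_score, 3))
```

The percentages in the usage lines are made-up illustrations, not data from the experiments; in practice the transformation would be applied to each S's scores before the difference t is computed.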

Meaningfulness: List 1—The mean 
number of trials to the criterion of two 
perfect trials was 10.43 for the women 
and 14.00 for the men. As can be seen 
from Table 2 both experiments showed 
the typical substantial advantage for 
high-M over low-M words. Both the 
women’s and men’s differences were 
reliable beyond .001 (t= 7.62 and 
4.48, respectively). In both experi- 
ments this advantage was reflected in 
the response-availability stage (Under- 


wood & Schulz, 1960). The women 
completed this stage more rapidly for 
the high-M words by .61 trials (t= 
3.11, p<.01) and the men by .88 
trials (t= 3.15, p< .01). While the 
women continued to show the superi- 
ority of the high-M words through the 
association stage (Underwood & 
Schulz, 1960) by .74 trials (t = 3.52, 
p < .01), the men completed this stage 
with nearly identical speeds for the 
two sublists. The percentage correct 
following first correct response, a mea- 
sure of associative stability, was re- 
liably greater for high-M words in both 
experiments (by 12.9% for the women 
and 7.3% for the men). 
Meaningfulness: Transfer. — The 
speed of learning List 2 was virtually 
identical for women and men Ss (11.9 
and 12.5 trials). Table 2 shows that 
the number of correct anticipations 
over 10 trials was quite similar for the 
syllables paired with highly meaning- 
ful words (high M) and those paired 
with low meaningful words (low M) 
in both experiments. In neither case 
was the t for the difference reliable.


TABLE 2
MAJOR STATISTICS: MEANS AND DIFFERENCE t's

                            Exp. I (Women)                  Exp. II (Men)

Meaningfulness Condition
Statistic               High-M    Low-M     t diff      High-M    Low-M     t diff
                        Mean      Mean                  Mean      Mean
% Correct List 1        78.3      58.5      7.62*       64.5      54.1      4.48*
No. Correct List 2      21.0      18.7      1.79        17.7      18.4      <1
Syllable M              1.66      1.67      <1          1.89      1.59      1.43

Pleasantness Condition
Statistic               P Mean    I Mean    t diff      P Mean    I Mean    t diff
% Correct List 1        62.4      63.8      <1          68.5      56.8      3.76*
No. Correct List 2      19.7      19.2      <1          18.0      16.8      [illegible]
Syllable PL             4.61      4.32      1.69        4.95      4.02      2.98*

* p < .01.
Stage analyses revealed no differences
between the two sublists of either ex- 
periment for the response-availability 
stage, but showed a small superiority 
for the high-M sublists in the associa- 
tion stage. This superiority was .26 
trials for the women (t = 1.51, p <
.15) and .84 trials for the men (t =
1.75, p < .10). By pooling the scores 
on this stage of learning for both ex- 
periments a mean difference of .49 
trials appeared, which is reliable at .05 
(t= 2.23). 

The percentages of correct response 
following first correct response were 
nearly identical for the two sublists of 
both experiments. There also were no 
reliable differences for intralist error 
rates for the two sublists. Table 2 
also shows the near identity of mean- 
ingfulness ratings for the syllables 
paired with high- and low-M words. 
In neither experiment did the differ-
ence approach reliability.

Pleasantness: List 1—The mean 
number of trials to criterion was 11.51 
for the women and 12.25 for the men. 
Table 2 shows that the P words were 
learned more rapidly than the I words 
by the men in Exp. II, but that PL had 
no effect on the women in Exp. I. For 
the men the difference was very relia- 
ble (t = 3.76, p < .01), and manifested 
itself in faster response-availability (by
.63 trials) and association (by .55 
trials) stages, and in a greater per- 
centage correct following first correct 
response (by 7.5%). However, only 
the response-availability difference was 
reliable (t = 2.14, p < .05). The
values of t for the other two differences
failed to reach 10% significance. For 
the women’s experiment none of these 
differences between P and I words pro-
duced a t greater than unity.

Pleasantness: Transfer—The speed 
of learning List 2 to criterion was 


10.66 trials for the women and 13.69 
trials for the men. Table 2 shows that 
there was no reliable difference in num- 
ber correct over 10 trials between the 
P and I syllables (in terms of the 
words they had been paired with) for 
either experiment. In line with this 
result, the differences between sublists 
in both experiments were virtually nil 
for the response-availability and asso- 
ciation stages and for the percentage 
correct following first correct response. 
Nor were there reliable differences be- 
tween sublists in intralist error rates 
for either experiment. The differential 
PL of the two sublists of words was 
transferred reliably to the syllables in 
the men’s experiment (t = 2.98, p <
.01), but not in the women’s experi-
ment (t = 1.69, p > .10).


Discussion 


The evidence from these experiments 
is that, while high-M words were learned 
faster than low-M words and (in some 
circumstances) P words were learned 
faster than I words, these differential 
learnabilities were not transferred to 
nonsense syllables associated with those 
words. In the meaningfulness case the 
only difference found between syllables 
that had been differentially conditioned 
was in the speed with which responses 
were attached to correct stimuli after be- 
coming available. The most probable 
explanation for this small effect is that 
the words with which the syllables had 
been associated in List 1 sometimes served 
as mediators in List 2 between the syll- 
ables and the numbers, and that the high- 
M words were better able to serve this 
function. Such an explanation is based 
on the well-documented formation of 
backward associations during PA learn- 
ing (cf. Feldman & Underwood, 1957). 
An explanation of this effect in terms of 
syllables paired with high-M words ac- 
quiring a wider range of new associations 
is implausible in view of the absence of 
a difference in rated M between the two 
sublists. The absence of any difference
in speed of response learning between the
two sublists indicates that none of the
differences in strength of identifying
responses were transferred from words
to syllables, and supports the contention 
of Goss and Nodine (1965, p. 227) that 
the only facilitation in learning that can 
be produced by inducing meaning or 
meaningfulness arises from greater inte- 
gration and availability of recognition re- 
sponses to items on the list. 

While there is good reason for our 
belief that the A-B, C-A paradigm maxi- 
mized the likelihood of obtaining an 
effect of induced M, only further research 
can rule out the possibility that an effect 
could be produced by some other para- 
digm (e.g., the A-B, C-B backward con- 
ditioning paradigm). Meanwhile, our 
failure to produce any differences in the 
overall course of learning the syllables 
by inducing M suggests that the tradi- 
tional superiority of high-M over low-M 
words in learning is not the result of 
differences in any dimension of a unitary 
meaning response such as that measured 
by Polarity on the Semantic Differential 
(Osgood et al., 1957). 

The failure to obtain any effect of in- 
duced PL on learning, even in Exp. II 
where P words were more quickly learned 
than I words, stands at variance with 
the findings of Silverstein and McCreary 
(1964) and Silverstein (1966). These 
latter studies differed from the current 
one only in that they used photographs 
of real scenes as conditioning stimuli. 
The most obvious resolution of these 
findings is the hypothesis that pleasant 
words are substantially less potent than 
photographs in eliciting emotional reac- 
tions and, hence, less capable of produc- 
ing conditioning of the affect. It is not 
clear what the basis was for the judg- 
ments of higher PL obtained for P-paired 
than for I-paired syllables. It may have 
been the result of Ss recalling the specific 
words associated with the syllables and 
using them as the basis for a difficult 
judgment. In any event, these results 
point out the danger involved in using 




shifts in pleasantness ratings as an index 
of conditioned incentive value (cf. Staats,
1964).

There is reason to believe that the ab- 
sence of a difference in speed with which 
women Ss learned the P and I words was 
related to the presence of a higher level 
of task anxiety in women than in men 
Ss. Such anxiety may be presumed to 
interfere with any difference in affective 
response produced by P and I items. 
Preliminary evidence from our laboratory 
indicates that anxiety-reducing manipula- 
tions (e.g., incidental learning of List 1) 
can produce a superiority for P over I 
items with women Ss. 


2 While the conditioning procedure used by 
Staats was such that recall of specific emo- 
tional words was unlikely to be the basis 
for the shifts in ratings he reported, and 
they may reflect genuine affective condition- 
ing, the methodological point regarding the 
use of rating data remains an important one. 
PERCEPTION OF SLANT WHEN PERSPECTIVE
AND STEREOPSIS CONFLICT:
EXPERIMENTS WITH ANISEIKONIC LENSES 1


B. J. GILLAM
Biometrics Research, New York State Department of Mental Hygiene, New York


Aniseikonic lenses change disparities such that perspective and 
stereoscopic slant cues are in conflict. Ss viewing binocularly through 
aniseikonic lenses were required to judge the slant of each of a set of 
surfaces providing perspective slant cues. An adjustment method was 


used to record slant judgments. In general, reliable compromise
responses between the conflicting cues were obtained although some Ss 
suppressed perspective cues almost completely. Response to stereo- 
scopic cues was greatest for perspective patterns which were found 
to provide ineffective slant cues in monocular vision. 


The clinical condition of aniseikonia 
can be simulated by placing a 90° 
afocal lens, which magnifies the retinal 
image in the horizontal meridian, be- 
fore one eye. Frontal plane surfaces 
viewed binocularly under such condi- 
tions commonly appear slanted around 
a vertical axis because of the binocular 
disparities introduced by the horizontal 
magnification of one image. For a 
lens before the right eye, the slant is 
given geometrically by the following 
equation (Ogle, 1950) : 


$$\tan\theta = \frac{m - 1}{2m}\cdot\frac{y}{a} \tag{1}$$


where θ is the anticlockwise angle of
rotation around a vertical axis, m is 
lens magnification as a percentage, y is 
observation distance, and 2a is inter- 
ocular distance. 

Perspective cues to slant, defined in 
terms of retinal gradients,2 are not


1 This work was carried out while the
author was at the Australian National Uni-
versity, Canberra on a General-Motors
Holden Research Fellowship. Grateful ac- 
knowledgment is made to G. N. Seagrim of 
the Australian National University for his 
advice and assistance, and to J. Fleiss of 
Biometrics Research, New York, for his 
advice on statistical matters. 

2 The apparent perspective of contours
viewed through aniseikonic lenses is the 
reverse of that which would normally ac- 


changed by the lenses, since only a 
slight horizontal magnification of one 
retinal image is involved; and this is 
imperceptible in monocular vision. 
Therefore, if perspective cues are pres- 
ent aniseikonia creates a conflict be- 
tween these cues (consistent with the 
true slant of the surface) and disparity 
(consistent with the slant given by 
Equation 1). 

Ogle (1950) measured aniseikonic 
slant under optimum conditions for the 
use of stereopsis and found that Ss 
viewing through aniseikonic lenses set 
horopter rods on a curve rotated 
around the fixation point, as predicted 
from Equation 1. 

Ames (1946) suggested that the 
presence of perspective cues may have 
been responsible for his casual obser- 
vation that some surfaces appear more 
slanted than others when viewed 
through aniseikonic lenses. The re- 
search described in this paper is an 
attempt to test this assertion experi- 
mentally. 


company slants such as those perceived 
(Ames, 1946). The distinction between
retinal and apparent perspective is not crucial 
to the argument of the present paper and is 
discussed in relation to aniseikonic distortion 
in Gillam (1967). 
The most extensive work on depth
perception under conditions of conflict 
between disparity and monocular cues 
is that of Schriever (1925). He used 
stereoscopic photographs of three-di- 
mensional objects and induced cue con- 
flict by reversing the right and left eye 
views. He asked Ss which object they 
saw, e.g., a cone or a tunnel, as an 
index of which cue was dominant. 
Concerning perspective vs. stereopsis
he reports “perspective always domi- 
nates in the long run [p. 153].” 
Schriever’s experiments were in- 
genious, but his procedure did not allow 
for compromises between the conflicting 
cues. Nor did he vary stimulus con- 
ditions in a systematic way to see which 
variables influence cue dominance. 

Ittelson (1960) investigated conflict
between disparity and size cues to rela- 
tive distance by varying the angular 
size of two outer rectangles placed 
either in front of or behind a central 
rectangle. The Ss were asked whether 
the outer rectangles were in front of or 
behind the central rectangle. He 
found that, as the size difference be-
tween the outer and inner rectangles
increased, the influence of binocular dis- 
parity was reduced. 

In the case of slant the general term 
“perspective” covers the following 
cues: (a) gradients of vertical com- 
pression of contour spacing across the 
horizontal meridian of the retina, or 
“linear perspective.” This is produced, 
e.g., by a slanted pattern of horizontal 
lines; (b) gradients of horizontal 
compression of contour spacing across 
the horizontal meridian of the retina, or 
“foreshortening.” This is produced, 
e.g., by a slanted pattern of vertical 
lines; and (c) combinations of a and b. 
Such gradients can be monotonic de- 
creases of contour spacing or irregular, 
with a change in the probability of 
contour spacing. 

It is proposed to test the hypothesis 
that the extent to which a perspective
cue dominates the disparity cue in a 
conflict will be related to the effective- 
ness of the perspective cue, where ef- 
fectiveness is defined as the veridicality 
of slant judgments made with that cue 
alone. The more effective the perspec- 
tive cues present, the less should be the 
susceptibility of a surface to aniseikonic 
distortion. The effectiveness of the 
perspective cues present can be deter- 
mined by obtaining monocular slant 
judgments of a surface. 


EXPERIMENT I 
Method 


Seven stimulus surfaces were used. Two 
with patterns of vertical lines (A and B) 
provided only foreshortening. The lines 
were 4.4 cm. and 6.9 cm. apart, respectively.
Two with patterns of horizontal lines (C and 
D) provided only linear perspective. The 
lines were 2.5 cm. and 5 cm. apart, re- 
spectively. Surfaces E and F had grid 
patterns combining A/C and B/D, respec- 
tively. Surface G had a pattern of randomly 
scattered dots and provided only irregular
gradients. All lines in the patterns were 
composed of small dots to provide ample 
stimuli for stereoscopic vision. Each pat- 
terned surface was presented for slant 
judgments in binocular vision with one of 
three afocal lenses before the right eye 
(aniseikonic trials) and in monocular vision 
(right eye) with no lens present. In the 
aniseikonic trials the surfaces were always 
in the frontal plane, the slants to be judged 
being lens induced; whereas in the monocu- 
lar trials the surfaces were actually slanted. 
The method of adjustment was used to 
measure perceived slant.

Ogle’s (1950) method of measuring 
aniseikonic slant required Ss to adjust the 
stimulus surface itself until it appeared to 
be in the frontal plane. The deviation from 
the true frontal plane was taken as a measure 
of the distortion. This method yielded very 
unreliable results when tried with unsophis- 
ticated Ss because of the odd appearance of 
moving contours viewed through the lenses 
as reported by Ames (1946). The method 
adopted after pilot studies used a monocu- 
larly viewed, therefore undistorted, variable 
comparison stimulus which S could move 
around a central vertical axis to appear at 
the same slant as the binocularly viewed 
standard stimulus. 




It was considered desirable to avoid con- 
tours on the variable stimulus similar to 
those on the standard surfaces, so that Ss 
would be forced to match the two surfaces 
solely on the basis of apparent slant. The 
variable stimulus chosen was a circular “Mec- 
cano” wheel, diameter 6 cm., with diagonal 
markings, placed in the upper part of the 
visual field above and in front of the stimulus 
pattern and hidden from the left eye by a 
screen. In the monocular trials the whole 
left eye was occluded. In the aniseikonic 
trials only the upper part of the left visual 
field containing the variable stimulus was 
occluded. 

In the aniseikonic trials the afocal lens 
before the right eye was in the 90° position 
with its axis parallel to the median plane. A 
piece of glass of the same diameter as the 
lens was placed before the left eye. Afocal 
lenses of 2%, 3%, and 4% magnifications were 
used, consistent with stereoscopic slants of 
9°38’, 14°7’, and 18°10’, respectively. Retinal
perspective was, of course, consistent with
the true frontal plane position of the sur- 
faces. The viewing distance was 56 cm. 
Distance between the center of the lens and 
the center of the piece of plain glass was 
kept constant at 65 mm. The effect on the 
predicted aniseikonic slant of variations in 
interocular distance, when interlens distance 
is held constant, is negligible (Gillam 1964). 
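
As a rough check on the stereoscopic slants quoted above, the following sketch assumes that the prediction follows the geometric-effect relation tan θ = [(M − 1)/(M + 1)] · (2d/a), where M is the magnification ratio, d the viewing distance, and a the interocular separation. The equation used by the author is not reproduced in this section, so the formula, the function name, and the use of the 65-mm interlens distance for a are all assumptions. The values it gives come close to those reported but do not match them exactly.

```python
import math

def predicted_slant_deg(magnification_pct, viewing_distance_cm=56.0, eye_separation_cm=6.5):
    """Predicted slant (degrees) for a horizontal magnification of one eye's image.

    Assumes tan(theta) = ((M - 1) / (M + 1)) * (2d / a); this relation is an
    assumption, not a formula quoted from the section.
    """
    m = 1.0 + magnification_pct / 100.0
    tan_theta = ((m - 1.0) / (m + 1.0)) * (2.0 * viewing_distance_cm / eye_separation_cm)
    return math.degrees(math.atan(tan_theta))

# Magnifications used in the two experiments.
for pct in (1, 2, 3, 4, 6, 8):
    print(f"{pct}% lens -> about {predicted_slant_deg(pct):.1f} deg")
```

Under these assumptions the predicted slant rises monotonically with magnification, which is the property the later analyses rely on.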

In the monocular trials each of the seven 
patterns was presented at real slants of 
10°, 14°, and 18° to the right around a central 
vertical axis.

Apparatus—The patterns were painted in 
black India ink on sheets of plate glass 3 mm. 
thick, 59 cm. wide, and 43 cm. high. Each 
line of the Patterns A, B, C, D, E, and F 
was composed of small dots placed at random 
intervals but with the same overall density 
of 7 dots/cm for each line. For Pattern G, 
64 dots were distributed at random within 
each 5-cm. square to ensure an even distribu- 
tion over the whole surface. 

The patterns were not extended into the 
upper third of the glass sheets in the monocu- 
lar visual field behind the variable stimulus. 
The upper edges of the vertical line and ran- 
dom dot patterns were uneven to eliminate 
linear perspective.

The sheets of glass could be slid into a 
vertically positioned wooden frame attached 
to a table. The frame could be rotated 
manually by E around a central vertical axis 
in the monocular trials. 

Behind the frame, a large dome of white 
sanded perspex was placed in a vertical 
position. This backdrop was illuminated
from behind so as to give an even, diffuse
light behind the glass. This gave the im- 
pression that the dots were drawn on a white, 
completely textureless surface. The glass 
was not visible. When the glass was in 
position and in the frontal plane, each pattern 
subtended a vertical visual angle of 9° and 
a horizontal visual angle of 15°. 

The lens-presentation apparatus consisted 
of a metal box 40 cm. square, mounted on a 
stand. Rectangular holes were cut in both 
sides of the box which formed an aperture 
through which S seated behind the box 
could view the stimulus patterns. 

Adjustable screens around the aperture 
and a septum between the eyes allowed the 
horizontal extent of the visual fields for the 
two eyes to be approximately restricted to 
the binocular visual field. Adjustable 
screens above the aperture allowed either the 
variable to be occluded for the left eye or 
the whole left eye view to be occluded for 
monocular trials. In addition, a sanded 
perspex screen could be lowered over the 
whole aperture between trials. 

Two rotatable metal discs were mounted 
inside the box in such a manner that any one 
of six lenses, or pieces of plain glass of the 
same diameter mounted around their circum- 
ferences, could be brought in front of each 
eye.

The wheel variable was suspended from a
horizontal bar 12 cm. in front of the glass. 
Its height could be adjusted so that it ap- 
peared just above the stimulus pattern. It 
could then be screened off for the left eye 
without obscuring the pattern. 

The S could move the variable around a 
central vertical axis by remote control by 
moving a rod back and forth with his right 
hand. 

Subjects—The Ss were 18 undergraduates 
in psychology. None had had spectacles 
prescribed and all gave 8/8 correct responses 
on the Verhoeff stereopter. No S had pre- 
vious experience with aniseikonic lenses. 


Procedure 

Each S attended the laboratory for two 
sessions. The second session was always at 
least 2 days after the first.

Session 1.—The S’s eyes were first aligned 
with the center of the lens by adjusting a 
chin rest. The screens were then adjusted. 
The main part of the instructions was as 
follows: 


You will see a pattern painted in black 
ink on a sheet of glass. Different patterns
will be used during the experiment.
The patterns will appear to be slanted
at different angles. [The E demon- 
strated by slanting a sheet of glass 
mounted in the apparatus and containing 
a random dot pattern.] Your task will 
be to judge the slant of the pattern on 
the glass and to move this wheel so 
that it appears to be slanted by the same 
amount as the pattern on the glass. In 
other words, I want you to make the 
surface of the wheel parallel to the 
surface of the glass. [The E demonstrated
by slanting the random dot pattern
and then moving the wheel until it
was at approximately the same slant as 
the pattern.] Is that clear? Move it 
first so that it appears to be more 
slanted than the pattern, then back so 
that it appears less slanted than the 
pattern, then move it to and fro within 
this range until it appears to be the same 
slant as the pattern. There is no time 
limit. 

The E indicated that S was to observe the 
pattern as soon as the occluder was raised 
but was not to begin moving the wheel until 
after 5 sec. when E gave a signal. The S 
was told also that he was to look at the 
whole pattern, not just one corner of it, that 
he was always to use binocular vision, and 
that he was not to screw up his eyes. No 
mention of lenses was made. 

The S was then given 10 practice trials 
with an overall pattern of randomly spaced 
dots set at each of the following slants in 
random order: 6°R, 6°L, 12°R, 12°L, 18°R, 
18°L, 24°R, 24°L. The practice trials were 
binocular with plain glass before both eyes. 
He was then given the 21 aniseikonic trials 
(each of the seven patterns in combination 
with each of the three lens magnifications) 
in a random order which was different for 
each S. 

Session 2—After the same preliminary ad- 
justment of the screens as in Session 1, S 
was given the 21 aniseikonic trials again 
but in a different random order. He was 
then given the 21 monocular trials (each of 
the seven patterns at each of the three real 
slants). These were also presented in a 
different random order for each S. 
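
The trial schedule described above (seven patterns crossed with three lens magnifications, presented in a different random order to each S) can be sketched as follows. The function name and the use of a per-subject seed are illustrative assumptions; the source does not say how the random orders were generated, and distinct seeds make identical orders very unlikely rather than impossible.

```python
import itertools
import random

PATTERNS = ["A", "B", "C", "D", "E", "F", "G"]
MAGNIFICATIONS_PCT = [2, 3, 4]

def aniseikonic_trial_order(seed):
    """Return the 21 pattern-by-lens combinations in a shuffled order."""
    trials = list(itertools.product(PATTERNS, MAGNIFICATIONS_PCT))
    random.Random(seed).shuffle(trials)
    return trials

# One order per S for the 18 Ss of Exp. I (the seed stands in for the S number).
orders = {s: aniseikonic_trial_order(seed=s) for s in range(1, 19)}
print(len(orders[1]), orders[1][:3])
```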


Results 


Monocular trials —Figure 1 shows 
the mean error, defined as the dif- 
ference between the true slant and the 
obtained slant setting, for each stimulus 
pattern at each slant. Overestimations
of the true slant were scored plus. Underestimations
were scored minus.

An analysis of variance of the error 
scores was carried out. The main ef- 
fects for slant, stimulus pattern, and 
the interaction between them are all 
statistically significant, F (2, 30) =
46.9, p < .001; F (6, 90) = 36.2, p <
.001; F (12, 180) = 2.23, p < .01, respectively.
Scheffé’s (1959) method
was used to determine the significance 
of the differences between the mean 
errors for the seven stimulus patterns.
The mean errors for the horizontal line 
patterns (C and D) and the grid pat- 
terns (E and F), which are all close to 
zero, do not differ significantly. The 
mean errors for the vertical line pat- 
terns (A and B) and the random dot 
pattern (G) do not differ significantly 
from each other and are significantly 
greater than the mean errors for C, D, 
E, and F (p < .01). Pattern density
appears to have had no effect on the 
accuracy of slant judgments. 

These results show that, at least un- 
der the stimulus conditions used, linear 
perspective is a highly effective slant 
cue, whereas foreshortening and irregu- 
lar gradients are extremely poor slant 
cues. Adding foreshortening to linear 
perspective did not increase the accuracy
of slant judgments. The effectiveness
of linear perspective has previously been
demonstrated by Clark, Smith, and Rabe
(1956) and Cibis and Gerathewohl (1957).

Fig. 1. Mean errors for seven patterns as a
function of slant (monocular trials).

Aniseikonic trials —Figure 2 shows 
the mean slant setting for each stimulus 
pattern at each lens magnification. The
unbroken line shows the theoretical 
aniseikonic slants, calculated from 
Equation 1, representing fully stereo- 
scopic responses. 

An analysis of variance of the slant 
settings was carried out. The main ef- 
fects for stimulus pattern, lens magni- 
fication, and the interaction between 
them are all statistically significant, F 
(6, 102) = 10.4, p < .001; F (2, 34)
= 74.9, p < .001; F (12, 204) = 3.59, 
p < .001, respectively. 

As judged by Scheffé’s (1959) 
method the mean slants for the vertical 
line patterns (A and B) and the ran- 
dom dot pattern (G) do not differ sig- 
nificantly from each other and are 
significantly greater than the mean 
slants for the horizontal line patterns 
(C and D), p < .05, and the grid pat- 
terns (E and F), p < .05. The means 
for C, D, E, and F do not differ signifi- 
cantly from each other. 

Since C, D, E, and F were shown to 
contain significantly more effective per- 
spective cues than A, B, and G in the 
monocular trials and also showed significantly
less aniseikonic slant, the hypothesis
that the susceptibility of a surface to
aniseikonic slant distortion is inversely
related to the effectiveness of perspective
cues present is supported. Pattern density
had no significant effect on the degree
of induced aniseikonic slant.

Fig. 2. Mean aniseikonic slant for seven
patterns as a function of lens magnification.

With reference to Fig. 2, the induced 
slants for Patterns A, B, and G are 
approximately 2° above the theoretical 
line. In the binocular practice trials 
with no lens present, mean overestima- 
tions of 2.75°, 1.75°, and 2° were ob- 
tained for real slants of 10°, 14°, and 
18°, respectively. Therefore, the over- 
estimations in the aniseikonic trials are 
probably a function of some general 
property of the experimental situation 
and not an effect of the lenses per se. 

There is a suggestion in these results 
that the mean slants for the grid pat- 
terns (E and F) are intermediate to the 
slants of the horizontal and vertical line 
patterns, although by Scheffé’s (1959) 
method the means for E and F were not 
found to differ significantly from the 
means for C and D. There is also a 
suggestion that the curves diverge as 
lens magnification increases. As 
judged by Scheffé’s method, the mean 
slants for 3% and 4% lens magnifica- 
tions do not differ significantly, which 
suggests that as conflict increases re- 
liance on perspective increases in ac- 
cordance with Ittelson’s (1960) find- 
ings. In order to investigate these 
suggested trends, a further experiment 
was carried out using a wider range of 
lens magnifications.


EXPERIMENT II 


Method 


Since pattern density had no effect in Exp. 
I, only Patterns B (vertical lines), D (hori- 
zontal lines), and F (grid) were used in 
Exp. II. The lens magnifications used were 
1%, 2%, 3%, 4%, 6%, and 8%. Six and 8% 
magnifications were obtained by superimpos- 
ing two 3% and two 4% lenses, respectively. 
The theoretical slants (calculated from 
Equation 1) for the new magnifications are
4.5° for 1%, 26° for 6%, and 32.5° for 8%.
The apparatus, method, and procedure were 
the same as in Exp. I except that monocular 
trials were not included and only 14 Ss were 
used. 

Binocular practice trials were given with 
real slants of 0°, 5°R, 30°R, 40°R, 10°L, 
and 20°L in random order. Each of the 
three stimulus patterns was then presented 
with each of the six lens magnifications in 
random order. The same 18 trials were then 
repeated in a different random order. 


Results 


Figure 3 shows the mean slant set- 
ting for each stimulus pattern at each 
lens magnification. An analysis of 
variance was carried out. The main 
effects for stimulus pattern and lens 
magnification and the interaction be- 
tween them are all significant at the 
.001 level, F (2, 26) = 14.81; F (5, 
65) = 56.92; F (10, 130) = 13.28, re- 
spectively. 

The means for the three stimulus pat- 
terns differ significantly from each 
other at the .01 level as judged by the 
Scheffé (1959) method. Straight lines 
were fitted to each of the three curves 
shown in Fig. 3. Departures from 
linearity were tested by means of ap- 
proximate F ratios, using in the de- 
nominator a linear combination of the 
Ss × Lens interaction mean squares,
and using Satterthwaite’s (1941) approximation
to determine the denominator
degrees of freedom. Departures from
linearity were found to be insignificant
for all three curves.

Fig. 3. Mean aniseikonic slant for three
patterns as a function of lens magnification
(Exp. II).
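
For readers unfamiliar with the approximation named above, the sketch below shows the standard Satterthwaite formula for the degrees of freedom of a linear combination of mean squares, df ≈ (Σ aᵢMSᵢ)² / Σ[(aᵢMSᵢ)²/dfᵢ]. The coefficients and mean squares are hypothetical, chosen only for illustration; the source does not report the values that entered the denominator. The df of 65 corresponds to an Ss × Lens term with 14 Ss and six magnifications.

```python
def satterthwaite_df(coefficients, mean_squares, dfs):
    """Approximate df for the linear combination sum(a_i * MS_i)."""
    terms = [a * ms for a, ms in zip(coefficients, mean_squares)]
    numerator = sum(terms) ** 2
    denominator = sum(t ** 2 / df for t, df in zip(terms, dfs))
    return numerator / denominator

# Hypothetical mean squares, one per pattern (not values from the experiment).
approx_df = satterthwaite_df([1.0, 1.0, 1.0], [12.0, 8.0, 10.0], [65, 65, 65])
print(round(approx_df, 1))
```

With a single mean square the formula returns that term's own df, which is a convenient sanity check.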

The parallelism of the three fitted 
straight lines was tested, and F (2, 
130) = 60, p < .001. Therefore, the
null hypothesis that the curves are 
parallel can be rejected. The remain- 
ing portion of the interaction after re- 
moval of the SS for parallelism was in- 
significant. It can be concluded that 
the interaction is entirely accounted for 
by the differences in slopes of the 
three curves. 

In the binocular practice trials an 
overestimation of approximately 2.5° 
was obtained for each real slant. A 
constant of 2.5° was subtracted from 
each mean for Pattern B. A mean 
square for fit to the theoretical line was 
calculated and was insignificant. It 
can be concluded that the curve for B 
fits the theoretical line, with the reser- 
vation that the estimate of constant 
error obtained from the practice trials 
might not be correct for the aniseikonic 
trials. 

Individual differences—The curves 
of six Ss were very similar to the aver- 
age curves shown in Fig. 3. The set- 
tings of three Ss were considerably less 
than the theoretical slants (for all pat- 
terns) although the order B > F > D 
was clearly preserved. Five Ss relied 
almost completely on stereoscopic cues 
for all patterns at all magnifications ex- 
cept 8% where the curves of two Ss 
showed a sharp divergence in the di- 
rection of the average results. Even 
for these five Ss the small differences 
between the means for the three pat- 
terns tended to be in the direction 
B>F>D. 

Individual reliability was high rela- 
tive to group variability. Pearson 
product-moment correlation coefficients 
were calculated between the first and 
second trials for Patterns B, D, and F
at 4%, 6%, and 8% magnifications.
The coefficients range from .52 to .89.
All are significant at the .01 level ex- 
cept for B4% and B6% which are 
significant at the .05 level. 

It appears that the results obtained 
represent stable responses to conflicting 
cues although the weight given to each 
set of cues varies among Ss. When 
questioned, Ss did not report difficulty 
in making their settings and did not 
report vacillation between incompatible 
slants. Many Ss expressed surprise 
when informed after the experiment 
that the slants experienced were lens 
induced. Unfortunately the question- 
ing was informal rather than system- 
atic. 


DISCUSSION AND CONCLUSIONS


The results of both experiments show 
that stimulus patterns with ineffective 
perspective slant cues demonstrate full 
aniseikonic slant distortion. Those pat- 
terns with effective perspective cues also 
demonstrate aniseikonic slant distortion 
but to a significantly lesser degree. Ex- 
periment II shows that in both instances 
the degree of distortion increases linearly 
with lens magnification, although the rate 
of increase is lower for stimulus patterns 
with effective perspective cues. 

The fact that the average curve for 
Pattern D did not deviate significantly 
from linearity despite its parabolic appear- 
ance means that the hypothesis that reli- 
ance on perspective cues increases as lens 
magnification increases is not supported.

The other trend suggested in the re- 
sults of Exp. I, namely that the mean 
aniseikonic slant settings for the grid 
pattern are intermediate to the slants of 
the vertical and horizontal line patterns, 
is clearly borne out by the results of 
Exp. II. 

In a previous experiment (Gillam 
1964), five dots in a line were found to 
be sufficient stimuli for a full stereoscopic 
response to aniseikonia. Therefore, there
were ample stimuli in the horizontal line 
patterns for the stereoscopic cue to be
used if the dots had not formed a linear
perspective cue configuration. Since the
addition of vertical lines to the horizontal 
line pattern increased the dominance of 
stereopsis in the slant resolution of the 
surface, it can be concluded that the 
presence of stimulus configurations, which 
are strong stereoscopic stimuli but not in 
themselves effective perspective cues, can 
modify the effects of perspective on the 
degree of aniseikonic slant induced. 

The study of cue conflict has been 
neglected, as compared with the study of 
cue effectiveness. Yet, responses to cue 
conflict provide a better indication of the 
degree to which different types of cue 
are actually relied upon in conditions 
where multiple cues are available, as in 
everyday life.

PRETASK INFORMATION IN CONCEPT IDENTIFICATION

The purpose of the study was to determine the effects of pretask information
about the relative frequency of different stimulus events
upon concept identification. 3 levels were used: No pretask information,
I1; more-less information, I2; and exact relative frequencies, I3.
Ss receiving exact pretask information identified the concepts in 
significantly fewer trials than the other 2 groups, while Ss receiving 
more-less information were significantly superior to those receiving 
no pretask information. This effect was independent of the constitu- 
tion of conjunctive concepts utilized. The information seemed to have 
its major effect through the reduction in number of trials Ss spent in 
learning the relative frequencies of different responses, e.g., through
probability matching.


Several studies (Grier & Bornstein, 
1966; Mandler & Cowan, 1962; 
Suppes & Ginsberg, 1963) have noted 
that Ss’ responses in concept identification
tasks will probability match the frequency
of occurrence of the different
stimuli before correct identification 
takes place. Mandler, Cowan, and 
Gold (1964) have suggested a two-process
learning model to account for
these results; S must learn how fre- 
quently a given response is required 
(probability matching) and he also 
must learn when the response is re- 
quired (event matching). This model 
is supported by the Grier and Born- 
stein study which found that a concept 
requiring probability matching was 
more difficult to learn than one in 
which the relative frequency of the 
concept events agreed with S’s initial 
base rate of guessing. 

The fact that concept difficulty seems 
to depend in part on the amount of 
modification that Ss must make in their 
guessing habits suggests the necessity 
of extending Mandler, Cowan, and 
Gold’s model to include the specification
of an initial guessing state. According
to such an extended model Ss
start concept identification in an initial
specified guessing state; after
some experience on the task their responses
will probability match the frequency
of occurrence of the different
stimuli, if they do not already do so;
then event matching occurs and the
learned state is attained.

1 The authors would like to express thanks
to John W. Binford of the University of
Louisville for his helpful suggestions and
encouragement.

On the other hand, if concept iden- 
tification is viewed as an information- 
collecting task (Hunt, 1962), rather 
than as a learning task, the three-state 
model would predict that two kinds of 
information are collected—information 
about the relative frequency of events 
and information about the identity of 
the events. Furthermore, the value of 
this information should be independent 
of its source, that is, Ss should be able 
to use pretask information as well as 
information gained during actual task 
performance. While it seems clear
that pretask information about the
identity of the stimulus events (e.g.,
all red flowers with two leaves are
“roses”) will improve concept performance,
it is not as obvious that Ss
will be able to use pretask information
about only the relative frequency of
the different events to facilitate identification.

The purpose of the following study 
is to find out if and how Ss can use 
pretask information about relative fre- 
quencies to improve their concept iden- 
tification performance. The following 
specific hypothesis can be generated 
from the three-state model: Ss will 
successfully use this pretask informa- 
tion to facilitate the move from the ini- 
tial guessing state to the probability- 
matching state and, thereby, reduce the 
total number of trials prior to correct 
concept identification. 


METHOD 


Experimental design—A 3 × 3 factorial
design was used consisting of three levels of
pretask information and three different conjunctive
concepts to be identified. In Pretask-Information
Treatment I1, Ss received
no information about the relative frequencies
of stimulus events; in I2, Ss were told only
that “there will be more B-stimulus events 
than A events”; and Ss in I3 were told that 
“there will be 75% B-stimulus events and 
only 25% A-stimulus events.” The three 
conjunctive concepts were constructed from 
the same set of stimulus dimensions and 
values and were introduced for the purpose 
of eliminating prepotency factors. 

Stimuli—The stimuli were geometric fig- 
ures which varied along three binary dimen- 
sions—form, orientation, and location; the 
form could be either a cross or a circle; an 
enclosing rectangle could be oriented either 
horizontally or vertically; and the pair could 
be located either to the right or left of a 
vertical center line. In each case a concept 
was determined by the conjunction of two 
stimulus values from two of the dimensions 
with the third being irrelevant. Positive in- 
stances of the concept were designated A by 
E while noninstances were labeled B. 

The 2³ = 8 different stimulus combinations,
printed on standard yellow bond and fastened
in a three-ring notebook, were randomized
into 15 blocks of eight with the restriction
that each stimulus appear only once in each
block.
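
The stimulus construction just described can be sketched as follows. The particular concept coded here (cross AND horizontal, with location irrelevant) is a hypothetical example, since the source does not list the three concepts actually used; the function names and the seed are likewise illustrative. Any two-value conjunction over three binary dimensions picks out two of the eight stimuli, which yields 25% A events and 75% B events.

```python
import itertools
import random

FORMS = ("cross", "circle")
ORIENTATIONS = ("horizontal", "vertical")
LOCATIONS = ("left", "right")

# All 2**3 = 8 stimulus combinations.
STIMULI = list(itertools.product(FORMS, ORIENTATIONS, LOCATIONS))

def label(stimulus, form="cross", orientation="horizontal"):
    """Label a stimulus A if it satisfies a hypothetical conjunctive concept, else B."""
    return "A" if stimulus[0] == form and stimulus[1] == orientation else "B"

def make_blocks(n_blocks=15, seed=0):
    """Shuffle the eight stimuli independently within each block."""
    rng = random.Random(seed)
    return [rng.sample(STIMULI, k=len(STIMULI)) for _ in range(n_blocks)]

blocks = make_blocks()
labels = [label(s) for block in blocks for s in block]
print(len(labels), labels.count("A") / len(labels))
```

Fifteen blocks of eight give the 120-trial maximum mentioned in the procedure.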

Procedure —The S observed each stimulus 
for 3 sec. At the end of the presentation S 
indicated on an answer sheet whether the 
stimulus event had been an A or a B. When
he had responded the correct label was given
by E and the next stimulus event presented. 
All Ss either received a maximum of 120 
trials or terminated the experimental session 
with a correct identification as indexed by a 
minimum of 16 successive correct responses. 


RESULTS 


Analysis of variance of error scores 
revealed a statistically significant pre- 
task information effect, F (2, 171) = 
5.31, p < .01. Since neither the type 
of concept used nor the Information ×
Concept interaction approached signifi- 
cance, all further analyses were based 
on information treatments only with 
Ss being collapsed across concepts. 

Efficiency of performance.—Four
kinds of evidence indicate that Ss who
received more precise levels of pretask
information were more efficient at concept
identification. The learning parameter
estimates, p, based on error
scores for Treatments I1, I2, and I3
were .114, .138, and .180, respectively.
Testing all pairs for equality of parameters
(Drews, 1964) revealed significant
differences in each case. For I1
vs. I2, χ² (1) = 11.27, p < .01. Correspondingly,
for I2 vs. I3 and I1 vs.
I3, χ² (1) = 15.36, p < .01, and χ²
(1) = 52.80, p < .01. The data presented
in Table 1 concerning mean
trials to the last error (LE) and the 
number of learners (N) for each of 
the pretask-information treatments also 
reveal the facilitory effect of more pre- 
cise information. 


TABLE 1

MEAN NUMBER OF TRIALS SPENT IN VARIOUS
STATES OF THE CONCEPT IDENTIFICATION PROCESS

                     State
Group    N     RGS      PMS      LE
I1      28    40.04    31.78    71.82
I2      39    35.63    19.64    55.27
I3      48    27.83    19.63    47.46


TABLE 2

THEORETICAL DISTRIBUTION OF THE NUMBER OF A RESPONSES
EXPECTED IN 10 TRIALS UNDER RGS AND PMS

State    0     1     2     3     4     5     6     7     8     9     10
RGS     .00   .01   .04   .12   .20   .25   .20   .12   .04   .01   .00
PMS     .05   .19   .28   .25   .15   .05   .01   .00   .00   .00   .00


The concept identification process — 
If Ss are in a random-guessing state 
during the first 10 trials, then equal 
numbers of them should record totals 
of A responses that are above and be- 
low the median value of five. Using 
a chi-square analysis it was determined 
that I1 Ss were responding with significantly
more As than Bs, χ² (1) =
5.40, p < .05. Correspondingly, I2 Ss
were responding with essentially similar
numbers of As and Bs, while I3 Ss
were emitting significantly fewer As
than Bs, χ² (1) = 7.23, p < .05. Thus,
it appears that the more precise the 
pretask information was, the more Ss’ 
initial base rates of responses were 
modified toward the actual frequencies. 
Similar analyses were conducted on the 
last 10 trials preceding the last error 
and indicated that all three groups had 
essentially modified their base rates to 
probability-matching levels, i.e., 25% 
As and 75% Bs. Analyses for I2 and 
I3 treatments proved to be statistically 
significant, χ² (1) = 12.10, p < .05,
and χ² (1) = 6.60, p < .05. The test
for I1, however, only approached significance,
p = .10, but it should be recalled
that it was this group that was
originally biased in the direction of 
emitting significantly more As than Bs. 

An information analysis on the responses
up to and including the last
error indicated that before correct
identification, Ss’ responses contained
little or no information about the stimulus
identity, H (t) = .04 bit/stimulus,
for all three groups combined. Individually,
the amount of stimulus information
transmitted by Ss in I1, I2,
and I3 was .05, .04, and .03, respectively.

The theoretical distribution of the 
number of A responses expected in 10 
trials under a purely random-guessing 
state (RGS) and a guessing state with 
probability matching superimposed on 
it (PMS) is given in Table 2. It is 
clear that when S has either zero, one, 
two, or three A responses in a block 
of 10 trials he should be declared in a 
PMS, and when he has four or more 
A responses he is declared in a RGS. 
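
The decision rule stated above can be illustrated with a short sketch. It assumes that both theoretical distributions are binomial over 10 trials, with P(A) = .50 under RGS and P(A) = .25 under PMS; the source does not say how the Table 2 entries were computed, and a binomial with p = .25 differs from a few of the tabled PMS values in the second decimal place. The function names are illustrative. Each block is assigned to the state under which its count of A responses is more probable.

```python
from math import comb

def binomial_pmf(k, n, p):
    """Probability of exactly k A responses in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

def classify_block(a_count, n=10, p_rgs=0.5, p_pms=0.25):
    """Assign a block to the state under which its A count is more probable."""
    if binomial_pmf(a_count, n, p_pms) > binomial_pmf(a_count, n, p_rgs):
        return "PMS"
    return "RGS"

for k in range(11):
    print(k,
          round(binomial_pmf(k, 10, 0.5), 2),
          round(binomial_pmf(k, 10, 0.25), 2),
          classify_block(k))
```

Under these assumptions the two distributions cross between three and four A responses, which agrees with the cutoff given in the text.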

Using this decision procedure and 
beginning with the last complete block 
of 10 trials before the last error for 
each S, that point at which the transi- 
tion from RGS to PMS occurred was 
determined. The mean number of 
trials spent in RGS and PMS for each 
of the three pretask-information groups 
is given in Table 1. Pretask informa- 
tion, as presented to Ss in I2 and I3, 
had the effect of reducing the number 
of trials spent in RGS. The reduction 
in trials over I1 was 11% and 31%,
respectively. The fact that the number
spent in PMS was also reduced to
the extent of 38% over the I1 treatment
was totally unexpected.
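
The percentage reductions can be recomputed from the Table 1 means with a few lines of arithmetic. This is only a check on the reported figures; the dictionary layout and function name are illustrative. Because the tabled means are rounded, a recomputed value may differ from the text by about one percentage point.

```python
# Mean trials per state, copied from Table 1.
TABLE_1 = {
    "I1": {"RGS": 40.04, "PMS": 31.78},
    "I2": {"RGS": 35.63, "PMS": 19.64},
    "I3": {"RGS": 27.83, "PMS": 19.63},
}

def percent_reduction(group, state, baseline="I1"):
    """Percentage reduction in mean trials relative to the baseline group."""
    base = TABLE_1[baseline][state]
    return 100.0 * (base - TABLE_1[group][state]) / base

for group in ("I2", "I3"):
    for state in ("RGS", "PMS"):
        print(group, state, f"{percent_reduction(group, state):.1f}%")
```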


DISCUSSION


The results show clearly that Ss can 
use pretask information about only the 
relative frequencies of stimulus events to 
improve their concept identification performance.
The three-state model proposed
in the introduction seems necessary
to account for these data. It specified
two possible ways in which this informa- 
tion could be used by Ss: It could facili- 
tate the move from the initial random- 
guessing state to the probability-matching 
state and/or the move from the prob- 
ability-matching state to event matching. 
Evidence was presented that both transi- 
tions were facilitated. The fact that the 
number of trials prior to probability 
matching was reduced was not totally 
unexpected. It indicates that informa- 
tion about stimulus frequencies can be 
more efficiently transmitted through pre- 
task manipulations than through Ss 
accumulating experience with the actual 
stimulus events over trials. That the 
same pretask information also aids the
event-matching process was unexpected 
and at present remains unexplainable.

While temporal summation for brightness has been found to be
independent of the distribution of energy within the critical duration, 
there is some question as to whether the same conclusion holds for 
temporal summation in form perception. Backward and forward 
masking experiments have failed to show a critical duration for form 
which suggests that interruption in the energy presentation impairs 
temporal integration. Exp. I failed to find a critical duration for
form when light energy was presented in 10 pulses separated by 
intervals (IPIs) varying from 2-48 msec. Form identification ap- 
peared to be a monotonic decreasing function of pulse-train duration 
(PTD) even when the PTDs were less than 100 msec. Exp. II
replicated Exp. I in part and in addition found that form identifi- 
cation was significantly poorer at all PTDs for 2 longer light pulses 
separated by a single dark interval than for 10 pulses separated by 9
shorter dark IPIs.


Many instances of backward and 
forward masking in vision can be un- 
derstood in terms of limitations in tem- 
poral resolution in the visual system. 
If a ring masking stimulus precedes or 
follows a form by an interstimulus in- 
terval (ISI) of 50 msec. or even longer, 
identification performance for the form 
is comparable to what is obtained if 
ring and form are presented simultaneously.
Similarly, a flash of light
preceding or following presentation of 
a form at short ISIs yields perform- 
ance comparable to what is obtained 
when the flash and form are presented 
simultaneously. The masking by the 
ring stimulus is attributable to the 
greater difficulty in identifying a form
surrounded by a ring than a form in
an uncluttered field. And masking by
a flash results from luminance summation
and resulting contrast reduction
for the form stimulus (Eriksen, 1966b;
Eriksen & Collins, 1965).

1 This investigation was supported by
United States Public Health Service Research
Grant MH-1206 and a United States
Public Health Service Research Career Program
Award K6-MH-22,014.

2 The third author, as a National Science
Foundation undergraduate research participant
at the University of Illinois, summer
1967, was supported by Grant NSF GY 2538.

The best known instance of the lack 
of temporal resolution in the visual 
system is Bloch’s law (I × T = K).
Within certain limits, a 10-ml. light 
pulse of 50-msec. duration is judged 
to be equally as bright as a 5-ml. pulse 
of 100-msec. duration. The temporal 
interval over which this relationship 
holds is designated the critical duration 
and for brightness judgments it is gen- 
erally found to be on the order of 100 
msec. Recent work with form percep- 
tion has suggested appreciably longer 
critical durations (Kahneman & Nor- 
man, 1964). 
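
The reciprocity expressed by Bloch’s law can be made concrete with the numerical example from the text. In the sketch below the function names are illustrative, and the 100-msec cutoff is an assumed value taken from the order of magnitude cited for brightness judgments, not a measured critical duration.

```python
def bloch_energy(luminance_ml, duration_msec):
    """Luminance-duration product (the constant K in I x T = K)."""
    return luminance_ml * duration_msec

def equally_bright(pulse_a, pulse_b, critical_duration_msec=100.0):
    """True if two (luminance, duration) pulses have equal products
    and both durations fall within the assumed critical duration."""
    within = all(duration <= critical_duration_msec
                 for _, duration in (pulse_a, pulse_b))
    return within and bloch_energy(*pulse_a) == bloch_energy(*pulse_b)

# The example from the text: 10 ml. for 50 msec. versus 5 ml. for 100 msec.
print(equally_bright((10, 50), (5, 100)))
```

Both pulses have a product of 500, so the check holds; lengthening either pulse beyond the assumed critical duration makes the function return False, reflecting the limits within which reciprocity applies.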

The relation between Bloch’s law 
and masking seems obvious. That is, 
if a ring masking stimulus and a form 
are presented within the critical dura- 
tion, the result should be a composite 
perception. The form-identification
task should be essentially the same as 
if the ring and the form were combined 
in a composite stimulus. There is, 
however, an anomaly. The masking 
data provide no indication of a criti- 
cal duration. Instead, masking has 
been found to decrease in a monotonic 
function with even small ISIs (Erik- 
sen, 1966b; Eriksen & Collins, 1964, 
1965). Kahneman and Norman’s 
(1964) data indicate a critical duration 
of approximately 200-300 msec. for 
form identification. One would ex- 
pect, therefore, performance to be iden- 
tical with the level obtained when test 
and masking stimuli are presented si- 
multaneously over ISIs as long as the 
critical duration values that Kahneman 
and Norman’s data have indicated. 
In fact, even an ISI as short as 10 
msec. results in an improvement in
performance over the level obtained for 
simultaneous presentation. 

It is possible that the interruption
in stimulation in the form of the ISI 
is responsible for the failure to obtain 
a critical duration. For brightness 
judgments, however, discontinuity in 
the light presentations has been found 
to be a matter of indifference. Per- 
formance is still determined by the 
amount of energy that occurs within 
the critical duration irrespective of 
whether the energy is presented con- 
tinuously or in intermittent pulses 
(Blackwell, 1963; Davy, 1952; Lich- 
tenstein & Boucher, 1960). 

The reciprocity relation for form
identification is different than for 
brightness judgments. This has al- 
ready been suggested by the finding of 
longer critical durations for form iden- 
tification (Kahneman & Norman, 
1964; Kahneman, Norman, & Kubovy, 
1967). Kahneman (1964, 1966) 
among others has speculated that part 
of the differences in reciprocity ob- 


tained for form perception and bright- 
ness judgment is accountable in terms 
of form being mediated by higher ner- 
vous centers, whereas brightness judg- 
ments are mediated by the retina and 
lower centers in the visual system. 
These considerations, as well as the 
empirical differences in critical dura- 
tions, suggest that reciprocity for form 
perception with pulsed light cannot be 
assumed merely because the relation 
has been found to hold for brightness 
judgment. In fact, there is some indi- 
cation that discontinuity detectors in 
the visual system may function to break 
up integration of pattern-type informa- 
tion over time (Eriksen & Collins, 
1967, 1968). 


EXPERIMENT I 
Method 


Subjects—Four female students at the 
University of Illinois served as paid Ss. All 
had normal or corrected-to-normal vision. 

Apparatus—The equipment was a modi- 
fication of a 10-channel tachistoscope. Ten 
individual light sources were used, one for 
each field. The “on” time of each light 
source was controlled by a separate Tek- 
tronix wave form generator while the dark 
intervals, or interpulse intervals (IPIs), 
were timed and controlled by nine Tektronix 
pulse generators. Both the wave form 
and pulse generators were calibrated before 
and checked periodically throughout the ex- 
periment by means of a Tektronix oscillo- 
scope. The timing of these components 
never varied more than 2% from their origi- 
nal calibration. 

The 10 light sources were Sylvania F4T5/ 
CWX fluorescent lamps. At one end of the 
visual alley the light from each source was 
directed to the same spot on a milk glass 
diffusing screen by 4-in. American Optical 
Co. fiberoptic light guides. This arrange- 
ment produced a one-channel tachistoscope 
with 10 independent light sources. The
luminance of each source was adjusted for 
equality with the other sources at the dif- 
fusing screen by eye and by the oscilloscope 
before and periodically during the experi- 
ment. The total luminance of the 10 sources 
presented simultaneously was 1.26 ml. mea- 
sured by a MacBeth illuminometer. Rise 
and decay time of the lamps are specified by
the manufacturer to be less than .5 msec. 

The stimuli were the capital letters A, 
T, and U in News Gothic style and each 
subtended 10 min. of visual angle from a 
viewing distance of 38 in. They were photo- 
graphed on slide transparencies, white on 
black, and mounted between standard 35- 
mm. slide covers. The area around the let- 
ters was painted to assure 99+% contrast. 
The Ss viewed the stimuli monocularly with 
their dominant eye through an artificial pupil 
of 2.5 mm. The fixation point was always 
present in the center of the field and was 
provided by a glowing orange cross. The
stimulus was presented in one of the four 
quadrants in the viewing field 20 min. of 
visual angle from the center of the field. 
The maximum edge-to-edge distance of the 
stimuli was 1° of visual angle. 

Procedure—All Ss were run under two 
conditions of pulse-length duration. Under 
the 2-msec. pulse condition each of the 10 
lamps was pulsed for 2 msec. in making up 
the various pulse trains. The 4-msec. pulse- 
duration condition was identical except the 
individual lamp pulses were all 4 msec. in 
duration. In both conditions the pulses were 

formed into pulse trains which were com- 
posed of 10 physically discrete pulses, 
one pulse of light from each source. For 
each condition a base-level performance was 
determined for conditions of essentially con- 
tinuous light stimulation. For these base- 
level observations, the onset of each light 
pulse, after the first, occurred .6 msec. after 
the offset of the previous pulse. Phe- 
nomenally, and as viewed on the oscilloscope, 
each base-level condition pulse train formed
a single pulse, 20 msec. long for the 2-msec. 
pulse condition and 40 msec. long for the
4-msec. pulse condition.

The 16 experimental conditions were 
formed by inserting IPIs of equal duration 
between the 10 light pulses. IPIs of 2, 4, 
6, 8, 12, 28, 32, and 48 msec. were used, re- 
sulting in pulse-train durations (PTDs) of 
38, 56, 74, 92, 128, 272, 308, and 452 msec. for 
the 2-msec. pulse condition and 58, 76, 94, 
112, 148, 292, 328, and 472 msec. for the 
4-msec. pulse condition. 
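
The pulse-train durations listed above follow from simple arithmetic: 10 pulses plus 9 equal dark intervals. The sketch below reproduces that arithmetic; the function name `pulse_train_duration` is an illustrative assumption and not part of the original procedure.

```python
N_PULSES = 10                            # one pulse from each light source
IPIS_MSEC = [2, 4, 6, 8, 12, 28, 32, 48]  # interpulse intervals used


def pulse_train_duration(pulse_msec, ipi_msec, n_pulses=N_PULSES):
    """Total duration of n pulses separated by n - 1 equal dark intervals."""
    return n_pulses * pulse_msec + (n_pulses - 1) * ipi_msec


# PTDs for the 2-msec. and 4-msec. pulse-duration conditions.
ptds_2 = [pulse_train_duration(2, ipi) for ipi in IPIS_MSEC]
ptds_4 = [pulse_train_duration(4, ipi) for ipi in IPIS_MSEC]
```

Each PTD in the 4-msec. condition exceeds its 2-msec. counterpart by 20 msec., the extra light time contributed by the 10 longer pulses.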

Each S was run at a luminance level which 
allowed approximately 75% identification ac- 
curacy under the base level of essentially 
continuous light. The level of luminance was 
changed by the use of neutral density filters 
to obtain this accuracy level for each S. 
Practice effects caused the average base level 
of recognition accuracy to be higher than 


the originally determined levels. The actual 
base levels ranged from 73.5-88.5%. The 
presentation of stimuli was arranged in 24 
trial blocks so ordered that each letter ap- 
peared an equal number of times, and in 
each quadrant of the field an equal number 
of times. Within these constraints and the 
additional constraint that no letter or quad- 
rant was presented more than three times in 
a row, the order of presentation was ran- 
dom. Six random sequences were used to 
avoid learning effects.
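
One way to build trial sequences under the constraints just described can be sketched as follows. This is a hedged illustration, not the procedure actually used: it assumes that a block contains 24 trials in which every letter-quadrant pairing occurs twice, it labels the quadrants 1 to 4 arbitrarily, and it uses simple reshuffling until the run-length constraint is met.

```python
import random
from itertools import product

LETTERS = ("A", "T", "U")
QUADRANTS = (1, 2, 3, 4)   # arbitrary labels for the four field quadrants
MAX_RUN = 3                # no letter or quadrant more than three times in a row


def longest_run(values):
    """Length of the longest run of identical consecutive values."""
    best = run = 1
    for prev, cur in zip(values, values[1:]):
        run = run + 1 if cur == prev else 1
        best = max(best, run)
    return best


def make_block(rng, repeats=2):
    """Return one block of (letter, quadrant) trials.

    Assumption: each letter-quadrant pairing occurs `repeats` times
    (24 trials when repeats=2), so letters and quadrants are balanced.
    """
    trials = list(product(LETTERS, QUADRANTS)) * repeats
    while True:
        rng.shuffle(trials)  # reshuffle until the run constraint holds
        letters = [letter for letter, _ in trials]
        quadrants = [quad for _, quad in trials]
        if longest_run(letters) <= MAX_RUN and longest_run(quadrants) <= MAX_RUN:
            return list(trials)


rng = random.Random(1967)  # arbitrary seed, used only for reproducibility
sequences = [make_block(rng) for _ in range(6)]  # six random sequences
```

Rejection by reshuffling is chosen here for clarity; it keeps the balance constraints exact and leaves the ordering otherwise random.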

Each experimental session consisted of a 
series of five trial blocks, two blocks of one 
PTD, two blocks of another PTD, and one 
block of the base level. Within a session 
only 2- or 4-msec. pulses were used. The 
order of presentation of the base-level block 
was counterbalanced across sessions. Pulse- 
train duration sessions were counterbalanced 
in either an ABBA or an ABAB order. 
The order of PTD conditions was coun- 
terbalanced within pulse lengths by a ran- 
domized Latin square design. 


Results 


In Fig. 1 the data for the 2-msec. 
pulse-duration condition are shown.
Here the percentage of identification 
accuracy is plotted as a function of 
PTD in milliseconds. The heavy 
solid line is the average performance 
of the four Ss. The individual curves 
also are shown. Figure 2 presents the 
same data for the 4-msec. pulse-duration
condition. A three-way analysis
of variance (Ss × PTD × 2- and 4-msec.
pulse-duration conditions) was
performed on the data of Fig. 1 and 2. 
The only significant effect was for 
PTD, F (8, 24) = 88.73, p < .01. 

Inspection of Fig. 1 and 2 suggests 
that for both pulse-duration conditions 
identification performance decreases 
progressively as PTD is increased. 
For the 2-msec. pulse-duration data 
there is no suggestion of a critical 
duration over which reciprocity holds. 
If we accept 100 msec. as a conserva- 
tive critical duration for form, we 
would expect performance to remain 
constant from the base level out to a 
PTD of 92 msec. or more. The least 
FIG. 1. Average and individual S’s percentage of identifications as a function of pulse-train
duration for the 2-msec. pulse-duration condition.


squares best fit regression line for the 
five data points within the 92-msec. 
pulse-train duration has a slope of 
-.92. This slope differs significantly
from zero, t (18) = -2.75, p < .01.

The data from the 4-msec. pulse- 
duration condition are a little less un- 
equivocal. If we wish to stay within 
a total pulse-train duration of 100 
msec. (IPIs 6 msec. and less), the 
slope of the regression line is -.45,
which is not significantly different from
zero, t (14) = -1.14, p > .10.


Discussion 


The above results are in close agree- 
ment with Zuercher’s (1965) findings. 
He also studied form identification with 
pulsed light flashes but had employed 
black on white, front-lighted stimuli. In 
neither experiment is there evidence for 
a critical duration over which time-inten- 
sity reciprocity holds for form identifica- 
tion. The failure to obtain reciprocity 


seems most likely to be attributable to the 
discontinuity or interruption in the pre- 
sentation of the energy. Kahneman 
(1964) and Kahneman and Norman 
(1964) have found rather long critical 
durations for form perception when the 
energy is presented continuously. That 
the discontinuity or IPIs are the critical 
variable is suggested also by the back- 
ward and forward masking data (Erik- 
sen, 1966b; Eriksen & Collins, 1965) and 
by the impairment in pattern integration 


3 Their findings received some confirma- 
tory evidence in the present study. In the 2- 
msec. pulse-duration condition, the base level 
consisted of essentially 20 msec. of continu- 
ous energy (less than .6 msec. between suc- 
cessive pulses). The base level in the 4- 
msec. pulse condition consisted of essentially 
a 40-msec. continuous pulse. The Ss 
required almost exactly twice as much lumi- 
nance for the base level in the 2-msec. dura- 
tion condition as in the 4-msec. duration con- 
dition indicating that reciprocity was obtained 
when the energy was essentially continuous 
in presentation. 
FIG. 2. Average and individual S’s percentage of identifications as a function of pulse-train
duration for the 4-msec. pulse-duration condition.


over time when stimulation is discontinu- 
ous (Eriksen & Collins, 1967, 1968). 

That discontinuity or interruption in 
the energy presentation is a critical vari- 
able in determining whether reciprocity 
holds for form perception has both em- 
pirical and theoretical support. The 
masking studies involve the discontinuity 
in the stimulation as did the work on 
pattern integration over time. Interrup- 
tion in energy presentation is, in itself, 
a stimulus and might be expected to stim- 
ulate discontinuity detectors in the visual 
system. That “off” receptors may serve 
such a function has been suggested by 
Granit (1947), and Barlow (1961) has
also speculated along these lines. 


EXPERIMENT II 


In the preceding experiment there 
were always nine dark intervals equal 
in duration and evenly spaced between 
the 2- or 4-msec. light pulses. In 


Exp. II the number as well as the 
duration of the dark intervals were 
manipulated. The control or 10-pulse 
condition was identical to the 4-msec. 
pulse condition of the preceding ex- 
periment. There were 10 4-msec. 
pulses of light with nine IPIs of 2, 4, 
or 6 msec. In the 2-pulse condition 
the period of darkness occurred as a 
single interval (18, 36, or 54 msec.) 
between two 20-msec. light pulses. 
These IPIs or dark intervals matched 
the total dark time per stimulation of 
the 10-pulse condition and gave PTDs 
of 58, 76, and 94 msec. across the two 
conditions. Again a base-level condi- 
tion was employed under which per- 
formance was assessed with 40 msec. 
of essentially continuous light. 
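
The matching of the two conditions described above can be verified with a few lines of arithmetic. The sketch below is illustrative only; the helper names `ten_pulse` and `two_pulse` are assumptions introduced here.

```python
PULSE_MSEC = 4          # pulse length in the 10-pulse (control) condition
N_PULSES = 10
LONG_PULSE_MSEC = 20    # each of the two pulses in the 2-pulse condition
IPIS_MSEC = (2, 4, 6)   # dark intervals in the 10-pulse condition


def ten_pulse(ipi_msec):
    """Return (light, dark, PTD) in msec. for ten pulses and nine IPIs."""
    light = N_PULSES * PULSE_MSEC
    dark = (N_PULSES - 1) * ipi_msec
    return light, dark, light + dark


def two_pulse(dark_msec):
    """Return (light, dark, PTD) in msec. for two pulses and one dark interval."""
    light = 2 * LONG_PULSE_MSEC
    return light, dark_msec, light + dark_msec


# Pair each 10-pulse train with the 2-pulse train of equal total dark time.
design = []
for ipi in IPIS_MSEC:
    control = ten_pulse(ipi)
    matched = two_pulse(control[1])
    design.append((control, matched))
```

Under these definitions the two conditions share the same light time, the same total dark time, and therefore the same PTD at every level; only the distribution of the dark time differs.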

This experiment, in addition to pro- 
viding a replication of the more critical 
PTDs of the preceding study, also may 
be expected to provide information on
the existence of discontinuity detectors. 
If such detectors exist, it would seem 
that single dark intervals of 18, 36, and 
54 msec. would be more effective stim- 
uli for such detectors than nine 2-, 4-, 
or 6-msec. dark intervals interspersed 
among 10 light pulses. 


Method 


Subjects—Eight University of Illinois students,
two female, served as paid Ss. All
had normal or corrected-to-normal vision 
and one had served as an S in the preceding 
experiment. 

Apparatus—The equipment and stimulus 
materials were identical to those employed in 
the preceding experiment. 

Procedure—Each S served two prac- 
tice sessions followed by six experimental 
sessions. The luminance level was adjusted 
for each S to obtain approximately 75% cor- 
rect form identifications under the base-level 
condition which consisted of 40 msec. of 
essentially continuous light. Forty-eight 
judgments were obtained from each S for 
the base-level condition and 96 judgments 
for both the 2- and the 10-pulse conditions 
at each of the three values of PTD. Experimental
treatments were counterbalanced
within and across Ss and across sessions. 
Following the experimental sessions each S 
returned for an additional session during 
which his identification performance for the 
three forms was determined for a 20-msec. 
continuous pulse at the luminance level un- 
der which he had run during the experimental 
sessions. 


Results 


The percentages of correct form identifications
were analyzed in a three-way
analysis of variance (2- and 10-pulse 
conditions, PTD, and Ss). The 2- 
and 10-pulse conditions were signifi- 
cantly different at the .01 level, F (1, 
7) = 13.5, and PTD was significant 
at the .05 level, F (2, 14) = 4.3. None
of the interactions including those in- 
volving Ss approached significance. 

In Fig. 3 the percentage of identifi- 
cations is shown as a function of PTD 
for the 2- and the 10-pulse conditions. 
FIG. 3. Percentage of identifications as a
function of pulse-train duration and 2- and
10-pulse conditions. (The solid triangle
plotted at 40-msec. pulse-train duration represents
performance under the base-level
condition of 40-msec. continuous light.)


Also shown in Fig. 3 is the perform- 
ance under the base-level condition of 
a 40-msec. continuous light pulse. 
Performance for the 2-pulse condition 
is poorer than for the 10-pulse condi- 
tion at all PTDs even though the light 
energy in the PTDs is identical for 
both conditions. Distributing the total
dark interval per stimulation over nine 
IPIs leads to significantly better per- 
formance than if the total dark time 
was contained in a single interval be- 
tween two longer light pulses. 

The data for the 10-pulse condition 
are in close agreement with the results 
obtained in the preceding experiment. 
Performance is best if the total light 
energy is contained in a continuous 
40-msec. pulse and deteriorates pro- 
gressively with increasing durations of 
the dark IPIs. 

While reciprocity is not achieved, 
there is still some summation occurring 
in the 2-pulse condition even with an 
IPI of 54 msec. This can be seen by 
comparing the identification perform- 
ance at the 94-msec. PTD with what 
would be expected from two independent
chances to perceive. During the
last session each S was run under the 
condition of a single 20-msec. pulse 
presented at the luminance level that 
had been used for that S in the preced- 
ing part of the experiment. Identifica- 
tion accuracy for five of the Ss with 
the 20-msec. pulse did not deviate ap- 
preciably nor significantly from the 
chance level (33⅓%). The remaining
three Ss achieved 43%, 53%, and 54% 
identification accuracy, respectively. 
The data of these three Ss obtained 
under this condition were used to pre- 
dict what their level of performance 
should have been on the basis of two 
independent chances to perceive. Since 
three-category confidence judgments 
had been obtained during the judg- 
ments with the 20-msec. pulse, the 
computations were made via the 
method described by Eriksen (1966a). 
The predicted level of performance was 
45%, 56%, and 57%, respectively, as 
compared with the actual performance 
under the 94-msec. PTD of 67%, 76%, 
and 74%, respectively, for these three 
Ss. Since, under the independence as- 
sumption, chance performance was pre- 
dicted for the other five Ss, it would 
appear that an IPI of 54 msec. is not 
too long to permit some summation. 


Discussion 


The results of both experiments taken 
together indicate quite clearly that a crit- 
ical duration over which time-intensity 
reciprocity holds does not exist for form 
perception in the same sense as it does 
for perceived brightness. Within the 
critical duration perceived brightness 
seems to be determined solely by the 
amount of energy irrespective of how it 
is distributed over the duration interval 
(Blackwell, 1963; Davy, 1952; Lichtenstein
& Boucher, 1960). Critical durations
for form have been obtained only
when the light energy was continuously 
distributed over the stimulation interval. 


D. L. SCHURMAN, C. W. ERIKSEN, AND J. ROHRBAUGH 


It is not surprising that interruption of 
light stimulation should impair temporal 
integration in form perception. With 
continuous stimulation there is adaptive 
value in having temporal summation or 
integration even over the long critical 
duration that Kahneman (1964) has 
found. Such summation leads to maxi- 
mum enhancement of contours and defini- 
tiveness in perception. But there is need 
for mechanisms in the system to detect 
discontinuity in stimulation. If summa- 
tion with continuous stimulation exists 
for 300 msec. or even longer in the per- 
ception of form, some mechanism must 
be available to terminate or interrupt 
this integration when stimulation changes. 
Otherwise visual perception would be 
like a montage. An appreciable period 
of time would be required following a 
new visual fixation or new stimulation 
before the new stimulus could be per- 
ceived completely and not have composite 
characteristics of the preceding stimula- 
tion. It also seems reasonable that such 
discontinuity detectors would have thresh- 
old functions comparable to other re- 
ceptors. Thus, within limits, the greater 
the interruption the greater the inhibition 
or impairment of temporal summation. 

Finally, it is to be noted that the pres- 
ent results lend substantial support to the 
interpretation of certain instances of 
masking phenomena as due to varying de- 
grees of summation of test and masking 
form into a composite perception. The 
anomaly that was pointed out in the intro- 
duction of this paper, namely, that mask- 
ing decreases progressively even with 
short ISIs, is found to be consistent with 
the temporal summation effects obtained 
for form perception when the stimulation 
is interrupted or discontinuous. 
REACTION TIME FOR NUMERICAL CODING
AND NAMING OF NUMERALS¹


JAMES D. WINDES²
University of Arizona 


Reaction times (RT) for 2 different identification tasks, numeral 
naming and quantity naming, have been compared in 2 experiments. 
Exp. I used single Arabic numerals for the numeral naming task and 
corresponding amounts of arbitrary figures for the quantity-naming 
task. Exp. II used the same Arabic numerals in different corresponding
amounts for both tasks. 18 Ss served in each experiment, 9 under
one task condition and 9 under the other. Exp. I showed an insignifi- 
cant difference between the 2 tasks. Exp. II showed that RT was 
slower for naming the different quantities of numerals than for 
naming the numerals themselves, even when the stimuli and naming 


responses were identical. An interpretation in terms of identification- 
task conflict, as distinct from response conflict, is presented. 


There has long been evidence that 
reaction time (RT) for naming con- 
ventional printed designations of sim- 
ple characteristics is faster than for 
naming the characteristics themselves. 
For example, Brown (1915a, 1915b) 
found that RT for naming colors and 
quantities was longer than for naming 
corresponding words and numerals.
Brown further concluded that the dif- 
ference was not due simply to differ- 
ences in practice. 

Some investigators, such as Peter- 
son, Lanier, and Walker (1925) and 
Morin and Forrin (1965), have pro- 
posed that such differences in RT may 
be due to a greater response conflict in 
naming concrete stimulus features as 
opposed to naming words and symbols. 


1 This study was conducted during the
author’s tenure as a National Aeronautics 
and Space Administration Trainee as part of 
the dissertation requirement for the degree 
of Doctor of Philosophy at the University 
of Arizona. National Science Foundation 
Grant GB 3955 helped in support of the 
study by enabling the purchase of some lab- 
oratory supplies. The author is indebted to 
Thomas E. Sitterley for technical assistance 
and to James E. King and Neil R. Bartlett 
for their encouragement and many helpful 
criticisms throughout the course of the study. 

2 Now at Northern Arizona University. 


Studies such as those of Stroop (1935) 
and Klein (1964), however, suggest 
that other types of conflict may be in- 
volved. Stroop used color words 
printed in colors different from those 
designated by the words, and found that 
RT was even slower for color naming 
as compared to word naming than it 
was when separate color and word 
stimuli were used. Klein enlarged upon 
Stroop’s finding to show that RT for 
naming the colors of words grouped 
into different semantic categories in- 
creased as the closeness of the word 
meanings to the colors increased. 
These studies raise the question as to 
whether word stimuli which come close 
in meaning to color attributes in a 
color-naming task might not increase 
some type of conflict other than re- 
sponse conflict alone. 

In considering this question, one 
should note the fact pointed out by 
Cattell (1886) and more recently by 
Oldfield and Wingfield (1965) that a 
thing must be identified in order to be 
named, for processes of identification 
may or may not be the same for the 
same naming responses. Despite the 
necessity for distinguishing between 
identifying and naming, however, the 
measuring of RT for making different
types of identifications usually has been
confounded with that for making different
naming responses, if not for reacting
to different stimuli.

The present study attempted to test 
for possible RT differences between 
different types of identifying processes 
that were not associated with different 
naming responses or stimulus condi- 
tions. The method used number stim- 
uli instead of color stimuli, but was 
based on the interference technique 
devised by Stroop. 


METHOD


Subjects—The Ss were 36 undergraduate 
volunteer male students at the University 
of Arizona, 18 being used in each of the two 
experiments. All Ss were paid for their 
work.

Apparatus—An Industrial Electronic En- 
gineers three-unit in-line readout assembly 
(Series 130) was used to present stimuli con- 
sisting either of Arabic numerals (1, 2, 3) 
or crosses (+). Both the numerals and 
crosses could be made to appear in groups 
of one, two, or three identical figures. The
stimuli were projected onto a viewing screen
as light figures against a dark background.
The figures were $ in. in height, and the 
readout assembly was mounted on a display 
panel 3 ft. from S in a sound-resistant cham- 
ber. The chamber isolated S to the extent 
of 25-30 db. SPL from adjoining room 
sounds and had indirect lighting sufficient 
for S to see the viewing screen on the display
panel. The S’s head was held in position by 
a chin rest. 

Presentation of the stimuli was pro- 
grammed through the use of a California 
Technical Industries eight-channel paper 
tape reader (Model 220), with successive 
outputs pulsed by two Hunter Decade In- 
terval timers. White noise of sufficient in- 
tensity to mask the sounds of the tape 
reader, which was located outside S’s cham- 
ber, was presented throughout the course of 
the experimental sessions by means of a 
Grayson-Stadler noise generator (Model 
455C). The S controlled a floor switch 
which caused the stimulus to appear after a 
random foreperiod of 1- to 3-sec. duration. 
The presentation of the stimulus caused a 
Hunter Klockounter to begin measuring S’s 
response which activated a Grayson-Stadler 


Voice-Operated Relay (Model E7300A) to 
stop the Klockounter and enabled E to 
record the RT. The E monitored S’s 
vocal response and sounds made in the ex- 
perimental chamber by means of a two-way 
intercom.

Procedure.—The research was divided into 
two separate experiments. In both experi- 
ments, half the Ss were assigned the task 
of directly naming numerals, and half were 
assigned the task of naming quantities of 
figures (i.e., subitizing them; Kaufman,
Lord, Reese, & Volkmann, 1949). In Exp.
I, the first task condition (T1) required the
naming of the single numerals 1, 2, and 3,
while the second (T2) required the naming
of the quantities of figures in the stimuli +,
++, and +++. In Exp. II, the same stimuli
(1, 11, 111, 2, 22, 222, 3, 33, and 333) were
used under both task conditions. The first
(T1) required the naming of the numerals,
irrespective of their quantities; the second
(T2) required the naming of the quantity
of numerals, irrespective of their identities.

Since there were three different possible 
positions for the single-figure stimuli, each 
of these was programmed to appear ran- 
domly an equal number of times in each of 
the three possible positions on the viewing 
screen. Likewise, since each of the double- 
figure stimuli could occupy either of two 
possible positions (without separating the 
figures), they were programmed to appear 
randomly in each of these positions an 
equal number of times. 

Both experiments thus compared a num- 
eral naming task with a quantity-naming 
task according to a between-Ss design, which 
enabled each S to have more practice in 
performing only one type of task. Both 
experiments, moreover, required Ss to make
the same responses, “one” (R1), “two” (R2),
and “three” (R3) under both task conditions.

The design for Exp. I was different from 
that for Exp. II in that it lacked the Stimulus-Set
Conditions K1, K2, and K3, which corresponded
to the Stimulus Sets (1, 22, 333),
(2, 33, 111), and (3, 11, 222), respectively.
Each of these sets was formed to include
each type of numeral and each type of
quantity, and the set for K1, unlike those for
K2 and K3, was formed to contain numerals
in quantities equal to the quantities designated
by the individual numerals.
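
The structure of the three stimulus sets can be checked mechanically. The sketch below is an illustration only: it labels the sets K1 to K3 in the order listed above, and the helper names (`numeral`, `quantity`, `is_balanced`, `is_congruent`) are assumptions introduced for the example.

```python
STIMULUS_SETS = {
    "K1": ("1", "22", "333"),
    "K2": ("2", "33", "111"),
    "K3": ("3", "11", "222"),
}


def numeral(stimulus):
    """Symbolic value: the numeral that is displayed (named under T1)."""
    return int(stimulus[0])


def quantity(stimulus):
    """Concrete value: how many numerals are displayed (named under T2)."""
    return len(stimulus)


def is_balanced(stimuli):
    """True if the set contains each numeral and each quantity exactly once."""
    return (sorted(numeral(s) for s in stimuli) == [1, 2, 3]
            and sorted(quantity(s) for s in stimuli) == [1, 2, 3])


def is_congruent(stimuli):
    """True if every stimulus shows a numeral equal to its own quantity."""
    return all(numeral(s) == quantity(s) for s in stimuli)


balanced = {name: is_balanced(s) for name, s in STIMULUS_SETS.items()}
congruent = {name: is_congruent(s) for name, s in STIMULUS_SETS.items()}
```

Under these definitions every set is balanced, but only the first is congruent; in the other two sets no single stimulus has matching numeral and quantity.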

All Ss sat for the same number of experimental
sessions and trials per session.
Sessions were held for each S on suc- 
cessive days, except on two occasions in 
which two of the sessions were separated 
by a period of about 10 hr. on the same
day. A total of 136 RT trials were made 
at each session, with random orders of stim- 
ulus presentations predetermined according 
to the programmed tapes. Two different 
tapes were used so that each S had a short 
rest period halfway through each session. 
The first nine trials at the beginning of 
each tape were for warm-up only, and only 
the data collected during the last two ses- 
sions, exclusive of the warm-up trials, were 
used in the analysis of results. 


RESULTS 


Experiment I.—Mean RT scores for 
Ss under each experimental condition 
were calculated from the scores remain- 
ing after elimination of those associated 
with response errors. (The highest
error ratio for a single experimental 
condition in both experiments was 
4.1% for Condition T2R2K2 in Exp.
II.) The results of the analysis of 
variance showed that the numeral nam- 
ing group was faster than the quantity- 
naming group, but the difference was 
not statistically significant. The only 
significant effect was that of the different
responses, F (2, 32) = 38.13, p <
.001. The response “one” was the
fastest, with a mean score of 433.5 
msec. as compared to 474.3 msec. for 
“two,” and 466.6 msec. for “three” 
under the numeral naming condition, 
and a mean score of 460.2 msec. as 
compared to 497.7 msec. for “two,” 
and 484.8 msec. for “three” under the 
quantity-naming condition. Also, un- 
der both task conditions, the response 
“three” was slightly faster than the re- 
sponse “two.” 

Experiment II.—Table 1 and Fig. 1 
show clearly that the time required 
for the numeral naming task (T1) in
Exp. II was shorter, F (1, 16) = 
17.30, p < .001, than that required for 
the quantity-naming task (T2). Again, 
significant response differences were 
found. To compare the stimulus-set 
differences separately under T1 and


TABLE 1
SUMMARY TABLE FOR EXPERIMENT II

Source                  df    MS            F
Task (A)                 1    298,282.00    17.30**
Ss within Groups (B)    16     17,245.00
Stimulus Set (C)         2      5,839.33    37.86**
A × C                    2      1,725.53    11.19**
B × C                   32        154.24
Response (D)             2     22,406.62    38.64**
A × D                    2      2,719.71     4.69*
B × D                   32        579.94
C × D                    4        502.45     8.06**
A × C × D                4        107.64     1.86
B × (C × D)             64         58.03

* p < .05.
** p < .01.


T2, t tests were used. These tests
showed that, although both numeral
naming and quantity naming took significantly
longer under K2 and K3 than
they did under K1 (df = 8, p < .05),
the differences between K2 and K3
were not significant either for numeral 
naming or for quantity naming. 
FIG. 1. Mean RT for three naming responses
to multiple Arabic numerals under
two task conditions. (The individual curves
represent different stimulus sets as defined
in the text.)

However, a further t test confirmed
that it took longer to subitize the numerals
in 1, 22, and 333 with the responses
“one,” “two,” and “three,”
respectively, than it did to name these
same numerals directly with the same
responses, t (16) = 5.15, p < .01.
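
The F ratios in Table 1 can be recomputed from the mean squares as a consistency check. The sketch below is a hedged illustration: it assumes the usual mixed-design convention in which the between-Ss effect (A) is tested against Ss within groups (B) and each within-Ss effect is tested against its own interaction with B; the dictionary names are introduced here for illustration.

```python
# Mean squares transcribed from Table 1 (source: MS).
MEAN_SQUARES = {
    "A": 298282.00, "B": 17245.00,
    "C": 5839.33, "AxC": 1725.53, "BxC": 154.24,
    "D": 22406.62, "AxD": 2719.71, "BxD": 579.94,
    "CxD": 502.45, "AxCxD": 107.64, "Bx(CxD)": 58.03,
}

# Assumed error term for each tested effect.
ERROR_TERM = {
    "A": "B",
    "C": "BxC", "AxC": "BxC",
    "D": "BxD", "AxD": "BxD",
    "CxD": "Bx(CxD)", "AxCxD": "Bx(CxD)",
}

# F values as printed in Table 1.
PRINTED_F = {
    "A": 17.30, "C": 37.86, "AxC": 11.19, "D": 38.64,
    "AxD": 4.69, "CxD": 8.06, "AxCxD": 1.86,
}


def f_ratio(effect):
    """Effect mean square divided by its assumed error mean square."""
    return MEAN_SQUARES[effect] / MEAN_SQUARES[ERROR_TERM[effect]]


recomputed = {effect: f_ratio(effect) for effect in ERROR_TERM}
# Effects whose recomputed F differs from the printed value by 0.01 or more.
mismatches = sorted(e for e in ERROR_TERM
                    if abs(recomputed[e] - PRINTED_F[e]) >= 0.01)
```

With the values as transcribed, every ratio agrees with the printed F to within rounding except C × D, which recomputes to about 8.66 rather than 8.06; this may reflect a misprint or a scanning error in one of the entries and does not change the significance level shown.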


DISCUSSION 


The results offer no clear answer to 
the question of whether small visual 
quantities can be named as fast as cor- 
responding Arabic numerals. In par- 
ticular, Exp. I falls short of conclusively 
supporting Brown’s (1915b) conclusion 
that small quantities of arbitrary figures 
cannot be named as fast as corresponding 
Arabic numerals, just as colors cannot be 
named as fast as corresponding color 
words. 

Experiment II, however, shows that 
sets of stimuli which combine both con- 
crete and symbolic number characteristics 
affect RT in much the same way as those 
which combine both concrete and sym- 
bolic color characteristics. The results 
demonstrate that naming the quantity of 
a small group of Arabic numerals takes 
significantly longer than naming the nu- 
merals themselves, just as other experi- 
ments have shown that naming the color 
of a color word takes longer than naming 
the word. The concrete-symbolic inter- 
ference effect, therefore, is probably not 
a phenomenon peculiar to color identifica- 
tions and color naming alone. 

In addition, however, the inclusion in 
Exp. II of numeral-quantity combinations,
Set K1, which did not conflict in
their concrete and symbolic values re- 
vealed something about this type of inter- 
ference effect that the other studies have 
not. Whatever the source of the longer 
time required for naming the quantities 
of numerals rather than the numerals 
themselves under Condition K1, it cannot
be interpreted as due to numeral-
quantity conflict in the stimuli in this set, 
for the members of this set lack numeral- 
quantity conflict; the quantity designated 
by the numerals in each stimulus is 
identical to the quantity of numerals in 
that stimulus. Insofar as the effect was
due to conflict in stimuli, it was a contextual
effect, i.e., it was due to conflict
in the other stimuli which might have
occurred but did not.

Perhaps a stimulus which eventually 
leads a person to make a particular 
naming response also may lead him to 
attempt one general type of perceptual 
identification rather than another. Be- 
fore one can identify something as red 
and not blue, green, or some other color, 
he may have to get set to identify its 
color and not its shape, size, or some 
other feature; and which of these identifi- 
cation tasks he undertakes may depend, 
among other things, on the particular 
stimulus object itself. 

Klein (1964) refers to this attention- 
catching capacity of a stimulus as its 
attensive power, using Titchener’s term, 
and associates it with the semantic rela- 
tionships of a stimulus in the context of 
a particular naming task. The attensive 
capacity of a numeral or word to evoke 
an identification according to numerical 
or verbal content, instead of according 
to concrete sensory characteristics as 
assigned, thus might produce a situation 
of identification-task conflict with no concomitant
response conflict. For example,
the stimulus 22 might lead a subject
assigned the task of identifying the quantity
of numerals to identify instead the
numerals themselves. 

It is suggested, then, that two dif- 
ferent possible sources of delay in naming 
stimuli should be distinguished: identi- 
fication-task conflict and response con- 
flict. The first refers to conflict between 
attempting to identify a stimulus accord- 
ing to its color, size, or some other gen- 
eral feature rather than another. The 
second refers to a competition between 
different naming-response tendencies as 
readied to express a particular, specific 
identification.


CUE SELECTION AS A FUNCTION OF DEGREE OF
LEARNING AND RESPONSE SIMILARITY 1


WILLIAM L. DAVIS, SAM C. BROWN, AND ELAINE RITCHIE
Kansas State University


[...] test of 1st-letter cue selection was made, which involved presentation
of just the 1st letter of each stimulus term. [...]
evidenced by correct recall of the R term combined with failure to
reproduce correctly additional stimulus letters, [...]
comparable amounts of 1st-letter selection for low- and high-similarity
groups after performance criteria of 2 and 4 pairs correct on a single
trial, respectively. After 1 errorless trial, more selection occurred
with low- than high-R similarity, while further practice produced the
opposite effect. The results point to the importance of R-term
characteristics for cue selection.


1 This investigation was supported by a
Title IV National Defense Education Act
Predoctoral fellowship to the first author,
and by United States Public Health Service
Research Grant MH-11179-03 to the second
author.


Recently, Postman and Greenbloom
(1967) presented data concerning the
amount and locus of single-letter cue
selection in paired-associate (PA)
learning. Unique to this study was an
attempt to assess single-letter selection
uncontaminated by possible mediational
effects produced by reproduction of
additional stimulus elements. To do
this, Ss were shown a single-stimulus
letter after PA learning and were instructed
to recall the remaining stimulus
letters in the trigram and the response
term previously associated with
that trigram. Single-letter cue selection
was defined by a dual criterion:
recall of the correct response to a given
letter, and failure to reproduce the
other stimulus letters. The results indicated
that the greatest amount of
single-letter selection occurred to the
first letter of the trigram when the
stimulus term was difficult to pronounce.
Much less selection occurred
to the second and third letters of the
hard-pronounceability stimulus trigrams
or to single-letter cues when
stimuli were easy to pronounce.

Because Postman and Greenbloom 
introduced the cue-selection test after 
S had learned the list to a criterion of 
one perfect trial, it is not possible from 
their data to determine the extent of 
cue selection at other degrees of learn- 
ing. It is possible that early in learn- 
ing a single letter may operate as the 
functional stimulus and only later are 
additional letters acquired. On the 
other hand, Ss may attempt during 
early learning trials to respond to the 
whole unit, while later in learning, the 
functional cue becomes limited to a 
single letter. Still another possibility 
is that the amount of single-letter se- 
lection remains constant throughout 
learning. To help clarify the issue the 
present study compared single-letter 
cue selection after varying degrees of 
list mastery, using the Postman and 
Greenbloom technique. However, this 
investigation was limited to first-letter 


323 


324 W. L. DAVIS, S. C. BROWN, AND E. RITCHIE 


cue selection since Postman and Green- 
bloom found the greatest amount of se- 
lection to occur to this cue. 

It has been hypothesized elsewhere 
(e.g., Brown, Battig, & Pearlstein,
1965) that one important factor affect- 
ing the relative magnitude of cue se- 
lection at various stages of learning is 
the degree of interference generated 
between pairs in the list. More spe- 
cifically, Brown et al. suggest that 
additional stimulus letters are acquired 
as a means of reducing interpair inter- 
ference (IPI), which has been demon- 
strated to develop maximally at inter- 
mediate stages of learning (Brown, 
1964). Thus, it would be expected 
that conditions producing an overall 
high degree of IPI should lead to a 
greater use of additional stimulus let- 
ters primarily during and following in- 
termediate degrees of learning than a 
list generating a relatively lesser 
amount of IPI. To test this hypoth- 
esis, IPI was manipulated in the pres- 
ent study by varying formal intralist 
similarity of the response terms. It 
was predicted that a list composed of
high-similarity responses would show
less first-letter cue selection than a list
made up of low-similarity responses.


METHOD 


Lists.—The learning materials consisted of
two six-pair lists, one with low-response (R) 
similarity (LS) and the other with high-R 
similarity (HS). Both lists used exactly 
the same CCC trigrams as stimuli, with no 
repeated letters within or between stimuli. 
All stimulus (S) terms were difficult to pro- 
nounce, with pronounceability ratings rang- 
ing from 7.5 to 8.7 as determined by Under- 
wood and Schulz (1960, Appendix E). 

The R terms consisted of CVC trigrams. 
Responses in the HS list were highly simi- 
lar in letter composition (qis, qoy, siy, suq, 
yoq, yus), while the LS list contained R 
terms with no letter overlap (bov, dut, fap, 
kes, ryx, sih). Both sets of R terms had 
an average Archer (1960) association value 
of 34% with a range 17%-44% and 14%- 
44% in the LS and HS list, respectively. 


For both lists, there was no letter duplication
within pairs or between S and R terms
in the HS list. Letter overlap between stim- 
uli and responses in the LS list could not be 
avoided; however, no letter appeared more 
than twice in the list. 
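
One way to make the formal-similarity manipulation concrete is to count how many letters the response terms share. The sketch below is illustrative only: the overlap index (mean number of letters shared by a pair of responses) is an assumption chosen for the example, not the measure used by the authors, and the item lists are taken as transcribed in this copy.

```python
from itertools import combinations

# Response terms as printed in the Method section of this copy.
HS = ["qis", "qoy", "siy", "suq", "yoq", "yus"]
LS = ["bov", "dut", "fap", "kes", "ryx", "sih"]


def distinct_letters(items):
    """Set of letters used anywhere in the list."""
    return set("".join(items))


def mean_shared_letters(items):
    """Mean number of letters shared by two items, averaged over all pairs."""
    pairs = list(combinations(items, 2))
    return sum(len(set(a) & set(b)) for a, b in pairs) / len(pairs)


hs_letters = distinct_letters(HS)
ls_letters = distinct_letters(LS)
hs_overlap = mean_shared_letters(HS)
ls_overlap = mean_shared_letters(LS)

print(len(hs_letters), round(hs_overlap, 2))
print(len(ls_letters), round(ls_overlap, 2))
```

On this index the HS list is built from only six letters and its pairs share letters freely, while the LS value is close to zero. It is not exactly zero because two of the LS items as transcribed here share a letter, which may be an artifact of transcription.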

Subjects and design.—Each of 96 paid Ss,
all summer session students at Kansas State
University, was randomly assigned to one of 
eight equal-sized groups. The groups dif- 
fered from one another in terms of type of 
list learned (LS or HS) and degree of list 
mastery (a minimum of two correct on a 
single trial, four correct, six correct, or three 
consecutive errorless trials). Thus the 
eight groups, designated as LS2, LS4, LS6, 
LS6+3, HS2, HS4, HS6, and HS6+3, respectively,
constituted a 2 × 4 factorial design
with two levels of R similarity and four 
levels of degree of learning. Following at- 
tainment of the appropriate learning cri- 
terion, a first-letter cue-selection test was 
administered separately to each S within 
each group. 
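
The assignment of 96 Ss to the eight cells of the 2 × 4 design can be sketched as follows. This is a minimal illustration under assumed details: the report states only that assignment was random with equal-sized groups, so the shuffling scheme, the seed, and the function name are hypothetical.

```python
import random
from collections import Counter
from itertools import product

LISTS = ["LS", "HS"]                 # response similarity: low, high
CRITERIA = ["2", "4", "6", "6+3"]    # degree of list mastery


def assign(n_subjects=96, seed=0):
    """Randomly assign subjects to the 2 x 4 cells with equal group sizes."""
    groups = [lst + crit for lst, crit in product(LISTS, CRITERIA)]
    per_group = n_subjects // len(groups)
    slots = groups * per_group       # one slot per subject
    rng = random.Random(seed)        # seeded only to make the sketch repeatable
    rng.shuffle(slots)
    return {subject: slots[subject] for subject in range(n_subjects)}


assignment = assign()
group_sizes = Counter(assignment.values())
print(sorted(group_sizes.items()))
```

Each of the eight groups (LS2 through HS6+3) receives 12 Ss under this scheme.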

Procedure.—Learning of the PA list was 
carried out under the study-test (recall) 
method which began first with the individ- 
ual presentation of the pairs for study, fol- 
lowed by the successive presentation of just 
the S terms for attempted response recall. 
The materials were prepared on 2 × 2 in.
slides and presented through a Carousel pro- 
jector at a 3-sec. rate with a 20-sec. interval 
between trials and between study and test 
series within a trial. Order of presentation 
of the list was varied by shuffling the slides 
after each study and test series. 

When S reached the learning criterion of 
the group to which he was assigned, practice 
on the PA list terminated and the cue-selec- 
tion test began approximately 1 min. later. 
In the cue-selection test the first letter of 
each S term, followed by five blank spaces, 
was shown separately on the screen. The 
S was instructed first to give the missing 
two letters of the S term and then to spell 
aloud the three letters of the R term which 
was previously paired with that item. Order 
of exposure of stimulus letters was varied 
unsystematically across Ss within each 
group, and Ss were given unlimited time 
to respond to each stimulus letter. 


RESULTS 


TABLE 1
MEAN PERFORMANCE MEASURES

              Trials to Criterion          Correct Responses
                                           on Cue Selection
DL        List LS   List HS   Total    List LS   List HS   Total

2           3.25      4.58     3.92      1.83      1.67     1.75
4           5.42      9.08     7.25      2.58      2.58     2.58
6           9.67     21.33    15.50      5.33      5.25     5.29
6+3        11.17     16.17    13.67      5.00      5.58     5.29
Total       7.38     12.79               3.69      3.77


Table 1 presents the mean number of
trials required by each of the eight
groups to reach criterion on the PA
list. Both the main effect of degree
of learning (DL) and the overall faster
learning of List LS than HS proved 
to be significant, Fs > 19.87, beyond 
the .05 level, the minimum criterion for 
all analyses reported herein. In addi- 
tion, there was an interaction between 
DL and lists, F (3, 88) = 3.33, reflect- 
ing a divergence between the two lists 
with an increase in DL, although at 
DL6+3 the trend was reversed due 
to the greater number of trials to cri- 
terion for Group HS6 than HS6+3. 
However, in terms of total correct re- 
sponses the reversal was not evidenced. 
Thus, the discrepancy between Groups 
HS6 and HS6+3 would not appear 
to represent a serious problem par- 
ticularly since the data of primary in- 
terest involve within-group compari- 
sons. Furthermore, Table 1 shows 
that rate of learning had little, if any, 
effect on the total number of pairs correctly
recalled on the cue-selection test:
DL was the only variable that contributed
significantly to the total number
of responses correctly recalled, F
(3, 88) = 49.04. Neither the differ- 
ence between lists nor any orthogonal 
component of the Lists × DL interaction
reached significance, Fs < 1, as
determined by extended trend analysis 
of variance (Grant, 1956). Thus, in 
terms of total pairs correctly recalled, 
the data indicate that performance on 
the cue-selection test was comparable
for LS and HS groups learning the
PA list to the same criterion.

The determination of first-letter cue
selection requires the application of
the dual criterion, i.e., inability to reproduce
missing stimulus letters and
correct recall of the R term on the
cue-selection test. Reproduction of one
or both stimulus letters, when the response
is correctly recalled, would suggest
that these additional letters are
components of the functional stimulus.
Thus, for each S, correct responses
given on the cue-selection test were divided
into three response categories
(RC): S0, S1, and S2, corresponding,
respectively, to 0, 1, and 2 stimulus letters
correctly reproduced. Figure 1
shows separately for the LS and HS
lists the mean number of correct responses
in each of these categories at
each DL. As shown, both lists produced
more overall S0 (2.34) than S1
(.67) or S2 (.72) responses, leading
to a highly significant main effect of
RC, F (2, 176) = 58.99. However,
the number of S0 responses relative to
S1 and S2 varied across the four DLs
as a function of list similarity. As can
be seen, the magnitude of the superiority
of the S0 category is essentially
the same under both lists at DL 2 and
4. List LS then shows an increase
in the relative number of S0 responses
at DL6, followed by a decrease at
DL6+3. On the other hand, List HS
shows little increase in relative number
of S0 responses until the 6+3 criterion.
Trend analysis showed the S0 curve
for List HS to possess a significant
linear component, F (1, 44) = 23.41,
and the S0 curve for List LS both a
significant linear, F (1, 44) = 16.42,
and cubic (curvilinear), F (1, 44) =
5.50, component. Both the linear component
of the RC × DL interaction,
F (2, 176) = 9.74, and the cubic component
of the RC × DL × Lists interaction,
F (2, 176) = 3.26, were significant,
further indicating the differential
effects of DL and List similarity
to be reliable. All other orthogonal
components of these two interactions
were nonsignificant, as was also the
RC × Lists interaction.

Fig. 1. Mean number of correct responses
in each recall category as a function of
degree of learning and response similarity.
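
The dual criterion described above amounts to a simple scoring rule for each item on the cue-selection test. The following sketch shows one way to code it; the function names and the example protocol are hypothetical and are not taken from the study.

```python
from collections import Counter


def response_category(response_correct, stimulus_letters_correct):
    """Classify one cue-selection test item.

    Returns 'S0', 'S1', or 'S2' (number of missing stimulus letters
    reproduced) when the response term was recalled correctly, and
    None when it was not, since only correct responses are categorized.
    """
    if stimulus_letters_correct not in (0, 1, 2):
        raise ValueError("a trigram has exactly two missing stimulus letters")
    if not response_correct:
        return None
    return f"S{stimulus_letters_correct}"


def tally(items):
    """Count correct responses falling into each response category."""
    counts = Counter({"S0": 0, "S1": 0, "S2": 0})
    for response_correct, n_letters in items:
        category = response_category(response_correct, n_letters)
        if category is not None:
            counts[category] += 1
    return counts


# Hypothetical protocol for one S on the six test items; not data from the study.
example = [(True, 0), (True, 0), (True, 2), (False, 1), (True, 1), (False, 0)]
counts = tally(example)
print(dict(counts))
```

Under this rule only S0 items count as first-letter cue selection; S1 and S2 items indicate that additional letters were part of the functional stimulus.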

It can also be seen from Fig. 1 that 
there were relatively more S1 than S2 
responses at DL6 but more S2 than S1 
responses at DL6+3. This reversal
prompted further analysis of the S1
and S2 data, revealing a significant 
cubic component of the S1-S2 × DL
interaction, F (1, 88) =6.69. How- 
ever, aside from a significant main 
effect of DL, F (3, 88) = 9.38, all 
other main effects and interactions in- 
volving S1-S2, Lists, and/or DL were 
nonsignificant. 

In the Postman and Greenbloom 
study it was reported that a significant 
inverse relationship was obtained be- 
tween relative amount of first-letter 
cue selection (i.e., S0 responses) and
speed of acquisition of the PA list. 
To determine the nature of this rela- 
tionship in the present study, an iden- 
tical analysis was performed. Spe- 
cifically, the product-moment correla- 
tion was computed between trials to 
criterion on the PA list and the percentage
of S0 responses of the total
number correctly recalled on the cue- 
selection test. The resulting r’s, cal- 
culated separately for each of the eight 
groups, were all nonsignificant and 
ranged from +.39 to —.37. The av- 
erage r for List LS was —.25 and for 
List HS, +.21. The difference be- 
tween these two r’s was nonsignificant, 
as was also the difference between lists 
at each DL. 
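
The correlational analysis described above can be sketched in a few lines. The scores below are hypothetical values for a single group of 12 Ss, made up purely to exercise the functions; they are not data from the study, and the function names are illustrative.

```python
from math import sqrt


def percent_s0(n_s0, n_correct):
    """Percentage of correctly recalled pairs that fall in category S0."""
    if n_correct <= 0:
        raise ValueError("need at least one correct response")
    return 100.0 * n_s0 / n_correct


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical scores for one group of 12 Ss; not data from the study.
trials_to_criterion = [3, 4, 4, 5, 5, 6, 6, 7, 8, 9, 10, 12]
s0_counts = [1, 2, 0, 2, 1, 3, 1, 2, 2, 1, 3, 2]
correct_counts = [2, 3, 2, 3, 2, 4, 2, 3, 3, 2, 4, 3]

pct_s0 = [percent_s0(s0, c) for s0, c in zip(s0_counts, correct_counts)]
r = pearson_r(trials_to_criterion, pct_s0)
print(round(r, 2))
```

A negative r would indicate that slower learners (more trials to criterion) showed relatively less first-letter selection, and a positive r the reverse.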

Despite the lack of a significant re- 
lationship between learning ability and 
cue selection, the possibility still exists 
that certain pairs in the list may be 
more disposed to first-letter selection 
than other pairs. The fact that the 
relative number of S0 responses tended
to increase with an increase in DL suggests
further that these pairs may be
the last learned or more difficult pairs 
in the list. At early stages of learn- 
ing (DL 2 and 4), the pairs correctly 
recalled were presumably the easier 
pairs in the list. Accordingly, the 
mean number of PA errors per pair, 
for items falling into the S0, S1, and
S2 response categories, was tabulated
separately for each S within each
group. Although the results showed
more overall PA errors for pairs in the
S0 (5.6) and S1 (5.9) than S2 (4.9)
category, the difference in performance
was not significant, p > .20. Likewise,
all interactions involving categories
fell short of significance, p’s >
.20. Thus it could not be concluded
that S0 pairs were reliably more difficult
to learn than S1 or S2 pairs.


DISCUSSION


The results of the present research 
show that amount of first-letter cue se- 
lection in PA learning is influenced sig- 
nificantly by the degree of formal intra- 
list similarity among the response mem- 
bers comprising the list. This demon- 
strates that the properties of the R terms 
determine, at least in part, what aspects 
of the nominal stimulus become the func- 
tional stimulus. In general agreement 
with the IPI hypothesis, more first-letter 
selection occurred with low- than high-R 
similarity following initial mastery of 
the list. However, this effect not only was limited
to performance after one errorless
trial, but actually reversed with practice
beyond this stage, due to both a decrease
in first-letter selection with low-R
and an increase with high-R similarity.

Interpretation of the cue-selection func- 
tion obtained with low-R similarity ap- 
pears relatively straightforward. As 
learning of the list proceeds through 
initial mastery, increased attention is di- 
rected toward the first letters of the S 
terms. However, with further practice, 
Ss acquire additional stimulus letters, 
perhaps as a means for increasing resist- 
ance to forgetting. These results are 
similar to those found by James and 
Greeno (1967). Using word-nonsense 
syllable compound stimuli and single-digit 
responses, these authors reported use of 
the word as the functional stimulus first 
to increase and then decrease with learn- 
ing. The generality of the phenomenon 
must be tempered, however, since a related
study by Houston (1967), also
using compound stimuli, failed to find a 
significant relationship between DL and 
amount of cue selection. 

The increase in first-letter cue selec- 
tion with overlearning (DL6+3) of the 
high-R similarity list is puzzling. It 
implies that Ss remember less about the 
letter composition of the S term with an 
increase in practice (total number of 
pairs correctly recalled on the cue-selec- 
tion test was virtually the same after DL6 
and DL6+3). Moreover, this reduction 
in stimulus information was attributable 
solely to a decline in the frequency of S1 
responses (i.e., the correct recall of just 
one of the two missing stimulus letters). 
Thus it would appear that if Ss are un- 
able to easily incorporate all three letters 
into a single integrated stimulus unit, 
they will instead restrict their attention 
to the first letter of the S term. A similar 
but smaller decrease in S1 responses also 
was observed with the low-R similarity 
list. It may be argued that the increase 
in cue selection with overlearning under 
List HS is related primarily to differences 
in PA learning ability between the Ss in 
Group HS6 and HS6+3, i.e., more trials 
to criterion were required of HS6. 
Arguing against such an interpretation 
is the fact that PA learning ability was 
not found to correlate significantly with 
amount of first-letter cue selection. 
Moreover, if such a relationship exists, 
the Postman and Greenbloom (1967) 
study indicates that slow learners are 
more disposed to first-letter selection 
than fast learners. 

At face value, the cue-selection func- 
tion obtained with high-R similarity is 
inconsistent with both the results of 
James and Greeno (1967) and the func- 
tion displayed in the present study with 
low-R similarity. However, it should be 
noted that the present study used only 
three overlearning trials. Thus it seems 
reasonable to suppose that with further 
practice on the HS list a decline in first- 
letter cue selection would occur. Further 
research designed to systematically spec- 
ify the nature of this relationship is 
needed to resolve the ambiguity.

Finally, it also should be mentioned
that the present study revealed a much 
higher incidence of first-letter cue selec- 
tion than did the Postman and Green- 
bloom (1967) experiment. In the latter 
study, Ss most frequently recalled both 
missing stimulus letters. The fact that 
this study used trigrams and Postman 
and Greenbloom single digits as responses 
may account for the discrepancy. De- 
spite the differences between the two 
studies, however, both investigations 
point to the usefulness of the Postman 
and Greenbloom technique as a means 
for studying stimulus-selection processes 
in verbal learning. Moreover, taken to- 
gether, the results of the two studies 
strongly suggest that factors influencing 
stimulus selection in PA learning are 
highly dependent on the characteristics 
of both the S and R terms in the list.


MAGNITUDE ESTIMATES OF ROTATIONAL VELOCITY
DURING AND FOLLOWING PROLONGED INCREASING,
CONSTANT, AND ZERO ANGULAR ACCELERATION 1


Velocity of rotation was observed by 10 Os in a rotating simulator
during and following accelerations about a vertical axis which varied
between .5°-1.5°/sec² and .006°-.030°/sec³ and continued for 2-3 min.
Trials with 0 acceleration were also included. Magnitude estimates
of velocity of rotation increased and then declined during the constant
accelerations, while more complex effects occurred during increasing
accelerations. Aftereffects and an autokinetic effect were also
observed. Current theory of vestibular function is considered in the
light of these results.


The model of the semicircular canals 
functioning as a heavily damped torsion 
pendulum, which was developed by van 
Egmond, Groen, and Jongkees (1952), 
has proved to be a useful concept in re- 
search on the perception of rotation and 
related phenomena. While this model 
typified the data for accelerations of 
short duration, it soon became clear to 
the proponents of the theory that data 
from many studies did not fit the re- 
sults predicted by the model (Ek, 
Jongkees, & Klijn, 1960; Groen, 1962, 
1965; van Egmond, Groen, & Jongkees,
1952). This was verified by other 
workers for such phenomena as sec- 
ondary nystagmus, the oculogyral illu- 
sion, the response decline for subjective 
magnitude estimation of rate of rotation 
during constant angular acceleration, 
and for the perception of rotation in 
the opposite sense following the pri- 
mary effects (Brown, 1966; Graybiel, 
Kerr, & Bartley, 1948; Groen, 1962; 
Guedry, 1965; Guedry & Lauver, 1961; 
Hauty & Wendt, 1953). These workers
showed that gross discrepancies
occurred between cupular deflection
and the duration and magnitude of the
effects, whereas the model indicated a
close relationship. Thus, the model,
which was exclusively a description of
semicircular canal activity, was shown
to be inadequate for an understanding
of prolonged vestibular stimulation,
particularly for the perception of rotation.
The present study was designed
to extend these investigations by a
magnitude estimation procedure under
three conditions of prolonged rotation:
constantly increasing acceleration
(ramp acceleration), constant acceleration,
and constant velocity. The primary
purpose was to study changes in
subjective velocity by frequently estimating
the magnitude of rotational
velocity during 2 or 3 min. of acceleration
and during a 2-min. period of constant
velocity immediately thereafter.
A secondary purpose was to study the
perception of rotation during a 4-min.
period of constant velocity.

1 This paper was prepared while the first
author was on leave from the Psychology
Department, San Jose State College, and
was a National Research Council Postdoctoral
Resident Research Associate at Ames
Research Center.
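
To see why a response decline during constant acceleration is a problem for the torsion-pendulum account, it helps to sketch what that account predicts. For a heavily damped pendulum, the deflection under a constant angular acceleration rises toward a plateau along an approximately exponential course and stays there. The sketch below uses that first-order approximation; the time constant of 10 sec. and the function name are assumptions chosen for illustration, not values from this study.

```python
from math import exp


def predicted_response(alpha, t, tau=10.0):
    """First-order approximation to the heavily damped torsion pendulum.

    alpha : constant angular acceleration (deg/sec^2)
    t     : time since onset of the acceleration (sec)
    tau   : long time constant of the system (sec); 10.0 is an assumed,
            illustrative value
    Returns the deflection in velocity-equivalent units (deg/sec).
    """
    return alpha * tau * (1.0 - exp(-t / tau))


# Predicted course over a 2-min. constant acceleration of 1.0 deg/sec^2,
# sampled every 10 sec.
times = list(range(0, 121, 10))
curve = [predicted_response(1.0, t) for t in times]
for t, value in zip(times, curve):
    print(f"{t:3d} sec  {value:5.2f}")
```

Under these assumptions the predicted response never decreases while the acceleration continues, so a decline of the magnitude estimates toward zero before the acceleration ends cannot be produced by the model alone.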


METHOD 


Apparatus.—The Os were rotated in the
Ames Man-Carrying Rotation Device
(MCRD). The MCRD is a one degree of
freedom simulator that rotates O about a
vertical axis with continuous angular acceleration
variations to 30°/sec² for a range of
durations limited by the maximum speed of
45 rpm. Increments in angular acceleration
as small as .01°/sec² can be presented
and measured at low velocities. The accelerations
are monitored by ink writing recorders
driven by on-board angular accelerometers.
The cockpit rotates on the shaft of a
dc motor which is controlled by voltage
signals from an analog computer making it
possible to change accelerations in a small
fraction of a second. The rotation is free
from significant vibrations, particularly at
the lower angular velocities (±20 rpm) used.
The O sits at the center of rotation in a
special seat within a light-proof cockpit 5 ft.
wide, 7 ft. long, and 5.5 ft. high. Communication
between O and E is maintained by
means of a standard voice communication
system. The room is ventilated by two fans,
which also partially mask external noises.
An aviator’s helmet, shoulder harness, and
lap belt are used during all observations as
safety precautions.

Observers.—The Os were 10 men in good
health according to their own report, and a 
physical examination revealed no abnormali- 
ties. They all had normal hearing, and their 
responses to caloric stimulation were judged 
to be normal. Their thresholds for angular 
acceleration determined by a staircase method 
varied between .15° and 1.19°/sec² with a
mean of .40°/sec². All of them were familiar
with the MCRD through previous observa- 
tions in it; each completed 8-22 (median = 
10) practice trials using the magnitude esti- 
mation procedure. 

Procedure.—The O sat erect in the seat
in complete darkness throughout all trials 
with his helmet pressed firmly against a U- 
shaped head rest that maintained his head 
in an erect, fixed position. The O’s task 
was simply to report the direction and 
magnitude of the perceived rotation through- 
out each trial following a ready signal. The 
perceived velocity of rotation was estimated 
by a method of magnitude estimation similar 
to that used by Stevens (1956) and Brown 
(1966). The specific scale values were 
arbitrarily selected by O during his prac- 
tice trials (Engen & Ross, 1966). No 
standardization observations were used (cf. 
Brown, 1966), and O was instructed to select 
scale values that would be most meaningful 
and comfortable for him. The only sug- 
gestions were that the scale begin at five or 
above and that fractions be avoided. As it 
turned out, three Os used five as a base num- 
ber while the other Os used smaller numbers. 
These base numbers represented each O's
minimal perception of rotation where he
could first give both a direction of rotation 
and a magnitude estimate. A single base 
number was readily determined for each O 
by an examination of his first magnitude 
estimates. 

On each trial O gave a running account 
of his perceived bodily rotation. When he 
was reasonably sure that he was rotating in 
a particular direction, he indicated the direc- 
tion. In addition, he reported a magnitude 
estimate of his perceived velocity using the 
arbitrary numerical scale. As his perception 
of the rate of rotation increased, he reported 
proportionally increasing values which were 
recorded with the time of the report. If O 
perceived a decrease in his perceived velocity, 
he would then report a correspondingly 
smaller number. To maintain his level of 
alertness and to produce regular estimates of 
subjective velocity, O was directed to report 
his estimates frequently throughout each 
trial even if there was no change in the ap- 
parent velocity. If he did not report within 
15 sec. of his most recent report, E signaled 
with a tap on the microphone, and then O 
reported immediately. 

The following angular accelerations were
used: three constant angular accelerations
(1.5°, 1.0°, .5°/sec²) of 2-min. duration (Fig.
1) and four ramp accelerations (.030°, .014°,
.010°, and .006°/sec³), the first being presented
for 2 min. and the remainder for 3
min. (Fig. 2 and 3). A control trial of 4-min.
duration with no change in angular
velocity was also included for each block of
seven accelerations. The Os were informed
that there would be seven different accelerations
and a control trial, the different conditions
being presented in random order. Following
the presentation of each angular
acceleration, there was a 2-min. period of
constant velocity during which O continued
to report on his perception of rotation. A
rest period of at least 3 min. was given between
trials, the final minute of this always
being at a constant angular velocity. The
experimental variables were presented in
random order within blocks of the eight conditions.
Two to four conditions were
presented in each session which lasted approximately
25 min., and at least 30 min.
elapsed between sessions. The direction of
the applied acceleration for the first trial of
each session was randomized. The direction
for succeeding trials was chosen to minimize
the velocity adjustments required between
trials to limit the velocity to ±20
rpm. This procedure usually resulted in the
direction of rotation alternating between
trials. A velocity adjustment also preceded
the first trial of each session since many
trials required a velocity change of more
than 20 rpm. All velocity adjustments were
made at 2°/sec², and a minimum of 1 min.
of constant velocity preceded each trial.
Five trials were taken for each condition
for each O.
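
The need for velocity adjustments between trials follows from simple kinematics. The sketch below computes the total velocity change produced by each stimulus, assuming that the constant accelerations are in deg/sec², that the ramp rates are in deg/sec³, and that each ramp starts from zero acceleration; these unit and starting-point assumptions, and the function names, are not stated in this form in the text.

```python
DEG_PER_SEC_PER_RPM = 6.0  # 1 rpm = 360 deg per 60 sec


def delta_v_constant(accel, duration):
    """Velocity change (deg/sec) under constant acceleration (deg/sec^2)."""
    return accel * duration


def delta_v_ramp(rate, duration):
    """Velocity change (deg/sec) when acceleration grows linearly from zero
    at `rate` deg/sec^3 (assumed starting point)."""
    return 0.5 * rate * duration ** 2


def to_rpm(deg_per_sec):
    return deg_per_sec / DEG_PER_SEC_PER_RPM


# Constant accelerations, each lasting 2 min.
constant_rpm = {a: to_rpm(delta_v_constant(a, 120)) for a in (1.5, 1.0, 0.5)}

# Ramp accelerations: the steepest for 2 min., the others for 3 min.
ramp_rpm = {
    0.030: to_rpm(delta_v_ramp(0.030, 120)),
    0.014: to_rpm(delta_v_ramp(0.014, 180)),
    0.010: to_rpm(delta_v_ramp(0.010, 180)),
    0.006: to_rpm(delta_v_ramp(0.006, 180)),
}

for label, table in (("constant", constant_rpm), ("ramp", ramp_rpm)):
    for stimulus, rpm in table.items():
        print(f"{label:8s} {stimulus:5.3f}  velocity change = {rpm:5.1f} rpm")
```

Under these assumptions several stimuli change the velocity by more than 20 rpm, so a trial has to begin with the simulator already turning in the opposite direction if the velocity is to stay within ±20 rpm.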


RESULTS 


The data were analyzed in two ways. 
First, plots were made of the magnitude 
estimates for each trial for each O 
using his own numerical scale. Magni- 
tude estimates were considered to be 
plus when they were in the direction of 
the acceleration and minus when they 
were in the opposite direction. For the
control runs, plus indicated right and 
minus indicated left. Next, the five 
trials for each condition were com- 
bined in a single curve by plotting the 
medians of the five curves every 10 sec. 
throughout the trial. This produced a 
curve for each of the eight conditions 
for each O. Finally, group curves for 
the eight conditions were plotted from 
the means of the values for the 10 Os 
at each 10-sec. interval (Fig. 1-3). 
Magnitude estimates were equated 
across Os by converting all individual 
magnitude estimates to an arbitrary 
common base of one. Subsequent 
values were assigned by dividing each 
magnitude estimate by O’s base num- 
ber. A second analysis of the data 
was made by determining the average 
magnitude, duration, and latency of the 
magnitude estimates (Table 1). The 
computation of the duration and the 
magnitude of the effects was straight- 
forward, but the latencies presented 
some problems. The Os did not report 
the perception of rotation on all trials; 
therefore, the latencies were computed 
only for those trials in which the effect 
occurred. Consequently, an appreci- 
able error is involved for .5°/sec² and 
.006°/sec³. The error was negligible 


for the other five conditions because 
the effects were observed on practi- 
cally all of these trials. 
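
The aggregation steps described above can be sketched in a few lines. This is only an illustration under stated assumptions: the data are invented, and O's "base number" is taken here to be the first scale step he used, which the text does not state explicitly. The function names are hypothetical.

```python
from statistics import mean, median

def normalize(estimates, base):
    """Divide each signed magnitude estimate by the observer's base number."""
    return [e / base for e in estimates]

def condition_curve(trials):
    """Median across the trials of one condition at each 10-sec sample point."""
    return [median(samples) for samples in zip(*trials)]

def group_curve(observer_curves):
    """Mean across observers at each 10-sec sample point."""
    return [mean(samples) for samples in zip(*observer_curves)]

# Invented data: two observers, three trials each, four sample points.
# Assumption: observer A's base number is 5 and observer B's is 2.
obs_a = [[0, 5, 10, 5], [0, 5, 15, 5], [0, 10, 10, 0]]
obs_b = [[0, 2, 4, 2], [0, 2, 4, 0], [0, 4, 6, 2]]

curve_a = condition_curve([normalize(t, 5) for t in obs_a])
curve_b = condition_curve([normalize(t, 2) for t in obs_b])
group = group_curve([curve_a, curve_b])
print(group)
```

Dividing by each observer's own base number before pooling is what allows curves built on different private number scales to be averaged at all.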

Perception of rotation during and 
following constant acceleration—The 
magnitude estimates for the 10 Os 
showed systematic changes during and 
following the 2-min. periods of constant 
acceleration (Fig. 1) in contrast with 
the control trials, which showed no 
systematic changes (Fig. 3). The gen- 
eral pattern of response was similar 
during the three levels of constant 
acceleration and confirmed earlier find- 
ings (Brown, 1966; Guedry, 1965). 
After a short latency, the magnitude 
estimates for the three accelerations in- 
creased for 30-40 sec. in the direction 
of the induced acceleration. Then, fol- 
lowing a short period at the maximum 
level, the magnitude estimates declined 
and reached zero before the constant 
acceleration terminated. This sequence 
of perceived rotation will be called the 
first effect (Graybiel & Hupp, 1946). 
Its duration was the time from the first 
veridical report of rotation to the time 
when the magnitude estimates returned 
to zero. It should be noted, however, 
that individual trials did not always 
show the continuous, smooth increase 
in the magnitude estimates seen in the 
of the aftereffect, the magnitude esti- 
mates increase in the direction opposite 
to the applied acceleration for 20-30 
sec., the time to reach the peak magni- 
tude estimates increasing with the mag- 
nitude of the applied acceleration. 
Then, as with the first effect, the esti- 
mates gradually declined to zero and 
thus terminated the effect before the 
end of the trial. 

The data also show a clear relation 
between the magnitude estimates and 
the amplitude of the constant angular 
acceleration. Both the magnitude and 
the duration of the first effect and the 
aftereffect increased with the accelera- 
tion (Table 1, Fig. 1 and 4). The 
latency from the beginning of the accel- 
eration to the beginning of the first 
effect decreased as the acceleration in- 
creased. It should be noted that this 
latency for a response, which required 
a verbal, quantitative judgment, was 
several times longer at all acceleration 
levels than the latency found in a 
simpler task in which O responded by 
pressing a key as soon as the direction 
of rotation was perceived (Clark & 
Stewart, 1962). There was no system- 
atic change in the latency for the 
aftereffects. Both the magnitude and 
duration of the aftereffects were con- 
sistently less than they were for the 
first effects, but they were of approxi- 
mately the same magnitude. The indi- 
vidual differences were substantial 
(Table 1) as were the differences in an 
individual O’s reports for successive 
trials. The variation in both the la- 
tency and the duration of the effects 
between Os resulted in an initial rise 
in the average magnitude estimates in 
the group curves before the time indi- 
cated by the mean latencies. Similarly, 
since the maximum magnitude esti- 
mates for different Os fell at different 
points in time, the maximum scale 
values on the curves were necessarily 
less than the mean of the maximum 
arbitrary scale values. Consequently, 
both types of data are required for a 
complete description of these effects. 
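
The point about the group curves can be illustrated with a small numerical sketch. The three curves below are invented, not data from Table 1; they show only that when individual peaks fall at different times, the peak of the averaged curve is lower than the average of the individual peaks.

```python
from statistics import mean

# Invented normalized curves for three observers whose peaks
# fall at different sample points.
curves = [
    [0, 3, 1, 0, 0],
    [0, 1, 3, 1, 0],
    [0, 0, 1, 3, 0],
]

# Group curve: mean across observers at each sample point.
group = [mean(column) for column in zip(*curves)]

peak_of_group = max(group)                      # peak of the averaged curve
mean_of_peaks = mean(max(c) for c in curves)    # average of individual peaks

print(peak_of_group, mean_of_peaks)
```

This is why both the group curves and the tabled means are needed: each loses information that the other retains.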
Perception of rotation during and 
following ramp  accelerations—The 
magnitude estimates of angular velocity 
during and following the constantly in- 
creasing (ramp) accelerations were 
also systematically related to the accel- 
eration, but the pattern of change was 
much more complex (Fig. 2 and 3) 
and frequently involved a waxing and 
waning of the perception of rotation. 
The fluctuation of the magnitude esti- 
mates was apparent in the group 
curves, but it was much more pro- 
nounced in the individual curves (Fig. 
5). The group curves showed a fairly 
long latency followed by an initial gen- 
eral tendency for the magnitude esti- 
mates to increase. Instead of the reg- 
ular increase shown in the group curves 
during the constant accelerations, how- 
ever, the typical response for the three 
smaller ramp accelerations was either 
a cyclic waxing and waning of the re- 
ported velocity or a prolonged constant 
velocity followed by an increase in the 
magnitude estimates. During an accel- 
eration of .030°/sec³, the magnitude 
estimates increased for about 100 sec. 
and then began to decrease. In gen- 
eral, the magnitude and duration of 
the reported rotation increased and the 
latency decreased as the magnitude of 
[Figure caption fragment: magnitude estimates in the direction of acceleration are +, in the opposite direction −.] 


the acceleration increased (Table 1, 
Fig. 2, 3, and 4). It should be noted 
that the magnitude estimates at .006° 
and .010°/sec³ were very much alike, 
whereas those for the two larger accel- 
erations were substantially greater. 

An analysis of the curves for the 
individual trials revealed that all of the 
Os reported cycles of waxing and wan- 
ing of the magnitude estimates on one 
or more trials involving ramp accelera- 
tion (Table 1, Fig. 5). Three Os re- 
ported three of these cycles on one or 
more trials while four other Os re- 
ported two cycles on one or more trials. 
One or more cycles of waxing and 
waning were reported during 79 of 
the 200 trials involving ramp accelera- 
tion. The optimum conditions for 
these cyclic perceptions of rotation ap- 
peared to be at .010° and .014°/sec³ 
since they occurred in over 60% of 
these trials (Table 1). At these levels, 
the magnitude estimates sometimes 
reached zero before increasing on the 
next cycle (Fig. 5). In trials in which 
clear cyclic perceptions of rotation did 
not occur, the typical magnitude esti- 
mates showed a stepwise increase in the 
perception of rotation involving pro- 
longed periods of reports at the same 
estimated velocity. 
The aftereffects reported following 
the ramp accelerations were quite simi- 
lar to those observed when the accel- 
eration was reduced to zero following 
the constant accelerations (Fig. 2 and 
3). When the acceleration was re- 
duced to zero, the magnitude estimates 
in the direction of the acceleration fell 
to zero. There was then a rapid in- 
crease in the magnitude estimates in 
the opposite sense followed by a grad- 
ual decrease terminating before the 2- 
min. period of constant velocity termi- 
nated. There was a substantial in- 
crease in the duration of the aftereffect 
with increasing acceleration of the 
same duration, but again there was no 
systematic change in the latency for 
the aftereffect following the beginning 
of the constant velocity. 

Perception of rotation during con- 
stant velocity: Gyroautokinesis.—The 
seven conditions described so far have 
involved changes in angular velocity 
during the first part of the trial. The 
eighth condition was included primarily 
as a control trial, which produced the 
expectancy that on 1/8 of the trials there 
would be no stimulus. Therefore, on 
any trial, a continuous report of zero 
velocity might be appropriate. It was 
believed that the number of reports in 
the direction opposite to the direction 
of the applied acceleration would there- 
by be reduced, particularly at the be- 
ginning of the trials. A second pur- 
pose of these trials was to determine 
the nature and duration of any reports 
of apparent rotation when no stimulus 
was present. Additional data on ap- 
parent rotation were obtained during 
the regular trials. Illusory movements 
were judged to be indicated by the 
following reports: (a) all magnitude 
estimates reported during the control 
trials, (b) reports of movement during 
the early part of the regular trials 
which were in the opposite sense from 
the acceleration, and (c) reports of 
inappropriate motion following the first 
effect or the aftereffect. All of these 
reports will be referred to as gyroauto- 
kinesis. 

Gyroautokinesis was reported by all 
Os, but the magnitude estimates were 
small and showed no systematic pattern 
of response. This is evident in the plot 
of the group data for the control trials 
which shows that the mean magnitude 
estimates are essentially at zero 
throughout the 4-min. reporting period 
(Fig. 3). It should be noted, however, 
that these are the means of the indi- 
vidual median magnitude estimates for 
each O. Only one O failed to report 
gyroautokinesis during at least one of 
the control trials, and the mean dura- 
tion of the 51 separate reports during 
these 50 control trials was 25.9 sec. 
These were reported on 26 separate 
trials. The maximum magnitude esti- 
mates fell at the first scale step used by 
O for 46 of the 51 reports of gyro- 
autokinesis during the control trials. 
Only one O went as high as 24 steps. 
His series of reports on a single trial 
represented a pattern of response very 
similar to the magnitude estimates reg- 
ularly reported during the constant 
accelerations. Gyroautokinesis, as de- 
fined above, was reported a total of 141 
times during the 400 trials. As would 
be expected, both the number and the 
duration of the reports tended to be 
greater during the control trials and 
during lower levels of acceleration 
(Table 1, Fig. 4). A smaller number 
of these reports was given during the 
final series of trials than during each 
of the first two series of trials. 


DISCUSSION 


First effect—The general pattern of 
the magnitude estimates during the pro- 
longed constant accelerations was very 
similar to the changes in the perception 
of rotation and the oculogyral illusion 
reported by others (Brown, 1966; Guedry, 
1965; Schmaltz, 1931) in spite of 
differences in methods used. The pro- 
cedure used here extended the duration 
of the constant acceleration beyond that 
used by Brown and Guedry, so that the 
complete cycle of response decline was 
demonstrated before the acceleration was 
reduced to zero. An analysis of the indi- 
vidual trials also revealed a limited 
amount of waxing and waning and step- 
wise development of the perception of 
rotation (Table 1). This was not re- 
ported by the earlier investigators nor 
were these effects apparent in the au- 
thors’ group curves. These observations 
have significance for attempts to form- 
ulate a model for the perception of rota- 
tion. 

Ramp accelerations presented O with a 
much more complex and difficult task. 
Waxing and waning or a stepwise in- 
crease in magnitude estimates were com- 
mon. All Os reported that the observa- 
tions were occasionally difficult and at 
times completely confusing. Several Os 
reported that sometimes they felt they 
were oscillating rapidly to the right and 
the left at a fairly fast rate. Several 
also said, “I feel that I am rotating in 
both directions at the same time.” Pre- 
liminary observations by one O who had 
taken part in another related study re- 
vealed that he was unable to report mag- 
nitude estimates for the accelerations 
used in the experiment with the ramp 
accelerations continuing for as long as 
6 or 7 min. During these trials, the 
angular acceleration had reached a level 
far above his threshold determined by 
10-sec. pulses of constant acceleration. 
On the other hand, when greater levels 
of ramp acceleration were used, his pat- 
tern of response was similar to that of 
the 10 Os used in this study. Such observa- 
tions have not been reported by other 
investigators of ramp accelerations 
(Tumarkin, 1937; von Diringshofen, 
Kissel, & Osypka, 1964). There are 
probably two reasons for this. First, they 
instructed Os to make a report of rota- 
tion without correcting their responses. 
Second, they terminated the accelerations 
soon after a report of rotation was made; 
therefore, their Os had no opportunity to 
perceive any subsequent fluctuations in 
the perception of rotation. It is also 
worth noting that if a “threshold” for 
the perception of angular acceleration is 
computed from the latency data, it in- 
creases with the level of the ramp accele- 
ration (Table 1). It would appear that 
the perception of rotation may be widely 
influenced by prolonged, subliminal 
stimulation by angular acceleration. 
These data suggest that during ramp 
accelerations the input from the semi- 
circular canals is modulated by some cen- 
tral nervous system mechanism, and this 
may lead to bizarre perceptual effects for 
suprathreshold accelerations. 
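
The "threshold" computed from latency can be made concrete with a short sketch. It assumes the ramp starts from zero acceleration and that the ramp rate is expressed in °/sec³, so the acceleration reached at the moment of the first report is simply rate × latency. The latencies below are invented for illustration and are not the values in Table 1; the function name is hypothetical.

```python
def acceleration_at_latency(ramp_rate, latency_sec):
    """Angular acceleration (deg/sec^2) reached after latency_sec of a ramp
    that starts at zero acceleration and grows at ramp_rate (deg/sec^3)."""
    return ramp_rate * latency_sec

# Invented latencies (sec) for two ramp rates; NOT data from the study.
hypothetical_latency = {0.006: 60.0, 0.030: 30.0}

for rate, latency in hypothetical_latency.items():
    reached = acceleration_at_latency(rate, latency)
    print(f"rate {rate:.3f} deg/sec^3, latency {latency:.0f} sec "
          f"-> {reached:.2f} deg/sec^2 at first report")
```

In this invented case the latency is shorter for the steeper ramp, yet the acceleration reached at the first report is higher. That is the pattern the text describes: latency falls less than proportionally as the ramp rate rises, so the computed "threshold" increases.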

Aftereffects—The general pattern of 
the aftereffects was the same for both 
types of acceleration, and it is noteworthy 
that these effects were of the same order 
of magnitude as the first effects. All 
Os were able to report the aftereffects 
with no difficulty whatsoever. The 
change in the perceived velocity was 
smooth, although it frequently occurred 
with surprising suddenness following the 
ramp accelerations. At one moment O 
would perceive that he was rotating in 
the direction of the acceleration. Then 
suddenly he would perceive no rotation; 
and more rapidly than he could report ac- 
curately, he would perceive that he was 
rotating in the opposite direction. These 
shifts following the ramp accelerations 
were further complicated by the fact that 
O was sometimes reporting an increasing 
component of a cycle of rotation when 
the acceleration was reduced to zero. 
Such reports tended to increase for sev- 
eral seconds before the perception of ro- 
tation was reduced to zero and the after- 
effect occurred (cf. Brown, 1966). Con- 
sequently, the mean latency of the after- 
effects was long and unrelated to the 
level of the ramp acceleration. 

A final point related to the effects of 
“subliminal stimulation” is important. 
Magnitude estimates describing a first 
effect were not a prerequisite for reports 
of the aftereffect. Aftereffects were re- 
ported following a constant or ramp 
acceleration during 19 trials in which no 
first effect was reported. These observa- 
tions are in accord with Spitz’s (1965) 
report of aftereffects following subliminal 
visual stimulation by a rotating spiral. 
No detailed studies of subjective after- 
effects have been found, but the prolonged 
aftereffects reported by Os in this study 
appear to be in direct conflict with the 
oscillatory subjective effects reported by 
Schmaltz (1931) and van Egmond, 
Groen, and Jongkees (1952) for accelera- 
tions of shorter duration. 

Magnitude estimates as a function of 
Time × Acceleration—Brown (1966) 
undertook a mathematical analysis of 
changes in subjective velocity for the 
initial increasing period of the magnitude 
estimates during prolonged constant 
accelerations. He plotted the magnitude 
estimates on log-log coordinates against 
a Complex Time × Acceleration Quan- 
tity. The equation required to fit his data 
for relatively large accelerations was 
$S = .19\,t^{.8}(a - 2.0)$, where S is the magnitude 
estimate, t is the time in seconds from 
the beginning of the acceleration, and a is 
the constant acceleration. The value, 2.0, 
is an assumed threshold level. It was 
possible to produce a similar plot of the 
ascending segment of the data and deter- 
mine a straight line fit. However, the 
authors also analyzed the magnitude esti- 
mates by establishing computer derived 
constants from a general equation of the 
form $ME = A\,[t^{B}(a - C)^{D}]$, where ME is 
the magnitude estimate, t is the time from 
the beginning of the acceleration, a is the 
imposed acceleration, and the other pa- 
rameters including C were free constants 
to be determined for a best fit by the 
least-squares procedure. This analysis, 
with C varying from .0 to 4, revealed 
a very low variation in the sum of the 
squared errors used as the best fit criter- 
ion for a substantial range of values of 
C. The authors made a similar analysis 
of Guedry’s (1965) data for subjective 
angular speed for the oculogyral illusion, 
and this also revealed an insensitivity to 
variability in C. 
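
A fit of this general kind can be sketched as follows. This is not the authors' program. It assumes the model is a product of a power of time and a power of acceleration above C, with the constant and exponents named A, B, and D here, and it assumes the fit is done by ordinary least squares on logarithms for each fixed value of C. The data are synthetic, generated from known constants, and all names are hypothetical.

```python
import math

def solve3(m, v):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    a = [row[:] + [rhs] for row, rhs in zip(m, v)]
    n = 3
    for i in range(n):
        p = max(range(i, n), key=lambda r: abs(a[r][i]))
        a[i], a[p] = a[p], a[i]
        for r in range(i + 1, n):
            f = a[r][i] / a[i][i]
            for c in range(i, n + 1):
                a[r][c] -= f * a[i][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = a[i][n] - sum(a[i][c] * x[c] for c in range(i + 1, n))
        x[i] = s / a[i][i]
    return x

def fit_fixed_c(data, c):
    """Least-squares fit of log ME = log A + B*log t + D*log(a - c).

    data: (t, a, me) triples with t > 0, a > c, and me > 0.
    Returns (A, B, D, sse), where sse is the squared error in log units.
    """
    rows = [(1.0, math.log(t), math.log(a - c)) for t, a, _ in data]
    ys = [math.log(me) for _, _, me in data]
    # Normal equations: (X'X) beta = X'y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, ys)) for i in range(3)]
    log_a, b, d = solve3(xtx, xty)
    sse = sum((y - (log_a + b * r[1] + d * r[2])) ** 2 for r, y in zip(rows, ys))
    return math.exp(log_a), b, d, sse

# Synthetic data from assumed constants (A, B, D, C); not the study's data.
true_a, true_b, true_d, true_c = 0.2, 0.8, 1.0, 0.4
data = [(t, a, true_a * t ** true_b * (a - true_c) ** true_d)
        for a in (0.5, 1.0, 1.5) for t in (10, 20, 30, 40)]

# Scan C and report the fitted constants and the error for each value.
for c in (0.0, 0.1, 0.2, 0.3, 0.4):
    A, B, D, sse = fit_fixed_c(data, c)
    print(f"C={c:.1f}  A={A:.3f}  B={B:.3f}  D={D:.3f}  SSE={sse:.5f}")
```

Scanning C over a grid and comparing the resulting error sums is one simple way to examine how sharply the data constrain C. Note that the other fitted constants change with C even when the error changes little, which bears on the interpretation of those constants.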

In Brown’s (1966) analysis, C was 
assumed to be 2.0, which represented an 
arbitrarily selected threshold to give a 
good fit to the data. This value is several 
times the empirically derived threshold 
for Os in this study, determined for 
accelerations of 10-sec. duration. Con- 
versely, the best fit for the data was found 
for a C of .1 or approximately one-quarter 
of Os’ measured threshold. Again, C of 
.37 produced the best fit for Guedry’s 
(1965) data. This value is close to the 
mean rotation threshold for Os in the 
study. However, if a threshold for the 
oculogyral illusion were to be used, it 
would probably be substantially below the 
rotation threshold because there is evi- 
dence that thresholds for the oculogyral 
illusion tend to be well below those de- 
rived from simple reports of the percep- 
tion of rotation (Clark, 1967). The 
term C, then, appears to be equivocal in 
meaning in these equations and of rela- 
tively little importance in producing the 
best fitting straight line. Nevertheless, 
C is important in the determination of 
the other parameters of the equation if 
one is to derive a meaningful mathemat- 
ical model for the phenomenon since it is 
a measure of sensitivity. Because the 
model requires a measure of sensitivity, 
the issue of the appropriate method to 
determine it is crucial. Furthermore, it 
would appear that the significance of the 
other parameters has yet to be established. 
Finally, the evidence available suggests 
that it is questionable whether these data 
from magnitude estimates resulting from 
prolonged constant accelerations will be 
adequately described by a power function 
similar to those found for varied stimuli 
of known intensities. 

Mechanisms of vestibular function — 
The phenomenon of adaptation is one of 
the most ubiquitous and fundamental 
processes in sensory psychophysiology. 
Moreover, the nature of adaptation in any 
sensory modality reveals the basic mode 
of action of the particular perceptual sys- 
tem. Consequently, any thoroughgoing 
theory of vestibular function must account 
for these effects in a satisfactory manner. 
It is in the area of adaptation that the 
torsion-pendulum theory fails (Guedry, 
1965), although it predicts the gradual 
increase in the perception of rotation and 
the prolonged response decline following 
short periods of acceleration. The pri- 
mary results in this experiment requiring 


a more complex explanatory mechanism 
are the following: (a) the response de- 
cline in the perception of rotation during 
a prolonged constant acceleration, (b) 
the waxing and waning or stepwise in- 
crease in the perception of rotation, par- 
ticularly during the ramp accelerations, 
(c) magnitude estimates of zero during 
constant and ramp accelerations for as 
long as 2 min. when the angular accelera- 
tion was far above O's threshold deter- 
mined by other methods, and (d) the 
prolonged aftereffects of rotation. The 
rather bizarre perceptual effects during 
the ramp accelerations must also be con- 
sidered. Any comprehensive theory of 
the perception of rotation must take these 
into account as well as the short-term 
effects which are accurately predicted by 
the torsion-pendulum theory. 

Possible mechanisms that may account 
for the response decline of the perception 
of rotation during constant acceleration 
have been summarized by Guedry (1965). 
He considers both peripheral and central 
processes and favors a central mechanism 
in the light of his finding that nystagmus 
remains constant while the perception de- 
clines (Guedry, 1965; Guedry & Collins, 
1968). The authors’ data obtained 
during constant acceleration, of course, 
support his findings, and the observations 
made during the ramp accelerations and 
the aftereffects also support a central 
process since they are particularly dif- 
ficult to fit to a simple peripheral model 
(Schmaltz, 1931). On the other hand, 
they also require a more complex me- 
chanism than the observations during 
constant acceleration. These considera- 
tions suggest that a comprehensive theory 
of vestibular function must include some 
mechanism for the central modulation of 
information from the semicircular canals 
as described by the torsion-pendulum 
theory. The nature of the central mod- 
ulation would appear to be highly com- 
plex. The data indicate that the function 
is related to the duration of the stimula- 
tion and that it may be a nonlinear func- 
tion controlled by some switching mech- 
anism. The precise nature of this 
function will be the subject of further 
investigation in this laboratory. 
Experimental data from various sources 
indicate that the effects of vestibular 
stimulation must be considered at several 
levels, of which four may be noted. The 
first is the activity of the cupula- 
endolymph system that has been described by 
a straightforward physical model (Gue- 
dry, 1965; van Egmond, Groen, & Jong- 
kees, 1952). In addition to peripheral 
neural connections, there are relatively 
independent central nervous system pro- 
cesses involved in three reflex and per- 
ceptual effects. The first of these is reflex 
nystagmus which has been studied in de- 
tail and is also dependent on higher ner- 
vous activity (Guedry, 1965). The sec- 
ond is the perception of rotation which, 
as indicated above, is relatively independ- 
ent of nystagmus. Finally, there are 
intersensory processes in the oculogyral 
illusion (Graybiel & Hupp, 1946; Guedry, 
1965) that involve interactions between 
vestibular and visual mechanisms. Each 
of these has its own unique attributes, 
and all of them must be considered in any 
comprehensive conception of vestibular 
behavior. 

Gyroautokinesis—The perception of 
movement in the absence of real move- 
ment is one of the classical problems in 
psychology. Autokinesis is one of many 
of these effects. Although visual auto- 
kinesis has been studied most extensively, 
autokinetic movement has been reported 
for auditory and tactual stimuli (Ander- 
son, 1965; Spigel, 1965). Reports of 
rotation in the absence of angular accele- 
ration are well known to investigators of 
the perception of rotation, but no system- 
atic investigations of such autokinetic 
movements have been found. Luria 
(1965) reported a “postural autokinetic 
effect” similar to what the authors have 
called gyroautokinesis, but her primary 
concern was with the effects of verbal 
and “apparatus suggestion” rather than 
with a detailed description of the effect 
itself. Although no comprehensive theory 
of the various autokinetic effects has been 
developed, a number of peripheral theories 
(e.g., the eye movement theory of visual 
autokinesis) have been disproved. The 
crucial experimental variables for an opti- 
mum effect in the case of visual auto- 
kinesis were present in this experiment. 
First, there was a structureless field, 
darkness, which resulted in an absence of 
an articulated visual frame of reference. 
Second, there was an expectancy, or what 
Luria calls apparatus suggestion, that the 
cockpit would accelerate during seven 
out of the eight trials. The importance 
of suggestion in visual autokinesis has 
been well known for many years (Sherif, 
1935), and, for gyroautokinesis, is sup- 
ported by Luria’s study and by the obser- 
vations of Dorcus (1937) that the 
perception of rotation may result from 
hypnotic suggestion whereas nystagmus 
may not. Thus, gyroautokinesis may be 
understood as a result of minimal infor- 
mation regarding body position to the 
central nervous system associated with the 
expectancy of rotation. 

Evaluation of the method of magnitude 
estimation.—The observations in this ex- 
periment support Brown’s (1966) con- 
clusion that the method of magnitude esti- 
mation is a highly satisfactory technique 
to obtain periodic, direct estimates of 
angular velocity over time. Although the 
arbitrary scale values used by Os must 
be interpreted with caution, the method 
directs O’s attention to the specific per- 
ceptual experience in contrast with other 
methods. Furthermore, the pattern of 
response is clearly defined. Os were 
generally surprised at the ease with which 
the reports could be made even in the 
absence of the “standard stimulus” used 
by Brown. The magnitude estimates 
were judged to be easier to make than the 
threshold observations. Indeed, some Os 
were reporting what they felt were mean- 
ingful magnitude estimates and were giv- 
ing systematic patterns of response on the 
first practice trial using their own num- 
ber system. It would appear, therefore, 
that the rather prolonged practice series 
used with Os in this study was unneces- 
sary, although some practice trials are 
certainly indicated. In this respect, the 
method has a clear advantage over asking 
O to signal when he turns through some 
arbitrary angle (Guedry, 1965). The 
ease of reporting is all the more note- 
worthy in the light of the novelty of re- 
porting velocity of rotation on the basis 
of vestibular information alone. The 
validity of the method is also supported by 
the similarity between the authors’ find- 
ings and those reported by Guedry 
(1965). Magnitude estimation is a 
simple, easily learned method of describ- 
ing subjective changes in the perception 
of angular velocity and appears to have 
value in a variety of experiments that re- 
quire rapid reports of changing magnitude 
of the perception of bodily rotation. 

INTERFERENCE WITH AVOIDANCE BEHAVIOR: 
FAILURE TO AVOID TRAUMATIC SHOCK 1 


J. BRUCE OVERMIER 


University of Minnesota 


Ss were exposed to inescapable traumatic shock. Then, either 24, 48, 
72, or 144 hr. later they were given instrumental avoidance training 
using a technique which eliminated all escape contingencies. Short 
time intervals between inescapable shock and avoidance training pro- 
duced severe interference with the acquisition of avoidance responding 
as compared with a control group. But at longer intervals the 
amount of interference decreased. These results imply that the 
interference phenomenon is not mediated by the concurrent presence of 
shock, as has been suggested. 


Overmier and Seligman (1967) and 
Seligman and Maier (1967) have 
shown that prior exposure to inescapa- 
ble shock results in transituational in- 
terference with the first 10 trials of 
escape/avoidance responding. How- 
ever, both experiments used the 
“method of emergence” avoidance- 
training technique (Solomon & Brush, 
1956), the early trials of which lead to 
instrumental escape behavior with typi- 
cally decreasing latencies from which 
avoidance responses later develop. The 
interference observed, then, was upon 
escape behavior. Seligman and Maier 
theorized that the probability of re- 
sponding during shock has decreased 
because S learned during inescapable 
shock that shock termination was inde- 
pendent of its responses, and that the 
presence of shock mediated generaliza- 
tion of nonresponding during shock to 
the new situation. 

Such a view suggests that this tran- 
situational interference with instru- 
mental behavior should not be observed 
if one were to focus on avoidance be- 
havior and to train Ss using an avoid- 
ance-training technique that is not 
confounded with escape contingencies. 
In any case, it has yet to be demon- 
strated that exposure to inescapable 
shock subsequently interferes with in- 
strumental avoidance responding. The 
present experiment asked whether or 
not avoidance behavior is affected by 
prior experience with inescapable 
shock. 

1 This research was supported by research 
grants to the author by the graduate school 
of the University of Minnesota and by the 
National Institute of Mental Health (MH- 
13558). The author wishes to thank M. A. 
Trapold for his criticisms of an earlier draft 
of this paper. 


METHOD 


Subjects—The Ss were 35 mongrel dogs 
with shoulder heights between 15 and 19 in. 
and were purchased from the Animal Hos- 
pital of the University of Minnesota. The 
Ss were maintained on ad-lib food and 
water in individual cages. During the course 
of the experiment 4 Ss had to be discarded 
because of health and apparatus program 
failures. One additional S was discarded 
because shedding of long hair short-circuited 
the shocking grids. The remaining 30 Ss 
were randomly assigned to four experi- 
mental groups and one control group of 
6 Ss each. 

Apparatus —There were two distinctively 
different apparatus units. The first unit was 
used to expose S to inescapable electric shock 
and consisted of a rubberized cloth hammock, 
through which S’s legs hung down, located 
inside a shielded sound-reducing cubicle. The 
S was secured into the hammock with leg 
ties. Shock was administered to S through 
stainless steel plate electrodes coated with 
electrode paste and taped to the footpads 
of the hind feet. The shock source con- 
sisted of a variable autotransformer, step-up 
transformer, and large series resistor. A 
meter allowed E to monitor continuously and 
adjust the autotransformer so that the cur- 
rent delivered to S was 6 ma. through 100 K 
ohms. 

The unit in which S was tested for in- 
terference with avoidance responding was a 
two-way shuttle box with two compartments 
separated by an 11-in. barrier (see Solomon 
& Wynne, 1953). Each compartment was 
45 × 24 in. and 40 in. high with plywood 
walls painted flat black, and a hardware 
cloth ceiling. Above the hardware cloth 
ceiling over each compartment was a cham- 
ber containing a 150 w. lamp, and a 74 w. 
lamp, and two dual cone speakers. One 
speaker provided approximately 80 db. SPL 
of white noise into each compartment. The 
other speaker was used to present the audi- 
tory CS at approximately 10 db. above the 
noise level. A one-way mirror was mounted 
in the front of each upper chamber to 
permit continuous observation of Ss. Elec- 
tric shock was administered through a grid 
floor of 14 in. aluminum bars placed § in. 
apart. A commutator shifted the polarity 
patterns of the bars six times per second. 
The shock was 9 ma. through 50 K ohms. 

All stimulus presentations in both appa- 
ratus units were controlled by automatic 
relay circuitry at a remote location. 

Treatments—All treatments were admin- 
istered on successive days. The four ex- 
perimental groups received two treatments: 
(a) exposure to inescapable shock and (b) 
instrumental avoidance training in which 
escape was not possible. The control group 
received only the second treatment. 

Exposure to inescapable shock consisted 
of 60 presentations of 6 ma. shock, each of 
5-sec. duration. All inescapable shocks were 
unsignalled. The intershock interval aver- 
aged 80 sec. (range 40 to 120 sec.). 

Instrumental avoidance training followed 
exposure to inescapable shock by a length of 
time that differed for each of the experi- 
mental groups. The intervals were 24, 48, 
72, and 144 hr. 

Twenty-one instrumental avoidance train- 
ing trials were given each day until S met 
a criterion of 10 consecutive avoidances in 1 
day or until 5 days had elapsed. In no case, 
however, did an S receive less than 3 
days of such training. Each instrumental 
avoidance training trial began with the onset 
of the auditory CS (1900 cps impure sine 
wave tone) and the CS remained on until 
trial termination. The CS-UCS interval 


was 10 sec. If S jumped the barrier during 
this interval, the CS terminated and the US 
was not presented. Failure to jump the 
barrier during the CS-UCS interval led to a 
shock which was intense (9 ma,) but brief 
(5 sec.). The duration of the shock was 
so short compared to barrier-jumping laten- 
cies that the shock was inescapable. There- 
fore, this technique permits the study of 
avoidance acquisition unconfounded by es- 
cape contingencies. 
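
The trial contingency and the acquisition criterion described above can be sketched as follows. This is a minimal illustration under stated assumptions: a jump counts as an avoidance only if its latency from CS onset is less than the 10-sec. CS-UCS interval, and the function names and example latencies are hypothetical rather than taken from the experiment.

```python
CS_UCS_INTERVAL = 10.0  # sec. from CS onset to the scheduled shock
CRITERION_RUN = 10      # consecutive avoidances required within 1 day

def is_avoidance(jump_latency):
    """True if S crossed the barrier before the shock was due.

    jump_latency is seconds from CS onset, or None if S never jumped.
    """
    return jump_latency is not None and jump_latency < CS_UCS_INTERVAL

def longest_run(outcomes):
    """Length of the longest unbroken run of avoidances in one day."""
    best = current = 0
    for avoided in outcomes:
        current = current + 1 if avoided else 0
        best = max(best, current)
    return best

def met_criterion(day_latencies):
    """True if one day's trials contain 10 consecutive avoidances."""
    outcomes = [is_avoidance(x) for x in day_latencies]
    return longest_run(outcomes) >= CRITERION_RUN

# Hypothetical day of 21 trials: 5 failures, 10 avoidances, 6 failures.
day = [12.0] * 5 + [4.0] * 10 + [None] * 6
print(met_criterion(day))
```

The same run-length helper also gives the "longest run of avoidance" index when applied to the outcomes of all 3 days.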

The intertrial interval used during avoid-
ance training averaged 80 sec. (range 40 to
120 sec.). Occasionally S jumped the
barrier during the intertrial interval (ITI);
if S had high-frequency intertrial responding
(10/min), the gate between the compartments
was closed and a 15-min. ITI administered.
If S failed to cross the barrier either during 
the CS-UCS interval or during ITI for 10 
consecutive trials he was taken from the one 
side of the shuttle box and placed in the 
other. 


RESULTS 2


The five groups differed in the ease 
with which they acquired the avoid-
ance response. One S in each of the
24- and 48-hr. experimental groups 
failed to make a single avoidance re- 
sponse and half of the Ss in both the 
24- and 48-hr. groups failed to reach 
the avoidance-learning criterion in 5 
days of training. All Ss in the 72- 
and 144-hr. groups reached the cri- 
terion within the 5 avoidance-training 
days. Figure 1 presents the mean 
number of avoidances per day for each 
group for the 3 days of avoidance train- 
ing that all Ss received. The control 
group is treated as if it were temporally 
the furthest removed from inescapable 
shock treatment. An analysis of vari- 
ance revealed that Ss showed learning, 
Days, F (2, 50) = 31.1, p < .001, but 
that the groups were differentially effi-
cient, Groups × Days, F (8, 50) =
2.31, p < .05. A test of the main ef-
fects of groups was significant on Day
3, F (4, 75) = 3.67, p < .01. Pair-
wise comparisons for that day revealed


2 All F and t tests were two-tailed. 
Fig. 1. Mean number of avoidance re-
sponses for each group on each of the 3
days during which all Ss received avoidance 
training. (Each point represents a total of 
126 trials, 21 for each of six Ss.) 


that the 24- and 48-hr. groups differed 
from the 72-hr., 144-hr., and control 
groups, α = .05 critical difference be-
tween daily means of 7.83.

Table 1 gives the mean values of 
several acquisition indexes for each 
group. The differences in the acquisi- 
tion of avoidance responding as a func- 
tion of the time interval between ines- 
capable shock and avoidance training 
are reflected in these indexes of ac- 
quisition. The 24-hr. group tended to 
be poorer than the control group on 
every measure as a result of the ines- 
capable shock, but the effect of the in- 
escapable-shock treatment dissipated 
with time. For example, both the 24-
and 48-hr. groups required more trials 
to criterion than the control group, 
Dunnett’s t (25) = 3.21, p < .02, and 
2.66, p < .05, respectively, while the 
72- and 144-hr. groups did not differ 
from the control group, Dunnett’s t 
(25) = .09 and .50, both p’s > .50.
Analysis of these relationships among 
the groups revealed either a significant 
upward or downward trend across 
groups, T test for trend, T < 1, p <
.05 (Whitney, 1961), for every index
in Table 1. 

Examination of Fig. 1 and Table 1 
suggests that avoidance responding by 
the 24- and 48-hr. groups was not 
interfered with equally, but the two 
groups did not differ significantly on 
any of these measures, though both 
were not always equally different from 
the control group. 


Discussion 


Exposure to inescapable shock inter- 
feres greatly with subsequent acquisition 
of instrumental avoidance responding in 
a task which has no escape contingencies. 
These results are contrary to Seligman 
and Maier’s (1967) theory of the me- 
chanism of interference which assumes 
that Ss learn not to respond during in- 
escapable shock and then evidence this 
learning during shock in a new context. 
Here interference was evidenced during 


TABLE 1
INDEXES OF ACQUISITION

                                                      Groups
Index (Mean Values)                    24      48      72      144    Control   Trend

Trials to First Avoidance             23.33   29.17   12.83   11.50     9.00    Downward
Trials to Criterion                   81.50   75.33   46.83   51.13    45.83    Downward
Probability of Avoidance on First
  or Second Trial Following First
  Avoidance*                            .20     .40     .50     .50      .50    Upward
Total Avoidances in 3 Days            16.17   19.83   31.00   32.50    32.50    Upward
Longest Run of Avoidance in 3 Days     7.50   11.84   19.50   19.35    25.83    Upward

* Calculations included only those Ss which actually made a first avoidance.
the presence of only a signal for shock.
Of course, shocks could occur in the 
shuttle box, but shock was never concur- 
rently present at the time of the avoid- 
ance response and therefore could not di- 
rectly mediate the interference as Selig- 
man and Maier suggested. They are 
probably correct, however, in assuming 
that “helplessness” involves learning about 
the independence between responding and 
the presentation and/or termination of 
shock. Rescorla (1967) has recently 
demonstrated that Ss can learn about the 
degree to which two exteroceptive stim- 
ulus events are correlated. So it seems 
reasonable to assume that they can learn 
about the degree of correlation between a 
behavioral event and an exteroceptive 
one. 

Earlier investigations of interference 
due to exposure to inescapable shock have 
observed that, in contrast with control 
Ss, the occurrence of an escape response 
by experimental Ss often did not predict 
further escapes. The present experiment 
found that the occurrence of an avoidance 
response did not predict occurrence of 
subsequent avoidances for the 24- and 48- 
hr. Ss within a reasonable number of 
trials—i.e., that the avoidance did not 
“stick.” A possible implication of this 
finding is that fear reduction (at CS 
termination) or pain reduction (at US 
termination) are not as effective rein- 
forcing events for preexposed Ss as for 
Ss never previously exposed to pain and 
fear eliciting stimuli. 

This reduced effectiveness of the termi- 
nation of aversive stimuli as reinforcers 
(if that is what it is) is, however, only 
temporary. This experiment found that 
the effects of exposure to inescapable 
shock upon subsequent avoidance re-
sponding dissipate rapidly in time, leaving
an apparently normal S after 72 hr. 
though some indexes of acquisition sug- 
gest that recuperation is not fully com- 
plete even after 144 hr. Overmier and 
Seligman observed that the interference 
effect is time-dependent but they reported 
that Ss were apparently normal after only 
48 hr. 

The present experiment suggests that 
the interference with avoidance behavior 
is more persistent than the interference 
with escape behavior. Further, the inter-
ference with avoidance behavior is not
dependent upon the concurrent presence 
of shock. 
SUPPLEMENTARY REPORTS


EFFECT OF EAR STIMULATED ON REACTION TIME 
AND MOVEMENT TIME 


J. RICHARD SIMON 1 


University of Iowa 


In a unimanual choice RT task, Ss moved a control handle to the 
right or left from the midline of the body in response to monaural 
verbal commands of “right” or “left” which were presented to the 
right or left ear. Both RT and MT were significantly faster when 
the content of the command corresponded to the ear stimulated than 
when it did not; i.e., the time required to initiate and execute the
movement was affected by a cue irrelevant to the task itself, the ear 
in which the command was heard. RT was faster for abductive 
movements toward the side of the responding member whereas MT 
(for the right hand) was faster when movements were directed
toward the opposite arm.


In a recent experiment, Simon and Rudell 
(1967) noted a potent population stereotype 
which affected reaction time (RT) to verbal 
directional commands. Their Ss pressed a
right- or left-hand key in response to the 
commands “right” or “left” which were 
presented to the right or the left ear. RT 
was significantly faster when the content 
of the command corresponded to the ear 
stimulated (i.e., right in the right ear or
left in the left ear) than when it did not
(i.e., right in the left ear or left in the
right ear). In other words, information 
processing was affected by a cue irrelevant 
to the task itself, the ear in which the 
command was heard. 

The present study was designed to de- 
termine whether the above-mentioned stereo- 
type is specific to a bimanual task or whether 
it is also present in a unimanual task involv- 
ing the movement of a lever to the right or 
left. If the former were true, it would 
suggest that the stereotype derives from a 
simple isomorphic association of right-ear 
stimulation with a right-hand response and 
left-ear stimulation with a left-hand re- 
sponse. If the latter were true, it would 
suggest that the stereotype is a manifesta-
tion of a more basic orienting reflex; i.e., a
reaction toward the source of stimulation
(Razran, 1961). This study was also con-
cerned with determining whether the stereo-
type in question affected movement time
(MT) as well as RT.


1 The author acknowledges the assistance of Philip
Tolin in collecting the data. 


Method—The Ss’ task was to move a con- 
trol handle to the right or left from a center 
position in response to a recorded series of 
132 commands used previously (Simon & 
Rudell, 1967). Each command consisted of 
the words right or left and was presented 
through earphones to the right or the left 
ear in a predetermined random sequence. 
A klockounter which recorded RT started 
when a command was presented and stopped 
when S moved the handle away from the 
center position. Another klockounter which
recorded MT started when the handle had 
been moved from the center position and 
stopped when the 10-in. lateral movement 
had been completed. The Ss were instructed 
to “react and move as fast as you can on 
each trial.” The Ss were 32 male and 32 
female undergraduates. All were right- 
handed. Half of the males and half of the 
females performed with their right hand 
while the other half of each group performed 
with their left hand. 

Results—Median RTs and MTs were 
computed for each S for each of the four 
experimental conditions; i.e., right and left
commands in the right and in the left ear. 

The left half of Table 1 summarizes the 
mean RT under each treatment condition for 
the right- and left-hand groups. Analysis 
of variance revealed no differences as a func- 
tion of the main effects of ear stimulated, 
command, or responding member. There 
was, however, a significant difference be-
tween the sex groups, F (1, 60) = 7.69, p <
.01, with males responding significantly faster
than females (377 vs. 419 msec.).
TABLE 1

REACTION TIME AND MOVEMENT TIME (MSEC.) TO VERBAL COMMANDS AS A FUNCTION
OF EAR STIMULATED AND RESPONDING MEMBER

                                             Reaction Time        Movement Time
                                               Command              Command
Responding Member       Ear Stimulated     Right  Left    M     Right  Left    M

Right- (Preferred)      Right ear           365   415   390      182   169   175
  Hand Group            Left ear            401   376   389      183   166   175
                        M                   383   395   389      183   168   175
Left- (Nonpreferred)    Right ear           392   424   408      166   168   167
  Hand Group            Left ear            426   384   405      169   166   168
                        M                   409   404   407      167   167   167
Right- and Left-Hand    Right ear           378   420   399      174   168   171
  Groups Combined       Left ear            414   380   397      176   166   171
                        M                   396   400   398      175   167   171


The analysis also revealed a significant 
Command × Responding Member interaction,
F (1, 60) = 7.11, p < .01. Table 1 shows
that the right-hand group reacted faster to
the right command than to the left com-
mand (383 vs. 395 msec.) whereas the left-
hand group reacted faster to the left com-
mand than to the right command (404 vs.
409 msec.). This interaction is probably 
due to the nature of the reaction required to 
respond to the command. Both the right re- 
action with the right hand and the left reac- 
tion with the left hand involved an abduc- 
tive movement (from the midline of the 
body) toward the same side as the respond- 
ing member. In contrast, the left reaction 
with the right hand and the right reaction 
with the left hand involved an abductive 
movement toward the opposite arm. 

The major finding was a significant Com-
mand × Ear Stimulated interaction, F (1,
60) = 179.69, p < .001. This effect is pic-
tured in the left half of Fig. 1. It will be
noted that RT to the right command was 
faster when it was heard in the right ear 
than when it was heard in the left ear (378 
vs. 414 msec.) and, similarly, RT to the left 
command was faster when it was heard in 
the left ear than when it was heard in the 
right ear (380 vs. 420 msec.). Analysis of 
responses of individual Ss revealed that the 
difference between right and left ear re- 
sponses was in the predicted direction in 117 
out of 128 separate comparisons. 
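
The size of the correspondence effect on RT can be recovered from the combined cell means in Table 1. The sketch below is illustrative only: the dictionary layout and the function name are hypothetical, and the calculation is simply the mean of the two noncorresponding cells minus the mean of the two corresponding cells, not the analysis of variance reported above.

```python
# Mean RT (msec.) for both hand groups combined, from Table 1.
# Keys are (command, ear stimulated).
mean_rt = {
    ("right", "right"): 378,
    ("right", "left"): 414,
    ("left", "left"): 380,
    ("left", "right"): 420,
}

def correspondence_effect(cells):
    """Mean of noncorresponding cells minus mean of corresponding cells."""
    corresponding = [v for (cmd, ear), v in cells.items() if cmd == ear]
    noncorresponding = [v for (cmd, ear), v in cells.items() if cmd != ear]
    return (sum(noncorresponding) / len(noncorresponding)
            - sum(corresponding) / len(corresponding))

print(correspondence_effect(mean_rt))  # 38.0
```

On these means, commands heard in the noncorresponding ear were answered about 38 msec. more slowly than commands heard in the corresponding ear.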

The right half of Table 1 summarizes 
the mean MT under each treatment condition 
for the right- and left-hand groups. Analy- 
sis of variance revealed that movements to 
the left were significantly faster than move-
ments to the right (167 vs. 175 msec.), F (1,
60) = 30.99, p < .001. Inspection of Table 1
reveals, however, that this difference ap-
peared only in the right-hand group, Com-
mand × Responding Member interaction, F
(1, 60) = 28.18, p < .001.

The analysis also indicated that males 
moved significantly faster than females (158 
vs. 184 msec.), F (1, 60) = 6.75, p < .05.
The other main effects, ear stimulated and 
responding member, were not significant. 

Of perhaps greatest interest was the sig-
nificant Command × Ear Stimulated inter-
action, F (1, 60) = 10.49, p < .01. The
right half of Fig. 1 shows that movements to 
the right were faster when the right com- 
mand was heard in the right ear than when 
it was heard in the left ear. Similarly, 
movements to the left were faster when the 
left command was heard in the left ear than 
when it was heard in the right ear. Though 
the differences between the means were 
small, the direction of the differences was in 
the hypothesized direction in 75 out of 128 
comparisons with seven ties. This relation-
ship was observed in both right- and left-
hand groups but was more marked for males
than for females, Command × Ear Stimu-
lated × Sex interaction, F (1, 60) = 6.62,
p < .05.

Fig. 1. Effect of ear stimulated on RT and MT.
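
The direction counts reported for RT (117 of 128) and MT (75 of 128 with seven ties) lend themselves to a simple sign test. The sketch below is not an analysis reported in the study. It assumes that the comparisons can be treated as independent, which is doubtful since each S contributed two of them, and that ties are dropped before testing; the function name is hypothetical.

```python
from math import comb

def sign_test_two_sided(successes, n):
    """Exact two-sided binomial (sign) test against p = .5."""
    k = max(successes, n - successes)
    upper_tail = sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n
    return min(1.0, 2 * upper_tail)

# RT: 117 of 128 comparisons in the predicted direction.
p_rt = sign_test_two_sided(117, 128)
# MT: 75 in the predicted direction; the 7 ties are dropped,
# leaving 121 untied comparisons.
p_mt = sign_test_two_sided(75, 128 - 7)
print(f"RT p = {p_rt:.2e}, MT p = {p_mt:.3f}")
```

Under these assumptions both counts depart from an even split, the RT count far more decisively than the MT count, which agrees with the relative sizes of the two interaction F ratios.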

Discussion.—Results of the present study 
indicated that the speed of processing direc- 
tional commands (i.e., the words right and 
left) was affected by an irrelevant cue; that 
is, the ear in which the command was heard. 
RT to right in the right ear was faster than 
to right in the left ear, and, similarly, RT 
to left in the left ear was faster than to 
left in the right ear. These results, ob- 
tained on a unimanual task, paralleled those 
noted by Simon and Rudell (1967) on a 
bimanual task. 

Another notable result was that MT was 
also affected by the ear in which the com- 
mand was heard, movements being faster 
when the content of the command cor- 
responded to the ear stimulated than when 
it did not. Whereas the literature contains 
numerous examples of display characteristics 
affecting RT, there are apparently few, if 
any, examples of display characteristics af- 
fecting the rate of a ballistic movement once 
it has been initiated (McCormick, 1964). 

The response interference observed in the 
present studies to date is, apparently, unre- 
lated to sensory motor connections or hemi- 
spheric dominance since there is no inter- 
action between ear stimulated and handed- 
ness (Simon & Rudell, 1967), and there is 
also no interaction between ear stimulated 
and responding member on either uni-
manual or bimanual tasks. Initially, it ap-
peared that the interference was due to a
strong natural tendency to associate right- 
ear stimulation with a right-hand response 
and left-ear stimulation with a left-hand re- 
sponse. Indeed, in a bimanual key-pressing 
task with monaural pure tones as stimuli, Ss 
responded faster on blocks of trials involving 
uncrossed reactions (responding with hand 
ipsilateral to ear stimulated) than they did 
on trials involving crossed reactions (re- 
sponding with hand contralateral to ear stim- 
ulated) (Simon, 1967). Obviously, however, 
the interference observed in the present study 
(a unimanual task) cannot be explained on 
the basis of a simple isomorphic association 
between ear stimulated and ipsilateral hand. 
It may, instead, be a manifestation of the 
more basic orienting reflex (OR) which, 
in its fully developed form, can involve a 
specific molar reaction of turning toward 
the source of stimulation (Razran, 1961). 
PALMAR SWEATING:
A QUICK AND SIMPLE MEASURE 1


JAMES M. DABBS, Jr. 
University of Michigan 


JEAN E. JOHNSON
University of Wisconsin


AND HOWARD LEVENTHAL
University of Wisconsin 


Fingerprints can reveal individual active sweat glands. Fingerprinting
requires minimum equipment and does not restrict S’s freedom of
movement. Changes in number of active sweat glands are associated
with changes in subjective feelings of arousal.


A variety of physiological measures are 
used to study man’s reactions to work, stress, 
and emotionally arousing situations. Gal- 
vanic skin resistance (GSR) is often used 
to indicate anxiety or arousal, but its use 
raises several problems. The equipment is 
complex and restricts an S's freedom of 
movement unless telemetering devices are 
used, and the volume of data can overwhelm 
an unwary investigator. 

Palmar sweating is closely related to GSR 
and should provide similar information 
(Montague & Coles, 1966). Thomson and 
Sutarman (1953) developed a simple and di- 
rect measure of sweating which, however, 
has attracted little attention in the United 
States. Their technique provides a count 
of the sweat glands active in a given area 
of skin at a specific point of time. 

The count is taken from a fingerprint made 
with a moisture-repellent solution.2 The
solution is daubed on the fingertip, where 
it dries in 10-15 sec. The print is then 
lifted off with transparent tape and placed 
on a microscope slide. Under 10-15× mag-
nification active sweat glands appear as holes
along the ridges of the fingerprint, and the 
number of holes in a 4-mm. square area de- 
fines the Palmar Sweat Index (PSI). The 
technique of obtaining and scoring finger- 
prints is described in detail by Johnson and 
Dabbs (1967). Figure 1 shows fingerprints 


1 This research was supported by United States
Public Health Service Grants NU-00183, NU-00228,
and CH-00265 and by a grant from the United Health
Foundation. The research was conducted while the
authors were at Yale University.

2 The solution consists of 5 gm. of polyvinyl formal
(Formvar 15/95E; Monsanto Company, Springfield,
Mass.), 15 ml. of butyl phthalate, 20 gm. of a semi-
colloidal dispersion of graphite in trichloroethylene (dag
155; Acheson Colloids Company, Port Huron, Mich.),
and 100 ml. of ethylene dichloride. Polyvinyl formal
withdraws from moisture in the orifices of active
sweat glands, leaving holes in the print. Butyl
phthalate gives strength to the print; graphite
provides visual contrast; and ethylene dichloride is a
solvent.


from one S taken about 2 min. apart. A
change in sweating (number of open pores) 
is clearly visible. 

Fingerprinting can be repeated at intervals 
of less than 30 sec., making it possible to
follow changes in sweating within an indi- 
vidual. Large individual differences in 
amount of sweating and in variation in 
sweating make comparisons between inde- 
pendent groups of Ss difficult, although the 
φ transformation suggested by Lykken, Rose,
Luther, and Maley (1966) may control for
these individual differences. Two or three
measures in an experimental condition will 
produce relatively stable mean scores for a 
given S. 
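
A range correction of the kind attributed above to Lykken et al. (1966) can be sketched as follows. The form used here, each score expressed as a proportion of that S's own observed range, is an assumption about the transformation rather than a quotation of it; the function name and the PSI counts are hypothetical.

```python
def range_correct(scores):
    """Rescale one S's scores to 0-1 using that S's own minimum and maximum.

    Assumed form of the range correction: (x - min) / (max - min).
    """
    lo, hi = min(scores), max(scores)
    if hi == lo:
        raise ValueError("range correction is undefined when all scores are equal")
    return [(x - lo) / (hi - lo) for x in scores]

# Hypothetical PSI counts for two Ss who differ widely in overall sweating.
s1 = [10, 14, 18]
s2 = [60, 80, 100]
print(range_correct(s1))  # [0.0, 0.5, 1.0]
print(range_correct(s2))  # [0.0, 0.5, 1.0]
```

After correction the two Ss show the same pattern of change even though their raw counts differ several fold, which is the sense in which the transformation may control for individual differences.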

This paper summarizes data obtained in 
three different situations. The results indi- 
cate the feasibility of using the PSI as a 
measure of arousal. 

Method and results—Measures were ob- 
tained from nine volunteers (four male and 
five female) flying in groups of three in a 
light aircraft. Flying was expected to arouse 
strong affect and cause increases in sweat- 
ing. Each flight lasted 1 hr. and followed 
a set procedure with no abrupt or extreme 
maneuvers. None of the Ss were pilots and
none had flown before with the pilot. A 
female S in each group acted as E, taking 
fingerprints on herself and the other Ss on 
signal from the pilot. Twenty-two prints 
were taken on each S: 3 in the aircraft be- 
fore takeoff, 11 in flight, 3 after landing, 
2 after leaving the aircraft, and 3 on a 
later date away from the airport.

Mean PSI scores are shown in Fig. 2. 
A Treatment × Ss analysis of variance in-
dicated highly significant differences across
the points in time, F (21, 157) = 6.18, p <
.001. Sweating in flight was significantly
greater than sweating prior to takeoff,
Fcontrast (1, 8) = 7.16, p < .05.




After the flight Ss reported on their feel- 
ings during the flight and during the day 
as a whole. Reports were made with a mood 
check list of 67 adjectives (Helmreich & 
Radloff, 1968) indicating anger, happiness, 
fear, depression, arousal, and lethargy. The 
Ss were assigned scores to represent dif- 
ferences between their moods during the 
flight and during the day as a whole. 

These mood data are summarized in Table 
1. The Ss increased in happiness, fear, and 
arousal and decreased in anger, depression, 
and lethargy during the flight. Increases in 
sweating from before takeoff to during the 
flight were correlated with increases in hap- 
piness and arousal and with decreases in 
anger and depression. 

The question immediately arises as to 
whether sweating and moods are associated 
under other circumstances. Forty adult pa- 
tients (12 male and 28 female) undergoing 
major elective general surgery each com- 
pleted a mood check list on the evening be- 
fore surgery and again on the fifth or sixth 
postoperative evening. Two PSI measures 
were taken each time, and mood and sweat
changes from before to after surgery were
computed. PSI scores were also trans-
formed to φ scores (Lykken et al., 1966), and
mean PSI φ changes from before to after
surgery were computed.


Fig. 1. Fingerprints taken with the PSI technique,
showing active sweat glands as open pores. (The left
figure shows closed pores and the right figure shows
open pores.)

The patients decreased in fear and arousal 
and increased in happiness and lethargy 
from before to after surgery (Table 1). 
Sweating was associated with feelings of 
arousal, as in the flying study, but with none 
of the other moods. The change in sweat- 
ing from before to after surgery was slight 
and nonsignificant. 

Pupil size has been reported to increase 
with mental activity (Kahneman & Beatty, 
1967), and it seemed likely that palmar 
sweating, similarly activated by the sym- 
pathetic nervous system, would also increase 
with mental activity. Eight female Ss were 
given the task of identifying words they 
heard spelled backwards. Three and five 
letter words provided two levels of task 
difficulty. Experimental sessions consisted 
of an initial rest period, four task periods, 
and a final rest period. Each period lasted 
3 min., during which time three PSI meas- 
ures were taken. Half of the Ss were run 
through each of two task sequences, 3-5-3-5 
letters and 5-3-5-3 letters.

Mean PSI scores for the six periods were 
62, 75, 68, 63, 56, 53. This pattern shows 
an increase in sweating with the first task, 
followed by a steady drop to the end of 
the session. The three- and five-letter tasks,
which are subjectively quite different in dif-
ficulty, did not cause differences in sweating.
Fig. 2. Changes in mean number of active sweat glands while flying. (Measures were taken in the
aircraft, after leaving the aircraft, and on a later date. Flight time was about 1 hr.)


Discussion—Palmar sweating appears to 
increase with feelings of arousal. Other 
studies report decreases in sweating prior 
to surgery (Harrison, MacKinnon, & Monk- 
Jones, 1962), during self-imposed physical 
pain (Harrison, 1964), and while working 
multiplication problems (Johnson & Dabbs, 
1967). These situations all tend to direct 
an individual’s attention toward his own 
thoughts and feelings and away from the
activity around him. Perhaps sweating is
associated with readiness to interact with 
the environment, as well as with subjective 
feelings of arousal. 

The principal virtue of the PSI technique 
lies in its simplicity. Materials for taking 
prints can be carried in one’s pockets. The
measure could be useful in a variety of
situations where more sophisticated equip-
ment is not available or not convenient.


TABLE 1

MOOD CHANGES, AND CORRELATIONS BETWEEN MOOD AND SWEAT CHANGES, AMONG
NINE FLYING Ss AND 40 SURGICAL PATIENTS

                      Flying Study                               Surgery Study

             Mean Change       Correlation       Mean Change      Correlation       Correlation
             from Day-as-      between Mood      from before to   between Mood      between Mood
             Whole to Flight   Change and PSI    after Surgery    Change and PSI    Change and PSI
Mood                           Change                             Change            φ Change

Anger           -3.6*            -.70*           [illegible]        -.16              -.30
Happiness        8.4           [illegible]       [illegible]      [illegible]          .10
Fear          [illegible]      [illegible]       [illegible]        -.20              -.15
Depression    [illegible]      [illegible]       [illegible]      [illegible]       [illegible]
Arousal          3.4           [illegible]       [illegible]      [illegible]       [illegible]
Lethargy        -2.6*          [illegible]       [illegible]      [illegible]       [illegible]

Note. Units are not directly comparable between the two studies because different forms of the
mood check list were used. To compute the correlations in the final column, raw PSI scores were transformed
to φ scores (Lykken et al., 1966) to correct for individual differences in range.
* p < .05.
** p < .01.
*** p < .001.
READAPTATION AND DECAY AFTER EXPOSURE TO
OPTICAL TILT 


SHELDON M. EBENHOLTZ
University of Wisconsin 


In Exp. I 16 Ss were exposed initially to a 15° optical tilt (T) for a
½-hr. period. During the 2nd ½-hr. interval 8 Ss (Group O) viewed
the normal environment without prisms and with both eyes open. The
remaining 8 Ss (Group C) were guided over the same path as those of
Group O, but with both eyes covered by blindfolds. Exp. II differed
in that T was 30° and the transfer period for Ss of Group O required
monocular observation with T at 0°. In both experiments Group O
readapted to the normal environment at about the same rate as during
initial adaptation. Group C exhibited decay of adaptation but at a rate
significantly slower than that of readaptation.


The assumption that adaptation and per- 
ceptual development share many common 
processes (Held & Bossom, 1961) carries 
the implication that the “normal” preadapta- 
tion state is itself a state of adaptation 
(Ebenholtz, 1967). In the context of adapta-
tion to optical tilt, the normal state there-
fore may be regarded as one in which S is
exposed to a tilt of 0°. It follows that ex- 
posure to the normal environment, after a 
period of adaptation, should yield a growth 
curve similar to that obtained during the 
initial adaptation period, but in the opposite 
direction. Thus S may be expected to adapt 
and then to readapt to the normal environ- 
ment, both at the same rate. There is, how- 
ever, the possibility that transfer to a tilt 
of 0° produces not readaptation but rather 
a decay or simply forgetting of the previous 
adaptation. In the present study the rates 
of readaptation and decay were compared. 

Method—In Exp. I two groups of eight 
female Ss each were exposed monocularly 

1 This research was supported by Grant MH-13006-02
from the National Institute of Mental Health, United
States Public Health Service.


to an optical tilt of 15° for a period of 30 
min. Throughout this interval Ss walked 
back and forth along a corridor. Only the 
right eye was used and the clockwise (CW) 
or counterclockwise (CCW) direction of tilt 
was counterbalanced over Ss within each 
group. The level of adaptation (LA) was 
determined by having S set a 12-in. luminous
line to the apparent vertical in an otherwise 
light-free room. The Ss viewed the line 
monocularly at a distance of 5 ft. without 
prisms and with the head stabilized in a chin 
and forehead rest. Four measures of the 
upright were taken, with the rod starting 
position counterbalanced over CW and CCW 
positions, immediately prior to the adapta- 
tion period and at 15-min. periods thereafter 
for a total of 1 hr. The LA was defined in 
terms of the difference between the means 
of the pre- and postadaptation settings. 
After the initial ½-hr. period there was
an additional ½-hr. interval in which Ss of
Group O continued to move through the cor- 
ridor but with prisms removed and both eyes 
open. The Ss of Group C were guided over 
the same path as that of Group O with their
eyes covered by blindfolds. 
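
The level of adaptation defined above, the difference between the means of the pre- and postadaptation settings, can be written out as a short calculation. The sketch assumes that settings are recorded in degrees from true vertical and signed so that a shift in the direction of the prism tilt is positive; the function name and the example settings are hypothetical.

```python
from statistics import mean

def level_of_adaptation(pre_settings, post_settings):
    """LA in degrees: mean postadaptation setting minus mean preadaptation setting."""
    return mean(post_settings) - mean(pre_settings)

# Hypothetical settings of the luminous line, four per test as in Exp. I.
pre = [0.5, -0.5, 1.0, -1.0]
post = [4.0, 5.0, 4.5, 5.0]
print(level_of_adaptation(pre, post))  # 4.625
```

Averaging over settings that start from both CW and CCW positions, as the counterbalancing requires, keeps any starting-position bias out of the difference.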

In Exp. II Ss were exposed to a 30° 
prism tilt and two rather than four measures 
of the upright were taken at each test inter- 
val. In addition, during the transfer period, 
Ss of Group O viewed the scene monocularly 
through prisms with an optical tilt of 0°.
There were two males and six females in 
each group. All other procedures and con- 
ditions were identical with those of Exp. I, 
the major purpose of Exp. II being to
maintain the conditions of monocular ob- 
servation both during initial adaptation and 
subsequent readaptation. 

Results—The LA during acquisition and 
transfer periods of Exp. I for both groups 
is represented in Fig. 1. After 15 min. of 
exposure to an optical tilt of 15° Groups O 
and C yielded LAs of 3.4° and 3.8°, re-
spectively, t(14) = .97, p > .05. The cor-
responding values after 30 min. were 4.6°
and 4.7°, t(14) = .15, p > .05. After 15 min.
of transfer the LA for Group C dropped
to 2.6° whereas Group O yielded an LA of
1.7°. The difference was not significant,
t(14) = 1.29, p > .05. As Fig. 1 indicates,
the differences between the two transfer
conditions increased with time so that at
the final test the LAs were .6° and 2.2° for
Groups O and C, respectively, t(14) = 2.25,
p < .05. Although both Groups O and C
showed a significant decrement in LA at the 


Fig. 1. Level of adaptation as a function of
exposure time during acquisition and transfer phases
(Exp. I).


end of the transfer period, t(7) = 6.22 and
4.06, p < .01, respectively, the final LA for
Group O only, of .6°, was not significantly
different from zero, t(7) = 1.02, p > .05.

The analysis of Exp. II data revealed a 
pattern of significant (and nonsignificant) 
differences identical with that reported above, 
with the one exception that Group C yielded 
a significantly higher LA than Group O 
after 15 min. of transfer (p < .01). Thus
after 30 min. of adaptation the LAs for
Groups O and C were 6.6° and 6.8°, re-
spectively, t(14) = .183, p > .05. The com-
parable values after the 30-min. transfer
period were .2° and 2.1°, t(14) = 2.81,
p < .05. As in the case of Exp. I only the
former value failed to differ significantly
from zero, t(7) = .601, p > .05.

Discussion—The results indicate that for- 
getting and readaptation proceed at different 
rates with the latter significantly more rapid 
than the former. This, taken together with 
the fact that readaptation was completed in 
the same amount of time as that required for 
initial adaptation suggests that (a) exposure 
to the normal environment is not unique re- 
lative to the effects of exposure to other 
optical tilts, and (b) continued visual sup-
port is necessary for sustained adaptation.

The present results are not consistent with 
those of Hamilton and Bossom (1964). 
These authors failed to find a significant dif- 
ference between the effects of decay and re- 
adaptation after exposure to a laterally dis- 
placed visual field. Direct comparisons of 
the effects of exposure to the two types of 
optical transformations are needed in order 
to determine whether or not the disparate 
results reflect fundamental differences or 
merely procedural ones.


EFFECTS OF ANXIETY AND INTERFERENCE ON
SHORT-TERM MEMORY 


JOHN G. BORKOWSKI 
University of Notre Dame 


AND THOMAS MANN
Oberlin College 


Short-term recall for high- (HA) and low-anxious (LA) Ss was 
compared using a “single-item” technique designed to produce inter- 
item competition between 6 consecutive CCCCs and to evoke situa- 
tional anxiety. Results indicated that in the absence of specific, prior 
interference (PI) there was no difference as a function of anxiety 
level. As PI developed, the LA group recalled significantly more 
items than the HA group. However, the superiority of the LA group 
was not related to the absolute amount of PI. Rehearsal activity 
(i.e., digit recitation) increased and STM decreased as PI developed.
It was concluded that individual differences, such as anxiety, influence
the course of short-term memory.


Only a few studies have investigated the 
effects of motivation on short-term memory 
(STM). Kernoff, Weiner, and Morrison 
(1966) and Weiner (1966) found that when 
stimuli were associated with a monetary re- 
ward or shock, they were recalled signif- 
icantly more often at retention intervals of 
15 and 17 sec. than “neutral” stimuli. High-
arousal words have been shown to be re-
called more accurately than low-arousal
words after long retention intervals (i.e., 24
hr.), but resulted in poorer recall after short- 
term intervals (Kleinsmith & Kaplan, 1963; 
Walker & Tarte, 1963). 

The purpose of the present study was to 
investigate the effects of motivation, as de- 
fined by manifest anxiety, on STM. Spe- 
cifically, this study attempted to determine 
the manner in which anxiety interacted with 
the development of proactive interference 
(PI) to influence the course of STM. 
Spence (1960) has used overt anxiety, as 
indexed by responses on the Taylor Mani- 
fest Anxiety Scale, to manipulate drive level 
in conditioning and verbal learning contexts. 
In general, high-anxious (HA) groups have 
demonstrated more rapid eyelid conditioning 
than low-anxious (LA) groups and also 
acquired “noncompetitive,” paired-associate 
lists more efficiently than LA Ss. When 
competition characterized list construction 
(i.e., strong interpair associations), HA Ss
required more trials to reach criterion than
LA Ss because of increased frustration asso-
ciated with interfering response tendencies.

The present investigation employed a 
“single-item” technique similar to that used 
by Keppel and Underwood (1962). The 
procedure as well as list construction were 
designed to generate potential interference 
and situational anxiety. In such a context, 
the recall of HA and LA groups was con- 
trasted in the presence and absence of PI. 

Method—A total of 330 introductory psy- 
chology students at Oberlin College were 
given the Taylor Manifest Anxiety Scale 
(Taylor, 1953). The mean score for the 
entire sample was 17.4 with a standard devia- 
tion of 7.5. 

The HA group (N=25) had a mean 
anxiety score of 29.6 and the LA group 
(N=25), 7.1. The groups were matched 
in terms of sex and achievement as measured 
by SAT scores. The mean verbal and 
quantitative score for the HA group was 
654, and for the LA group 660; the differ- 
ence between these means was not significant. 
The matching procedure was used because 
previous data have indicated that level of 
intelligence is related to potential susceptibil- 
ity to interference effects in STM (Borkow- 
ski, 1965). It should be noted that all Ss 
seemed unaware of the basis for their se- 
lection when contacted for the experiment 
proper some 2 mo. after responding to the 
anxiety scale.

A short period of counting activity (15 
sec.) was included in the instructions. The
main task consisted of the presentation of
six CCCCs on a Stowe memory drum. The
presentation duration for each CCCC was
3 sec., the retention intervals were 15 sec.,
and the recall periods 6 sec. During the re- 
tention intervals, rehearsal was controlled 
by instructing Ss to read digits as rapidly as 
possible to the beat of a metronome (3.6 
beats/sec). This rate was slightly faster 
than Ss’ recitation abilities and hence was 
assumed to generate situational anxiety. In 
order to produce interitem interference, the 
specific materials (GPBH, XCHD, DLFM, 
MQWG, LWZS, BSJC) were constructed 
according to the following rules: (a) one 
consonant from each item occurred in a dif- 
ferent position in the immediately succeeding 
item, and (b) one consonant was distributed 
in a new position for one of the nonadjacent 
items. Order of presentation was varied 
for each group to equate item difficulty 
across the six presentation positions. 
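The two construction rules can be checked mechanically against the six items listed above. The sketch below is illustrative only: it assumes that "immediately succeeding" refers to the order in which the items are listed (presentation order itself was varied across Ss), and it reads rule (b) as requiring at least one shared consonant, in a new position, with some nonadjacent item.

```python
ITEMS = ["GPBH", "XCHD", "DLFM", "MQWG", "LWZS", "BSJC"]


def shared_moved(a, b):
    """Letters that occur in both items but in different positions."""
    return sorted(ch for ch in set(a) & set(b) if a.index(ch) != b.index(ch))


def check_rule_a(items):
    """Rule (a): moved letters shared by each item and its successor."""
    return [shared_moved(a, b) for a, b in zip(items, items[1:])]


def check_rule_b(items):
    """Rule (b): moved letters shared by each item and any nonadjacent item."""
    results = []
    for i, a in enumerate(items):
        hits = set()
        for j, b in enumerate(items):
            if abs(i - j) > 1:  # skip the item itself and its neighbours
                hits.update(shared_moved(a, b))
        results.append(sorted(hits))
    return results


print(check_rule_a(ITEMS))
print(check_rule_b(ITEMS))
```

Under these assumptions every adjacent pair shares exactly one repositioned consonant, and every item shares at least one repositioned consonant with a nonadjacent item.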
Results and discussion—The proportion 
of letters correctly recalled (responses were 
scored as correct even if given in an order 
different from the order of presentation) as 
a function of anxiety level and amount of 
prior interference is presented in Fig. 1. 
An inspection of these data shows that as 
PI increased, STM declined. Furthermore, 
in the absence of specific PI (i.e., zero
previous items) the short-term recall for
the HA group was not significantly different
from that for the LA group. However, as
PI developed, high anxiety resulted in a 
lower level of recall. The average superior- 
ity in recall over Items 2-6 for the LA Ss 
was 14%; since the differences in retention 
between HA and LA groups remained rela- 
tively constant as PI developed, the data 
were collapsed over Items 2-6. The subse-
quent analysis showed that recall for LA
Ss was superior to that for HA, t(48) =
3.03, p < .01. There were no significant dif-
ferences in the number of misplaced re-
sponses (i.e., an incorrect letter which had
been presented in a previous item) at each
level of PI for HA and LA groups. These
results suggest that, in the presence of spe-
cific interference, the STM of HA individ-
uals was inferior to that of LA Ss when
the two groups were matched in terms of
intellectual achievement. However, in the
absence of PI, there was no significant dif- 
ference as a function of anxiety level 
although recall for the HA group was 
slightly superior to that for the LA group. 
Based on Spence’s (1960) results with com- 
petitive and noncompetitive, paired-associate 
tasks, high anxiety might be expected to fa- 
cilitate STM when interitem competition is 
minimal, when Ss are thoroughly adapted 
to the experimental task with extensive
pretraining, or on the initial item of any list
where “specific” PI is minimal or absent.
With respect to the last alternative, the high 
level of recall at zero PI (88%) might well 
have masked the potential superiority of the 
HA group. 
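The scoring rule described above, in which a letter counts as correct regardless of the position in which it is reported, can be sketched as follows. The function names and the example responses are hypothetical; the source reports only group proportions.

```python
from collections import Counter


def letters_correct(presented, response):
    """Count presented letters that appear in the response, ignoring order."""
    overlap = Counter(presented.upper()) & Counter(response.upper())
    return sum(overlap.values())


def proportion_correct(presented, response):
    """Proportion of the presented letters that were recalled."""
    return letters_correct(presented, response) / len(presented)


# Hypothetical responses to the item GPBH.
print(proportion_correct("GPBH", "HGPB"))  # all four letters, wrong order
print(proportion_correct("GPBH", "GXBH"))  # one intrusion
```

Using a multiset intersection keeps a repeated response letter from being credited twice.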

Analyses of counting activity showed no 
differences between HA and LA groups at 
each of the presentation positions. How- 
ever, at Position 1, the mean number of 
digits recited was 36 compared to 41 digits 
during the last retention interval. There 
was significantly more recitation during the 
last retention interval than during the first, 
t(49) = 7.37, p < .01, 49 of 50 Ss reciting
more digits during the last interval. These 
results are interesting in light of Murdock’s 
(1966) recent data which showed that as 
PI developed STM declined, but recall of 
the interpolated material improved. It ap- 
pears that at least a portion of the decline 
in STM associated with the development of 
PI in the single-item technique is attrib- 
utable to increased attention to the rehearsal 
task.
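The recitation counts can be set against the metronome pace given in the Method section. The following is a minimal worked calculation using only figures quoted in the text (3.6 beats/sec., 15-sec. retention intervals, 36 and 41 digits); the variable names are illustrative.

```python
METRONOME_RATE = 3.6      # beats per second
RETENTION_INTERVAL = 15   # seconds

digits_paced = METRONOME_RATE * RETENTION_INTERVAL  # digits needed to keep pace
digits_first, digits_last = 36, 41                  # mean digits actually recited

rate_first = digits_first / RETENTION_INTERVAL
rate_last = digits_last / RETENTION_INTERVAL

print(round(digits_paced), round(rate_first, 2), round(rate_last, 2))
```

Both observed rates fall below the metronome pace, which agrees with the statement that the pace exceeded Ss' recitation abilities.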

Evidence is accumulating showing that in- 
dividual differences, such as anxiety and 
intelligence (cf. Borkowski, 1965), often 
interact with task-specific interference to 
influence the course of STM.


TACHISTOSCOPIC RECOGNITION THRESHOLDS AS A
FUNCTION OF AROUSAL LEVEL¹


GARY W. R. PATTON²
Tufts University


This experiment attempted to test the hypothesis that recognition 
thresholds are related to autonomic activity in a curvilinear manner. 
According to this hypothesis, lowest thresholds are associated with 
moderate sympathetic nervous system excitation, while higher thres- 
holds are linked with either low or high extremes of sympathetic ac- 
tivity. To test this proposition, tachistoscopic recognition thresholds 
for 4-digit numbers were determined for 24 adult male Ss during 
counterbalanced minimal, moderate, and high states of sympathetic
excitation. Recognition thresholds were found to be significantly
higher during the high levels of autonomic activity than during the
minimal or moderate autonomic arousal conditions. When allowance
was made for the fact that no truly low arousal state was achieved
in this experiment, the data were explicable in terms of the predicted
curvilinear relationship.


Hebb (1955) proposed that various forms 
of human behavior are optimal during mod- 
erate levels of cortical activation, but that 
performance is degraded by either high or 
low extremes of activation. Since sympathe- 
tic nervous system arousal has been found 
to be correlated with cortical activation 
(Berlyne, 1960), the predicted curvilinear 
relationship between performance and corti- 
cal activation should likewise hold for per- 
formance and sympathetic arousal. 

As noted by Obrist (1962), the curvilinear 
hypothesis relating arousal to behavior has 
been partially or completely validated for 
motor performance and verbal learning. 
However, no previous experiment employ-
ing human Ss has been specifically designed
to determine whether the relationship be-
tween sensory thresholds and arousal is
curvilinear.

1 This study is based on a portion of a dissertatio:

Thus, the goal of the experiment reported 
here was to test the applicability of Hebb’s 
curvilinear hypothesis to sensory threshold 
data by (a) inducing minimal-, moderate-, 
and high-arousal levels in adult male Ss, 
(b) validating the arousal level shifts with 
autonomic recordings, and (c) measuring 
concurrent changes in recognition thresholds. 

Method.—Three separate 1-hr. experi- 
mental sessions designed to produce high, 
moderate, and minimal arousal, respectively, 
were run on 3 consecutive days. The order 
and sequence in which the three conditions 
were presented was counterbalanced among 
Ss so that presentation schedules would not 
affect the results. Twenty-four paid male
undergraduate students from Tufts Univer-
sity were employed as Ss in this study.
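One way to counterbalance three sessions over 24 Ss is to use every possible order equally often. The sketch below assumes full counterbalancing (six orders, four Ss per order); the source says only that order and sequence were counterbalanced, so the exact scheme and the function name are assumptions.

```python
from collections import Counter
from itertools import permutations

CONDITIONS = ("minimal", "moderate", "high")


def assign_orders(n_subjects):
    """Map subject index to a session order, using each order equally often."""
    orders = list(permutations(CONDITIONS))
    if n_subjects % len(orders) != 0:
        raise ValueError("n_subjects must be a multiple of the number of orders")
    return {s: orders[s % len(orders)] for s in range(n_subjects)}


schedule = assign_orders(24)
print(Counter(schedule.values()))
```

With 24 Ss each condition then falls on each of the three days for exactly 8 Ss.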

During each of the sessions, Ss identi-
fied 15 four-digit numbers which were pre-
sented according to the ascending method of 
limits on a Gerbrands tachistoscope. Five 
of these numbers were not scored: the first 
2, which were used to eliminate warm-up 
effects, and the third, fifth, and eleventh 
numbers which, as will be shown, were used 
to produce varying arousal levels. Of the 
10 scored numbers, 5 were made readily 
identifiable by presenting them on clear plas- 
tic slides. The remaining 5 scored numbers 
were made more difficult by placing sheets 
of translucent tissue over the numbers so 
that they would be slightly blurred when 
viewed by Ss. 
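The split between scored and unscored presentations follows directly from the serial positions named above. A brief sketch, with illustrative variable names:

```python
N_NUMBERS = 15
WARM_UP = {1, 2}           # first two presentations, not scored
AROUSAL_ITEMS = {3, 5, 11}  # used to manipulate arousal, not scored

unscored = WARM_UP | AROUSAL_ITEMS
scored = [pos for pos in range(1, N_NUMBERS + 1) if pos not in unscored]

print(scored)
```

The ten remaining positions are the scored numbers; the text does not say which of them carried the clear and which the blurred slides, so that assignment is not modeled here.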

High autonomic arousal was produced by 
telling S that he could win a $5.00 reward if 
he could do 10% better than his previous 
best performance, but that he would receive 
an electric shock on the second and third 
fingers of the left hand if he took too long 
to identify a number. Degree of stress was 
controlled among Ss by purposely making 
the third, fifth, and eleventh numbers more 
difficult than the rest and by applying a brief 
mild (4.50 ma.) electric shock from a Phy- 
siological Electronics Model 3901 inductor- 
ium to all Ss when these numbers were pre- 
sented. Because of the distracting effect 
of the pain, shock was never administered 
during presentation of the 10 remaining 
scored numbers. Thus, data were collected 
only when Ss were working under threat 
of electric shock, not under shock itself. 

Moderate arousal was generated simply 
by telling S that he could win a $.10 bonus 
by bettering his previous performance. Dur- 
ing the minimal-arousal session, to get S 
to relax and, thereby, facilitate production of 
the lowest possible levels of autonomic 
arousal, S was deceived into thinking that 
he was helping E repair defective equipment 
and, consequently, his responses to tachisto- 
scopically presented numbers would not be 
recorded. To add realism to the setting, 
the third, fifth, and eleventh numbers were 
given abnormally long, 10-sec. exposures to 
convince S that the apparatus really was
malfunctioning. 

Three variables which reflect autonomic 
activity, heart rate, respiration rate, and 
palmar skin conductance, were recorded con- 
tinuously during all experimental sessions, 
and a 5-min. sample of pretest resting ac- 
tivity was taken prior to each session. Heart 
rate and respiration rate were measured on 
a Grass Model 6 electroencephalograph,
while palmar skin conductance was recorded
by a Hunter GSR amplifier, employing a
palm-dorsum placement of zinc electrodes
on S’s hand.

Results—To determine whether (a) the 
high-arousal session really did produce high- 
est levels of autonomic function, and (b) 
the minimal-arousal condition produced least 
evidence of sympathetic nervous system ac- 
tivity, two separate statistical analyses were 
undertaken. First, percentage changes in 
respiration rate, heart rate, and palmar skin 
conductance from the pretest rest period 
to subsequent test sessions were compared by 
analysis of variance (Lindquist, 1956). The 
upper portion of Table 1 contains the per- 
centage change in each physiological re- 
cording during each condition and indicates 
which differences were significant. Both 
the respiration rate data and the palmar 
skin conductance data fully supported pre- 
experimental assumptions regarding the rela- 
tive arousing effect of each experimental 
condition, while the heart rate data partially 
confirmed these assumptions. 
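The first analysis rests on percentage change from the pretest rest period. The source does not print the formula, so the conventional definition below is an assumption, and the example values are hypothetical rather than data from the study.

```python
def percent_change(rest, test):
    """Percentage change of a test-session value relative to the resting value."""
    if rest == 0:
        raise ValueError("resting value must be nonzero")
    return 100.0 * (test - rest) / rest


# Hypothetical respiration rates (breaths per minute) for one S.
print(percent_change(16.0, 18.0))
```

Expressing each S's session value relative to his own resting level removes stable individual differences in baseline before the conditions are compared.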

The second method of analyzing physio- 
logical reactions to the three experimental 
conditions involved comparisons of absolute 
values of heart rate, respiration rate, and 
palmar skin conductance recorded during 
each session, again by analysis of variance 
(Lindquist, 1956). The middle portion of 
Table 1 lists the absolute values of each 
physiological recording during each arousal 
condition and indicates which differences 
were significant. The respiration rate data 
completely confirmed expectations about the 
level of autonomic activity produced by each 
experimental condition. However, though
the differences between the heart rate and 
palmar skin conductance data generally were 
in the expected directions, the differences 
between the sessions were not large enough 
to attain significance.

The lower portion of Table 1 displays the 
mean recognition threshold times for the 
easy and difficult four-digit numbers during 
the minimal-, moderate-, and high-arousal 
sessions. An analysis of variance (Lind- 
quist, 1956) indicated that changes in arousal 
level produced significant shifts in recogni- 
tion thresholds, F (2, 36) = 4.32, p<.025. 
As can be seen in the lower portion of 
Table 1, increases in autonomic arousal were 
associated with corresponding increases in 
recognition thresholds. Also shown in this 
section of Table 1 are the significance levels 
produced when the differences between pairs 
of arousal conditions were compared by 
t tests. The recognition thresholds during
the high-arousal condition were significantly
higher than during either the minimal- or
moderate-arousal conditions; however, thres-
hold times recorded during the latter two
conditions did not differ significantly.

TABLE 1

Autonomic Activity Levels and Recognition Threshold Values During High,
Moderate, and Minimal Arousal Sessions

                                                Arousal session
                                     Minimal         Moderate            High
Dependent variable                  M      SD       M       SD        M       SD

Percentage change in autonomic activity
  Respiration rate                -0.25   8.81    +6.34   15.59    +12.00   19.68
  Heart rate (b, c)                1.26   4.78     0.17    5.88     +4.50    8.57
  Palmar skin conductance (a, b, c)
                                  11.85  40.56   +25.57   94.02   +116.84  191.90
Absolute level of autonomic activity
  Respiration rate                16.91   2.33    17.87    2.76     18.84    2.43
  Heart rate                      79.38  11.80    83.33   10.31     83.33   11.25
  Palmar skin conductance         45.06  25.89    58.56   65.73     66.78   53.44
Recognition thresholds
  Easy numbers                    43.30   6.30    44.00    8.60     45.60   10.50
  Difficult numbers               71.50  11.10    72.50   12.20     76.30   11.60

a p of difference between minimal and moderate conditions < .05.
b p of difference between minimal and high conditions < .05.
c p of difference between moderate and high conditions < .05.

Discussion—The absolute values of the 
physiological data shown in the middle por- 
tion of Table 1 provide a clue as to why a 
linear relation, rather than the predicted 
curvilinear one, obtained between recognition 
thresholds and arousal level. Specifically, 
the data show that even during the minimal- 
arousal session, S was experiencing an aver- 
age or slightly above average level of sym-
pathetic activity. For example, a medical
examination can be expected to produce an 
approximately normal level of autonomic 
activity, although even this relatively innocu- 
ous situation is a mild stressor for some 
people (Adams, 1958). During an exami- 
nation, a physician expects to find a heart 
rate of 60-80 beats per minute in a sedentary 
adult male and a respiration rate of 14-18 
breaths per minute (Adams, 1958). The 
heart rate of 79.38 beats per minute recorded 
during the minimal arousal session and the 
respiration rate of 16.91 breaths per minute 
indicate that Ss were not in the low range 
of autonomic activity. Thus, a truly mini- 
mum level of sympathetic arousal, such as 
might occur during drowsiness when the 
heart slows down to between 50 and 70 beats 
per minute, was not achieved in this experi- 
ment. 
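The comparison with clinical norms can be made explicit. The sketch below uses the ranges and session means quoted in this paragraph; the helper function is illustrative and expresses each mean as a position within the quoted normal range (0 = lower bound, 1 = upper bound).

```python
NORMAL_RANGE = {"heart_rate": (60, 80), "respiration_rate": (14, 18)}
MINIMAL_SESSION_MEAN = {"heart_rate": 79.38, "respiration_rate": 16.91}


def position_in_range(value, low, high):
    """Relative position of a value within [low, high]."""
    return (value - low) / (high - low)


for measure, (low, high) in NORMAL_RANGE.items():
    pos = position_in_range(MINIMAL_SESSION_MEAN[measure], low, high)
    print(measure, round(pos, 2))
```

Both means sit in the upper half of the normal range, and the heart rate lies above the 50-70 range cited for drowsiness.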

If a truly minimal level of sympathetic
activity was not achieved, then the three
levels produced were probably (a) normal, 
(b) above average, and (c) high levels of 
arousal. In other words, only the normal- 
to-high arousal range was covered in this 
experiment. In accordance with the hypo- 
thesis that recognition thresholds are related 
to arousal level in a curvilinear fashion, 
lowest thresholds should have coincided with 
normal arousal levels and increases in arou- 
sal should have produced progressive in- 
creases in recognition thresholds. The re- 
corded thresholds followed this pattern ex- 
actly. 
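The claimed pattern, thresholds rising with each step in arousal, can be verified against the means in the lower portion of Table 1. The helper name below is illustrative.

```python
# Mean recognition thresholds from Table 1, ordered minimal, moderate, high.
THRESHOLDS = {
    "easy": (43.30, 44.00, 45.60),
    "difficult": (71.50, 72.50, 76.30),
}


def strictly_increasing(values):
    """True when every value exceeds the one before it."""
    return all(a < b for a, b in zip(values, values[1:]))


for kind, means in THRESHOLDS.items():
    steps = [round(b - a, 2) for a, b in zip(means, means[1:])]
    print(kind, strictly_increasing(means), steps)
```

For both kinds of number the moderate-to-high step is larger than the minimal-to-moderate step, consistent with the finding that only the high-arousal condition differed significantly from the other two.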

The finding that high states of sympathetic 
excitation significantly increase recognition 
thresholds provides partial confirmation of 
the more general hypothesis that human be- 
havior is optimal at moderate levels of sym- 
pathetic nervous system activity and inferior 
at either high or low extremes of excitation.


DELAY-RETENTION EFFECT AND INFORMATIVE FEEDBACK¹


PERSIS T. STURGES, EDWARD P. SARAFINO,² AND PATRICIA L. DONALDSON
Chico State College 


The delay-retention effect (DRE) was investigated when informative
feedback (IF) was a light over the correct alternative. 12 3rd-grade
children were Ss in each condition of IF delay, Immediate (0 sec.)
and Delay (10 sec.). Learning material consisted of 12 items, each
with a state as the stimulus word and 2 cities from the state as re- 
sponse alternatives. Delayed IF resulted in a significant reduction in 
errors from acquisition to relearning; immediate IF did not. The 
results were interpreted as being inconsistent with the explanation of 
the DRE suggested by Markowitz and Renner (1966). 


Markowitz and Renner (1966) investi- 
gated the delay-retention effect (DRE) by 
separating tangible reinforcement (marbles 
which were exchanged for a toy) from feed- 
back (a combination of a light over the cor- 
rect alternative and a click if the response 
was correct or a buzz if it was incorrect). 
They found the DRE when feedback con- 
sisted of the simultaneous presentation of 
tangible reinforcement and feedback as used 
by Brackbill and her associates (e.g., Brack- 
bill, Bravos, & Starr, 1962) and also when 
Ss received immediate tangible reinforcement 
and delayed feedback. They hypothesized 
that watching the light flash over the correct 
alternative helps S remember the correct re- 
sponse and suggested that when tangible re- 
inforcement and feedback are immediate, Ss 
are less likely to watch the feedback light. 

Alternative explanations for these findings 
may be advanced. For example, it is pos- 
sible that the superior retention of the group 
receiving both immediate tangible reinforce- 
ment and delayed feedback resulted from 
these Ss receiving information about the 
correct answer twice. That is, with two 
alternatives, both feedback events provide 
information about the correct response. 
Such an alternative explanation should be 
empirically verifiable. Also, it appears pos- 
sible to test Markowitz and Renner’s hypo- 
thesis that the DRE is due to the fact that 
Ss with immediate reinforcement do not 
watch the feedback light. Specifically, this
hypothesis would predict that the elimination
of tangible reinforcement should result in
no significant difference between immediate
and delayed feedback. The present study
was designed to investigate both this alterna-
tive explanation and Markowitz and Renner’s
hypothesis by using only informative feed-
back (IF)—i.e., the light over the correct
alternative.

1 This research i i

Method—Twenty-four third-grade child- 
ren were randomly assigned, with 12 Ss 
in each group, to the two conditions of IF 
delay—Immediate (0 sec.) and Delay (10
sec.). The learning material consisted of
12 word triplets, each typed on a 5 × 8 in.
card. The name of a state was the stimulus
word and two cities from that state were
below it as response alternatives, with one
of the cities the capital. The S’s task was
to learn the capital for each state. For
each successive presentation of the set of 
items, a different random order was used 
and the position of the alternatives was ran- 
domly alternated. 
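The randomization described above, a fresh item order on every presentation with the left-right position of the alternatives also randomized, might be sketched as follows. The item labels are hypothetical placeholders, since the source does not list the states or cities, and the seeded generator is used only to make the sketch reproducible.

```python
import random

# Hypothetical placeholders: the 12 states and cities are not given in the source.
ITEMS = [(f"State {i}", f"Capital {i}", f"Other city {i}") for i in range(1, 13)]


def presentation_set(items, rng):
    """Return one presentation of all items in random order and layout."""
    order = list(items)
    rng.shuffle(order)
    trials = []
    for state, capital, other in order:
        alternatives = [capital, other]
        rng.shuffle(alternatives)  # randomize left-right position
        trials.append({
            "state": state,
            "left": alternatives[0],
            "right": alternatives[1],
            "correct": capital,
        })
    return trials


rng = random.Random(0)
first_set = presentation_set(ITEMS, rng)
second_set = presentation_set(ITEMS, rng)
print(first_set[0])
```

Randomizing position independently on each trial keeps the side of the correct lever from becoming a cue.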

The apparatus was an upright piece of 
plywood, 15 × 27 in., in the center of which
was an opening for the stimulus cards;
2 in. below the opening were two levers, 
one below each response alternative on the 
cards; and 2 in. above the opening were 
two lights, one above each response alterna- 
tive. Depression of either lever activated 
a preset electronic timer and the light flashed 
on above the correct alternative after the 
appropriate delay interval (0 or 10 sec.). 
A second timer controlled the intertrial in- 
terval. 

The procedure was identical to that used 
by Brackbill with the following exceptions:
acquisition was completed in one session,
relearning was conducted after 7 days, and,
except for the first presentation set of items 
in which E first read the state and the cities 
aloud and then S read them aloud, S merely 
pressed the lever beneath his choice. 
Results—Number of trials and errors to
criterion at acquisition and relearning were 
computed. Since analyses of trials and 
errors were essentially the same, only the 
error data are reported. An analysis of 
variance indicated significantly fewer errors 
at relearning than at acquisition, F (1, 22) 
=6.95, p<.025, and also a significant 
Acquisition-Relearning × Immediate-Delay
interaction, F (1, 22) =6.09, p<.025 (see 
Fig. 1). Pair-wise comparisons with the 
Scheffé test were computed on these data. 
The difference between immediate and de- 
layed IF is not significant either for errors 
at acquisition or for errors at relearning. 
However, the reduction in errors from
acquisition to relearning (a savings score)
is significant for delayed IF, p < .05, but
not for immediate IF. In the four-mean
Scheffé comparison, the savings from acqui-
sition to relearning with delayed IF is
significantly greater than that with imme-
diate IF, p < .025.

Fig. 1. Mean errors at acquisition and relearning
for groups with immediate (0 sec.) and delayed
(10 sec.) IF during acquisition.

Discussion—In the present study delayed 
IF during acquisition resulted in superior 
savings at relearning when the only feed- 
back was information about the correct 
alternative. These findings support Marko- 
witz and Renner’s conclusion that the DRE 
is primarily a function of feedback rather 
than tangible reinforcement. Thus, the su- 
perior retention by the group receiving 
immediate tangible reinforcement and de- 
layed feedback was not due solely to the 
double information provided about the cor- 
rect alternative. The current results do not, 
however, confirm Markowitz and Renner’s 
hypothesis that the DRE is due to attention 
to the feedback light by Ss receiving de- 
layed IF. In the present study both groups 
had to watch the light in order to learn 
the correct alternative, and yet the DRE 
still occurred. An adequate explanation 
seems to require further investigation of 
factors affecting the DRE.


THE ASSOCIABILITY OF CVC PAIRS¹


WILLIAM E. MONTAGUE AND HAROLD O. KIESS²
University of Illinois, Urbana


In prior experiments S-generated associative devices or natural language 
mediators (NLMs) linking pairs of items have been shown to facilitate 
acquisition of paired associates. Since Ss are questioned about NLMs 
after learning, such reports may be a result of the questioning. There- 
fore, to obtain an a priori estimate of NLM probability this research was 
undertaken. Several hundred pairs, each composed of CVCs of about 
equal association value (AV), were shown for 15 sec. while Ss wrote down 
any NLM they could generate which linked both the stimulus and re- 
sponse. The AV level was varied between pairs. The proportion of Ss 
able to generate a NLM is the associability value (AS). As expected, 
AS and AV are correlated although AS varies considerably among pairs 
composed of items about equal in AV. Experiments run after the AS 
scale was obtained demonstrated that AS is valuable as a predictor of 
learning rate. In addition, AS values were highly correlated with the 
frequency of NLMs in postexperiment reports. It was concluded that 
the AS measure represents a valuable addition to our understanding of
the complexity of verbal learning.


1 This research was supported primarily by
the United States Office of Education via Con-
tract No. OEC 3-6-058375-0612, in part by
both the Joint Services Electronics Program
under Contracts No. DA 28-943, AMC0073
(E), and Office of Naval Research Nonr
3985(08), and in part by a grant from the
University Research Board. Successful ac-
complishment of the research would have
been impossible without the assistance of
Alexander Wearing, Rosemary Wearing, Clin-
ton Walker, and Geri Kelly.

2 Current address: Psychology Branch
Pioneering Research Division, United States
Army Natick Laboratories, Natick, Massa-
chusetts 01760.


I. ASSOCIABILITY NORMS


Meaningfulness of verbal material
has been recognized as an important
variable in paired-associate (PA)
verbal learning and retention since
Ebbinghaus’ pioneering work in 1885.
In fact, meaningfulness has been
found to be the most powerful variable
influencing speed of acquisition in
verbal learning and has been the sub-
ject of a considerable amount of re-
search in recent years. Simply, the
effect of meaningfulness on learning
rate can be seen in PA learning experi-
ments where a pair like RIQ-KIV takes
considerably longer to learn than a
pair like TEL-COM. Since TEL and
COM are items to which Ss can give a
relatively high number of associations,
they are considered more meaningful.
Meaningfulness, or association value
(AV), defined either in terms of the
number of associations given in a
limited time period or the proportion 
of Ss giving an association to each 
verbal unit, has proved effective in 
influencing the rate of learning in 
many experiments (Goss & Nodine, 
1965; Underwood & Schulz, 1960). 
Although there is no doubt that 
meaningfulness is a major variable, a 
number of results indicate that it can- 
not account for all the important phe- 
nomena. For example, in a number 
of experiments involved in the recent 
controversy over “one-trial” or “all-
or-none” learning, the rate of learning
for pairs composed of items of equi- 
valent meaningfulness differed con- 
siderably (Underwood & Keppel, 
1962). Such differences between 
pairs equated for meaningfulness may 
merely reflect the unreliability of the 
meaningfulness measure. However, 
it seems more likely that they are based on
the unique relationship between pair 
items. The associability or encode- 
ability of a pair is probably not based 
in any simple fashion upon the mean- 
ingfulness of items comprising the 
pair. 
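The proportion-based definition of association value mentioned above (the proportion of Ss giving an association to a verbal unit) reduces to a simple count. The sketch below uses invented response records for two of the CVCs named in the text; the numbers are hypothetical and are not norms from any published scale.

```python
def proportion_responding(responses):
    """Proportion of Ss who produced an association (True) for one item."""
    if not responses:
        raise ValueError("at least one response record is required")
    return sum(1 for r in responses if r) / len(responses)


# Hypothetical records for 10 Ss: True = S gave an association in the time allowed.
hypothetical_records = {
    "TEL": [True] * 9 + [False] * 1,
    "RIQ": [True] * 2 + [False] * 8,
}

for cvc, records in hypothetical_records.items():
    print(cvc, proportion_responding(records))
```

The same computation, applied to pairs rather than single items, would give a pair-level proportion of the kind the authors argue is needed.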
In several recent experiments, dur-
ing postexperiment questioning, Ss
reported using various means such as 
sentences or words to associate or link 
items together. The ubiquity of such 
reports by the authors’ Ss and by 
other experimenters’ Ss (e.g., Bugel- 
ski, 1962; Clark, Lansford, & Dallen- 
bach, 1960; Runquist & Farley, 1964; 
Underwood, 1964, 1965; Underwood 
& Schulz, 1960) points to the need for 
systematic investigation of this media- 
tion. From the reports it can be seen 
that Ss use various means of learning 
the pairs (Martin, Boersma, & Cox, 
1965) and, in addition, certain pairings 
are easier because Ss find them easier 
to associate. A single word may be
used to link two items, or a phrase or 
sentence may be generated which in-
cludes the items. At other times Ss
may learn a pair by recognizing that 
the items sound alike when pro- 
nounced or they may encode (trans- 
form) the items into words. These 
techniques are referred to as natural 
language mediators (NLMs) and they 
have been found to be important in 
learning and recall (Adams & Monta- 
gue, 1967; Kiess & Montague, 1965; 
Montague, Adams, & Kiess, 1966; 
Montague & Wearing, 1967a, 1967b). 
This research has shown that S-gen- 
erated associations between items, 
not the meaningfulness of individual 
items per se, produce superior reten- 
tion in comparison to instances where 
such associations are not present. 
Unfortunately, knowledge about 
NLMs is generally obtained by means of an in-
terview or questionnaire given on com-
pletion of the experiment, asking Ss
how they went about learning each 
pair. Subjective reports of this kind 
are suspect. It is possible that Ss 
construct answers to “please” the ex- 
perimenter in accord with the de- 
mand characteristics of the experi- 
ment (Orne, 1962) so that the NLMs 
reported might not be accurate de-
scriptions of what is learned. It is
possible that some or all NLMs are 
not causal in learning, but may just be 
a correlate of the learning process 
(Adams, 1967). To ascertain their 
status in the associative process it is 
necessary that the probability of a 
NLM be manipulated independently 
of other variables known to affect 
learning rate. Therefore, some inde- 
pendent measure of the probability or 
likelihood of NLMs is necessary. 
Such a measure could be systematic- 
ally manipulated to ascertain its rela- 
tionship with other important vari- 
ables and increase our understanding 
of verbal learning. 

Undoubtedly, items high in AV 
should be easier to link than those of 
low AV. Richardson and Erlebacher
(1958) hypothesized that items high in 
AV have more associations and these 
associations can be used somehow to 
facilitate linking such pairs compared 
to pairs of lower AV. Their Ss’ rat- 
ings of pairs generally agreed with 
the hypothesis. However, it is the 
authors’ contention that AV and ease 
of linking or associability, although 
correlated, are conceptually different 
and that this difference needs exami- 
nation. Pairs of items of a given level 
of AV are liable to differ considerably 
in associability, and thereby, in ease 
of learning. To support these con- 
tentions quantitative estimates of the 
ease of associating various types of 
item pairs were obtained. The 
authors first attempted to determine 
the frequency with which Ss form 
associative connections between item 
pairs of different AV levels. The 
proportion of Ss generating a NLM 
for each pair is the scale value of as- 
sociability (AS). After the AS scale 
was obtained several experiments 
were undertaken to examine the effect 
of variation in AS on learning rate and 
retention. In addition, a subsequent 
scaling was undertaken to ascertain 
the role of stimulus and response AV 
in AS value and to relate AS value to 
the measures obtained by Richardson 
and Erlebacher (1958). In their 
study, Richardson and Erlebacher 
took pairs of words, CVCs, and con- 
sonant syllables of different levels of 
AV and had Ss rate them for ease of 
learning (EL, an estimate of how fast 
S could learn the pair) and common 
meaning (CM, degree to which pair 
items denoted the same meaning). 
These scale values should be related to 
the AS scale.

Three scalings of separate sets of
items were done (designated AS-1, 
AS-2, AS-3) which differed only in 
the sets of items used. The scaling 


procedures described for AS-1 will 
apply also for AS-2 and AS-3. 


ASSOCIABILITY 1 
Method 


Materials.—Three hundred twenty pairs of
CVCs were selected from Archer's (1960) 
norms. Each pair was composed of items 
approximately equal in AV. Eighty pairs 
were constructed from the 160 lowest AV 
items on the norms. Similarly, 80 pairs 
were constructed from the 160 highest AV 
items, which are mostly three-letter words.
Two other blocks of 160 items from above and 
below the middle of the norms were used to 
form two more sets of 80 pairs yielding 320 
pairs in all. Within each block of 160 items, 
40 pairs were formed from the 80 items with 
the lower AV and 40 from the higher AV items 
to ensure that the AV of both items in the pair 
was approximately equal. Within these con- 
straints the pairings were random. The AV 
ranges for the four sets of 80 pairs were: low
= 2-13%; low-middle = 38-47%; high-
middle = 66-76%; and high = 99-100%.

Four series of 80 pairs each were con- 
structed. For each series an equal number 
of pairs (n = 20) were selected randomly from 
each of the four AV levels and dispersed at 
random throughout the series. These series 
were used to counterbalance order effects in 
presentation. In addition, 40 pairs were 
selected at random (10 from each level of AV) 
to be used to check the reliability of the rat- 
ings. Ten of these pairs were added to each 
series rather than presenting them in a block. 
However, no pair appeared twice within a 
series. The final series used were thus 90 
pairs long. 
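
The construction of the series can be sketched as follows. This is only an illustration under stated assumptions: the text does not report how the repeated pairs were assigned to series, so the rule used here (each series repeats pairs drawn from the next series, with counts of 3, 3, 2, and 2 rotated over AV levels) is hypothetical, as are the function name `build_series` and the placeholder pair labels.

```python
import random

LEVELS = ["low", "low-middle", "high-middle", "high"]


def build_series(pairs_by_level, seed=0):
    """Return four 90-pair series built from 80 pairs at each AV level."""
    rng = random.Random(seed)
    n = len(LEVELS)  # four AV levels and, here, four series

    # Step 1: deal each level's pairs at random into n equal blocks,
    # one block per series (20 pairs per level per series).
    blocks = {}
    for level in LEVELS:
        pool = list(pairs_by_level[level])
        rng.shuffle(pool)
        size = len(pool) // n
        for i in range(n):
            blocks[(i, level)] = pool[i * size:(i + 1) * size]

    # Step 2 (assumed rule): series i repeats 10 pairs taken from
    # series i + 1, so no pair can occur twice within a series.
    # Rotating the counts 3, 3, 2, 2 over levels gives 10 repeats per
    # series and 10 repeated pairs per AV level (40 in all).
    counts = [3, 3, 2, 2]
    series = []
    for i in range(n):
        donor = (i + 1) % n
        items = [p for level in LEVELS for p in blocks[(i, level)]]
        for j, level in enumerate(LEVELS):
            items += rng.sample(blocks[(donor, level)], counts[(i + j) % n])
        rng.shuffle(items)  # disperse the pairs at random in the series
        series.append(items)
    return series


# Placeholder labels stand in for the CVC pairs.
pairs_by_level = {
    level: [f"{level}-{k:02d}" for k in range(80)] for level in LEVELS
}
series = build_series(pairs_by_level)
```

Any other assignment that keeps a repeated pair out of its own series would satisfy the constraints stated above equally well.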

Subjects.—The Ss were 240 male and female
undergraduate volunteers from the University 
of Illinois who were paid for their participa- 
tion. Equal numbers of males and females 
were run. 

Procedure.—Four different presentation or-
ders of the four 90-pair series were given to
subgroups of 30 Ss each by means of a 4 × 4
Latin square. The rows of a square were sub- 
groups of Ss and the columns represented the 
order of presentation of the four series. Or- 
der of pairs within a series was the same for all 
Ss. The square was run twice, once for males 
and once for females. Each pair was pre-
sented by a projector for 15 sec. Between the
first and second, and the third and fourth 
series there was a short rest period of about 2 
min., and at the end of the second series there 
was a 5-min. rest. 
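
The counterbalancing scheme can be illustrated with a short sketch. The particular square used is not reported, so the cyclic square below is an assumption chosen for simplicity; the function names are hypothetical.

```python
def cyclic_latin_square(n):
    """Row g lists the series (0..n-1) seen by subgroup g, in order."""
    return [[(g + position) % n for position in range(n)] for g in range(n)]


def is_latin_square(square):
    """True if every series occurs once in each row and each column."""
    n = len(square)
    symbols = set(range(n))
    rows_ok = all(set(row) == symbols for row in square)
    cols_ok = all({row[c] for row in square} == symbols for c in range(n))
    return rows_ok and cols_ok


square = cyclic_latin_square(4)

# The square was run once per sex with 30 Ss in each subgroup (row).
n_subjects = 2 * len(square) * 30
```

Because each series occupies each ordinal position exactly once, differences among series are not confounded with position in the session.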


TABLE 1

TYPES AND NUMBERS OF PAIRS
USED IN AS-3

Type of Pair

Equal numbers of pairs were taken from each AV level used
in AS-1 to be used to test intergroup reliability.

These pairs were scaled by Richardson and Erlebacher (1958).

These were pairs scaled in AS-1 but with S and R terms
reversed here to examine bidirectionality. Twenty pairs were
selected from each AV level.

Pairs with high-stimulus AV (96-100) and low-AV (13-22)
response.

Pairs with low-stimulus AV (13-22) and high-AV (96-100)
response.

Pairs with high-AV stimuli and responses.

Pairs with low-AV stimuli and responses.

a Pairs were lost because of clerical errors discovered
after scaling.


The Ss were asked to write the first associ-
ative device, if any, that a pair suggested to
them in an answer booklet with numbered 
spaces for 10 responses on each page. In- 
structions gave examples of associations for 
both high- and low-AV pairs. Emphasis was 
placed on forming an association to both of the 
items in the pair, and not just one of the mem-
bers. The Ss were informed that they were
not expected to have associations for every 
pair and if they had no association to leave 
the response space blank. Following the in- 
structions, a series of 10 practice slides com- 
posed of pairs of varying levels of AV was 
given. The Ss were permitted to ask ques- 
tions about the procedure after the practice 
series. In order to aid Ss in keeping their 
place in the booklet, another slide projector 
showed the ordinal number of each pair in the 
series. 

ASSOCIABILITY 2


In this scaling the primary intent
was to obtain a wider range of pairs
which would be suitable for future 
experimentation. Therefore, rather 
than pairing items randomly, the 
authors constructed pairs with mini- 
mal interitem similarity. 
Method


Materials.—Three hundred twenty pairs
of CVCs were constructed from Archer's 
(1960) norms. As in AS-1, within-pair AV 
was approximately equal. Seventy pairs 
were constructed from CVCs at each of the 
same four levels of AV used in AS-1. How-
ever, the pairing of items was not random.
Pairs were constructed with no consonants 
and wherever possible no vowels in common. 
In only two instances was it necessary to re- 
peat a vowel. Ten pairs at each level of AV 
were taken from AS-1, added to the list, and 
were used to determine between-groups re- 
liability for the AS values. In addition, as in 
AS-1, 10 items were chosen from each AV 
level to be given twice to obtain test-retest 
reliability. Therefore, four series of 90 pairs 
were constructed in the same fashion as those 
in AS-1.

Subjects.—The Ss were 120 male and 120
female undergraduates who were paid for their 
participation. 


Procedure.—The procedures used for coun-
terbalancing the series and presenting the
items were identical to those used in AS-1. 


ASSOCIABILITY 3 


Several purposes motivated a third 
scaling. First, it seemed necessary to 
compare the AS scale with the ease of 
learning (EL) and common mean- 
ing (CM) scales developed by 
Richardson and Erlebacher (1958). 
Second, pairs were included with S-R 
order reversed to determine whether 
AS would change appreciably. This 
was an attempt to see if the AS values 
are bidirectional. Third, sets of pairs 
of unequal item AV were included to 
explore the importance of stimulus 
and response AV in determining AS. 


Method 


Materials.—A total of 320 pairs of CVCs
were selected from a variety of sources. The 
types and number of pairs for each type used 
are listed in Table 1.

Four series of 80 pairs were constructed with 
an equal number of pairs from each type in- 
cluded in each. In addition, 10 pairs from 
each series were selected and used to measure 
test-retest reliability. Ten of these were 
added to each series making each series 90 
pairs long. No pair appeared twice within a
series. 

Procedure.—The procedure was identical
with that for AS-1 and AS-2. 

Subjects.—The 320 pairs were rated by 120
male and 120 female undergraduates from the 
University of Illinois who did not participate 
in the other scalings. They were paid for 
their time. 


Results of AS-1 and AS-2 


For each of the pairs S’s response 
was categorized as indicating that he 
generated a NLM for the pair or not. 
If S reported any device linking the 
items, such as noting letter or sound 
similarities, constructing single mean- 
ingful words by manipulating all or 
just a few of the letters, or using the 
items in a meaningful phrase or sen-
tence, it was scored as a NLM. Re-
ports of no associations, construction
of nonsense polysyllables, and omis- 
sions were placed in the other 
category. 
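
As a minimal sketch of the scoring rule just described, the AS value of a pair can be computed as the percentage of Ss whose response was categorized as a NLM. The category labels, the function name `as_values`, and the toy responses below are hypothetical illustrations and are not data from the scalings.

```python
from collections import defaultdict

# Hypothetical category labels. The first set counts as a NLM; the
# second set is the "other" category described in the text.
NLM_CATEGORIES = {"similarity", "word", "phrase"}
OTHER_CATEGORIES = {"none", "nonsense", "omission"}


def as_values(responses):
    """Return {pair: percentage of Ss scored as giving a NLM}.

    responses: iterable of (subject_id, pair, category) tuples.
    """
    n_nlm = defaultdict(int)
    n_total = defaultdict(int)
    for _subject, pair, category in responses:
        if category not in NLM_CATEGORIES | OTHER_CATEGORIES:
            raise ValueError(f"unknown category: {category!r}")
        n_total[pair] += 1
        if category in NLM_CATEGORIES:
            n_nlm[pair] += 1
    return {pair: 100.0 * n_nlm[pair] / n_total[pair] for pair in n_total}


# Toy responses for illustration only (four Ss, two pairs).
toy = [
    (1, "TEL-COM", "phrase"), (2, "TEL-COM", "word"),
    (3, "TEL-COM", "similarity"), (4, "TEL-COM", "none"),
    (1, "RIQ-KIV", "omission"), (2, "RIQ-KIV", "nonsense"),
    (3, "RIQ-KIV", "similarity"), (4, "RIQ-KIV", "none"),
]
scale = as_values(toy)
```

Computing the same percentage separately for male and female Ss would give the MAS and FAS values referred to below.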

Before assessing the relationship 
between the scalings, and between AS 
and AV, it is necessary to examine the 
effect of two relevant variables: sex 
and sequence or practice effects. For 
each S the proportion of pairs on 
which associations were reported was 
calculated for each of the four series 
of 90 pairs. In analyzing data from 
both AS-1 and AS-2 these scores were
entered into a Latin-square analysis 
of variance (ANOVA) with sex, sub- 
group, ordinal position of the 90-pair 
series, and the different series them- 
selves as factors (Winer, 1962, pp. 
554-563). A pooled within-Ss error 
term was used. In both scalings fe-
males gave significantly more associ-
ations than males, AS-1: means of 59
and 54%, F (1, 232) = 80.14; and
AS-2: means of 55 and 50%, F (1,
232) = 66.18. In AS-1 and AS-2
there was a significant decline in the
proportion of associations given over
the four series of 90 pairs, AS-1: F (3,


TABLE 2

CORRELATIONS OF MEAN AV FOR THE PAIRS
FROM AS-1 AND AS-2 WITH MAS
AND FAS VALUES

AV level
Correlation | Low | Low-middle | High-middle | High | All levels
AV-MAS
AS-1 | .17 | .01 | .01 | .05 | .23
AS-2 | .06 | .09 | -.04 | .29 | .69
AV-FAS
AS-1 | .10 | .10 | .04 | .04 | .28
AS-2 | -.02 | .06 | .04 | .17 | .68
MAS-FAS
AS-1 | .19 | .85 | .86 | .88 | .97
AS-2 | .14 | .85 | .88 | .89 | .98

708) = 56.01; and AS-2: F (3, 708) 
= 2.64. The mean AS-1 scores as a 
function of the order of the series for 
males were .58, .54, .53, and .52 while 
those for females were .64, .60, .59, 
and .58. The AS-2 scores were very 
similar. 

Intercorrelations between AS values 
of each item for males and females 
were high for both AS-1 and AS-2 as is 
shown in the lower right cells of 
Table 2. As might be expected due 
to the limited range, the correlations 
between AS values for the sexes within
levels of AV shown in Table 2 are
somewhat lower. 

Correlations between mean AV (i.e., 
average of stimulus and response AV), 
male AS (MAS) and female AS (FAS) 
within levels of AV were low and 
insignificant for both AS-1 and AS-2.
Table 2 shows the correlation coeffi- 
cients between AV, MAS, and FAS for 
pairs of different AV levels. Taken 
over four levels of AV the correlation 
was significant both for MAS and 
FAS but accounted for only about 
5-8% of the variance in AS-1. As
can be seen in the table, the coeffi-
cients in AS-2 were considerably
higher, accounting for nearly 50% of 

TABLE 3

ASSOCIABILITY VALUE OF PAIRS OF CVCs OF APPROXIMATELY EQUAL ASSOCIATION VALUE
FROM THREE SCALINGS ARRANGED IN TERMS OF INCREASING MEAN AS VALUE


CVC Pair (S | R) | AS Scaling No. | S AV | R AV | MAS | FAS | Mean AS || CVC Pair (S | R) | AS Scaling No. | S AV | R AV | MAS | FAS | Mean AS
YUX |qeh 2 14| 13| 8 |14 |11 Noy | quh 3 Moe 12) 18" A a 20) 
FEP | yoq 3 14| 14| 10 | 13 |11 DUJ |xep 2 13| 13| 18 | 21 | 20 
GEX | 2tj 2 9 4] 8] 16 | 12 YEV | gag 2 5} 11] 20 | 19 | 20 
yıv | quw 2 7|/ 13| 9 | 16 |13 xiy | si 3 4 4] 17 | 23 | 20 
XAF | jth 2 8| 13] 8 | 18 | 13 XUY | x77 1 3 2| 20 | 19 | 20 
ZI¥ i 1 8| 6] 13 | 14 | 13 XUW | xeg 1 5 5] 19 | 20 | 20 
yviy | xth 1 3} 2) 10 | 17 | 13 cIQ | sov 2 15| 15| 18 | 22 | 20 
YAV | 2uq 2 9] 12) 13 | 14 | 13 Yox | mij 2 16 7| 17 | 23 |20 
XEQ | kuj 2 DS 6] 12 PAS oe KEJ |geb 2 16| 16| 13 | 27 | 20 
yıx | huj 2 9| 14) 7 | 22 | 14 VAJ | geq 3 12| 11] 20 | 20 |20 
QOJ | suv 2 4| 13] 12 | 17 | 14 xoc | vof 1 9| 10] 22 | 18 | 20 
XAJ | vuq 2 EHO LY} 18 | 15 FEJ | voj 1 11| 11] 18 | 23 | 20 
XUT | ney 2 8) 15] 11 | 18 | 15 xoj | qu 1 2 5| 13 | 27 | 20 
GUC | pev 2 47| 46| 6 | 23 | 15 TEJ | qiw 3 9| 10] 20 | 21 | 20 
xy | auy 3 2 3) 15 | 14 | 15 xIy |goh 2 4 8| 15 | 26 | 20 
ziy | tej 2 8| 9] 15} 15 | 15 || QEF | bof 2 8] 16] 19 | 22 | 20 
vip | fej 2 16] 11] 12 | 18 | 15 KUQ |zaj 2 11 8| 19 | 22 | 20 
xaL |fep | 2 | 12] 14] 13 | 18] 15 || ciw | feq 2 | 16] 11] 19 | 22 | 20 
ZEG | yig 2 6] 10} 10 | 21 | 15 
YOF | qav 2 10| 14] 10 | 21 | 15 XOD | sug 1 12] 12] 21 } 21 | 21 
XOG | yug 3 9 9| 13 | 18 | 15 QEB | dij 3 16| 16| 20 | 22 | 21 
XUJ | yev 1 SP Sil 13 [18 [115 YAV |xaq 1 9| 10] 20 | 22 | 21 
ZUJ | xah 1 5 S| 12 | 19 | 15 YEQ | xil 2 4| 13] 20 | 22 | 21 
vij |qug | 2 9) 14] 11 | 21] 16 || zm |yeb | 3 | 13| 13] 22 | 21 | 21 
YEV | «uj 3 ‘5, 3| 16 | 16 | 16 XUG | kuj 3 6 7| 20 | 23 | 21 
QEF | wug 1 8| 8] 16 | 16 | 16 wug | gef 3 8 8| 18 | 24 | 21 
ZIJ | xy 1 4) 4] 15 | 17 | 16 YIV. | wuj 1 7 8| 16 | 27 | 21 
cij | xoh 1 8| 7] 16 | 17 | 16 || woy | gah 2 13| 14] 18 | 25 | 21 
XEM | gov 2 11| 14] 17 | 16 | 16 Moy | yig 2 13| 11] 17 | 26 | 21 
Qux |riw | 2 9| 15] 11 | 22] 16 || GEQ | xot 2 11| 9] 15 | 28 | 21 
a0 guq A ž n 8 | 24 |16 : 

yey 11 | 22 | 16 FOJ 3 11| 11] 18 | 25 |2 
VUB | vef 1 12] 12} 11 | 22 | 16 XIG oe 3 11| 11] 18 | 25 | 22 
vue |n | 3 | 14] 15| 13 | 20] 17 || cue |aif 1 | 12] 12] 18 | 26 | 22 
XIB | faj 2 6| 10} 15 | 18] 17 || Gag aaj 1 11| 13] 18 | 26 | 22 
YEF | x1 1 11| 11} 13 | 30/17 || KEV | goc 2 46| 46] 18 | 27 | 22 
YUF | xo) 1 9| 9| 14 | 19 | 17 QAM | jah 3 44| 44| 18 | 27 | 22 
ZEH | gok 2 TAPS 15 f 19] 47 KUJ |xug 1 7| 6| 20 | 24 | 22 
zuy Gus 8| 9) 18] 17}17 || an | xey 1 6| 6] 24 | 21 | 23 
zux |poj | 2 | 14| 15] 17] 18] 18 || oxy | guj 1 6| 3] 17 | 28 | 23 
Jor | buv 3 19| 15] 19 | 16 | 18 ZUF | xec 2 8 9| 18 | 28 | 23 
Buv |zeg | 2 | 15) 15] 18) 18] 18 |] cuy [ziw | 2 | 15] 5] 18 | 28 | 23 
VUB ji 2 | 12| 13| 16 | 20} 18 || yw |poj | 3 | 14| 15| 23 | 22 | 23 
ZOF b 2 11 6] 15 | 21 | 18 PUV | yih 3 22| 13| 22 | 23 | 23 
QAZ | yay 1 | 10) 9) 18} 18/18 || xuz |xuf | 3 5| 3] 20 | 25 | 23 
xup jzuf | 1 | 8) 8) 23/13) 18 || xom |ouf | 3 | 10| 67] 19 | 26 | 23 
XEQ |xaj | 1 6| 5] 19] 17] 18 || ver |xoc | 2 | 11] 9| 18 | 28 | 23 
viw |xev | 2 | 14) 6/12 | 24/18 || mor [wej | 2 | 40] 42] 15 | 31 | 23 
YAF | cej 3 19| 17| 16 | 21 | 18 QUH | noj 1 12| 12| 20 | 26 | 23 
Koy |ciw | 2 | 12| 13/20] 17/18 || xr |xac | 3 | 17| 18| 24 | 22 | 23 
YEQ | xuo 1 4) 4/15 | 22 | 18 || way |nuq 3 18| 13| 22 | 24 | 23 
zog |yub | 2 | 10| 14) 17] 21] 19 |! cee | aaj 1 | 11] 12] 19 | 27 | 23 
QEV |jiy | 2 | 12| 14] 14] 23] 19 || coy [xah | 2 | 10| 15] 21 | 26 | 28 
Xuk |xaz | 1 S| 2] 21] 18} 19 || xoz |xuq | 1 5] 3] 18 | 28 | 23 
my, |xad | 2 | 13) 7/16} 23} 19 || Qos |xeb | 3 | 19] 17] 23 | 23 | 23 
XOL | gec 2 9| 16| 14 | 24 | 19 x1Q_ |xej 1 5 3| 23 | 23 | 23 


TABLE 3 (continued)


CVC Pair (S | R) | AS Scaling No. | S AV | R AV | MAS | FAS | Mean AS || CVC Pair (S | R) | AS Scaling No. | S AV | R AV | MAS | FAS | Mean AS
voQ | tiw 3 15| 18] 19 | 28 | 23 HIB | zan 1 44| 43| 33 | 26 | 29 
QEV | sof 1 12] 11] 23 | 24] 23 |] Qom |xew | 1 8} 7} 28 | 31 | 29 
vic | luj 1 10| 10] 25 | 22 | 23 
sty | yoq 2 | 14| 14] 23 | 24] 23 || xar | gij 1 | 12| 13] 28 | 31 | 30 
GOX | nuv 2 12| 15] 26 | 33 | 30 
xoL | yuf | 3 9| 9] 23] 25 | 24 |] Koy | wir 1 | 12} 12] 33 | 27 | 30 
ar | cug 3 | 12| 12] 21 | 27 | 24 || goy |zoj 1 4| 3] 28 | 32 | 30 
QH | xiw 1 6 3| 23 | 25 | 24 WEX | gay 2 38| 41] 31 | 29 | 30 
xUP | giy 1 7| 8] 25 | 23 | 24 || cox | ber 2 | 40] 44] 28 | 33 | 30 
XIC | gex 1 10] 10] 24 | 23 | 24 || cex |qu | 1 9| 9] 33 | 28 | 30 
XUF | xuz 1 3 5| 23 | 25 | 24 DEJ | xem if 13] 11] 31 | 29 | 30 
Lay |vup | 2 | 14] 14] 22 | 26 | 24 |}! xos | gix 1 | 12] 11] 28 | 33 | 30 
yay | vef 2 7| 12| 20 |28 | 24 |! ruv | gof 3 | 16] 20} 31 | 30 | 30 
GEJ | gaz 2 9] 10| 18 | 29 | 24 || ow | tej 1 | 10| 9] 23 | 38 | 30 
que | yih 2 | 15| 13] 23 | 25 | 24 |! ray | xot 1 | 10] 9] .27 | 34 | 30 
XUL | hij 2 7| 10| 22 | 27 | 24 
vog |xud | 2 | 15] 6] 19 | 29} 24 |] zur | rej 1 | 38] 39] 33 |28 | 31 
zr | pej 3 | 20| 17] 22 |27 |24. || rev |hag | 1 | 46| 45] 31 | 31 | 31 
xen |xoy | 1 5| 4] 33 | 29 | 31 
XEZ | xab 1 7| 8| 25 | 24 | 25 xos | xog 1 6| 6| 24 | 38 | 31 
HAQ | ziv 2 | 45| 39] 19} 31 | 25 || ouy |gej 3 3} 6] 27 | 36 | 31 
WUQ | xir 2 | 13| 12] 18 | 32] 25 || rup [kex | 1 | 40| 39] 25 | 38 | 31 
QUF | xor 3 19| 19] 20 | 30 | 25 VA) | xis 2 12| 14] 30 | 33 | 31 
XET | gaj 1 11] 11] 20 | 30 |25 || coo |xum| 2 | 13] 13] 25 | 38 | 31 
GuQ | foj 1 11] 11] 20 | 30] 25 || srw | cof 2 15] 10} 23 | 40 | 31 
XER | qaj 2 | 11} 12] 20 | 31 |25 
NIJ | zej 1 7| 6| 23 | 28] 25 || toy |gac | 3 | 17] 17] 28} 36 | 32 
vor | yix 1 10| 9] 24 | 27] 25 |] coc | yos 1 | 39] 38| 28 | 36 | 32 
vur |xom | 1 10| 10| 24 | 27 | 25 || Pex | dut 2 | 40| 44] 28 | 36 | 32 
RAX | yoc 1 | 47| 44] 22 | 29] 25 || Kez | miv | 3 | 16] 17] 30 | 34 | 32 
XAW | dij 2 8| 16| 18 3 26 || rev |dog | 2 | 40| 44] 28 | 37 | 32 
XER | yi 1 12| 11] 22 | 30 | 26 
xM oe 2 | 11] 13] 22 | 30 | 26 || vem | lig 1 a 5 J F E 
YUJ | xo; 30 | 26 WUM | fi 
EREE E ee ae eee EE 
KUQ | fe 1 | 11} 11] 28 | 25] 27 || vag | tof 2 4 
Qc oe 2 | 14] 13] 25 | 28] 27 || ZAN | hid 3 | 43| 44| 32 | 34 | 33 
FAP | puw | 1 | 44| 43| 26 | 28 | 27 || xur | xik 3 8| 6] 33 | 33 | 33 
cuw | vok 2 | 38| 42] 25 | 29] 27 || QEP | tux 3 | 13| 16] 33 | 34 | 33 
KEX | lug 2 39| 42| 19 | 35 |27 GUJ | gox 1 11| 12| 42 | 25 | 33 
VOH | ruj 3 | 16] 18] 30 | 24] 27 || tQ |gep | 1 | 41] 41] 32 | 35 | 33 
crip |moy | 3 | 47| 45] 23 | 32 | 27 || ruB }wix | 2 | 40| 44| 33 | 34 | 33 
yos | pug 2 38} 42) 30 | 37 | 33 
XOV | xuh 1 3 4| 24 | 31 | 28 KuB | nop 2 70| 72) 29 | 38 | 33 
Yus | mi 2 | 38] 42] 26 | 30 | 28 
SEW ok 3 7| 8| 29 |27]|28 || xy |xad | 1 8| 7| 30 |38 |34 
LIY | zep 2 | 46| 46] 28 | 28 | 28 || Toz | tev 1 | 40] 40] 27 | 41 | 34 
ZEV |gax | 3 | 19| 19] 28] 29] 28 || wEQ |zan | 2 | 38| 43] 28 | 40 | 34 
XAG | boj | 3 | 19| 16] 25 | 32 | 28 || xum |ciw | 3 | 13| 15] 31 | 38 a 
XUL | xev 1 7| 6] 30 | 27 | 28 || moy | cid 1 | 45} 47] 33 | 35 af 
GEJ | vuj 1 9| 9] 23 | 34 | 28 || was | gok 1 | 39| 40] 30 | 38 | 3 
Yur | giv 2 9| 11] 25 | 32 | 28 
XUR Joj 2 9| 11] 23 | 34 |28 || vor |zad 1 | 38] 38| 31 | 38 z 
XOP | xok 1 7 7| 25 | 32 |28 HUJ |xan 3 14| 14| 28 | 42 3 
XID | xuc 1 9| 10| 27 | 31 | 29 TIQ | vay 2 41| 43| 35 38 3 
vip | paf | 1 | 40| 40] 24 | 33 | 29 || coj | 20g 1 | 11] 10] 31 4 z 
XIR | xef 1 5|. 3] 23 | 34 |29 || zaJ | xiz 1 8| 6| 29 | 41 2 
KH |quv | 2 | 15] 9] 25 | 33 | 29 || HEG | tud 2 | 45| 47) 31 | 40 
Zor |muj | 2 7| 10| 21 | 37 |29 || BEW | geb 1 | 42| 42] 35 a 2 
PAF [nuj | 2 | 40| 44] 20 | 38| 29 || yuk |qam| 2 | 40| 44] 29 
TABLE 3 (continued)
CVC Pair (S | R) | AS Scaling No. | S AV | R AV | MAS | FAS | Mean AS || CVC Pair (S | R) | AS Scaling No. | S AV | R AV | MAS | FAS | Mean AS
HEG | sug 1 45| 47] 35 | 37 | 36 CEV | hiy 2 38| 43| 41 | 42 | 42 
NIR | gep 2 38| 41| 35 | 37 | 36 VAD | jor 2 39| 44| 41 | 43 | 42 
Goz | muq 2 45| 38| 34 | 38 | 36 VUN |d 2 40| 41| 40 | 43 |42 
GOQ |xim 1 13| 11] 32 | 41 | 36 BIM | dol 2 68| 72| 38 | 46 | 42 
NEQ | Rib 1 42| 44| 32 | 41 | 36 MOG | gic 3 45| 45| 47 | 37 | 42 
TOZ | ruc 2 40| 47] 35 | 38 | 36 Loz | piv 2 40| 44] 38 | 47 | 42 
YOT | puh 2 38| 42| 33 | 39 | 36 REJ | hib 2 39 | 44| 42 | 43 | 42 
JAH | gam 1 44| 44| 30 | 43 | 37 MIP | piv 1 42| 44| 44 | 40 | 42 
XEK | xus 1 8| 6} 37 | 37 | 37 
soz | mid 2 38| 44| 38 | 36 | 37 LIG | vox 2 46| 46] 38 | 47 | 43 
WOG | suy 2 38| 41| 34 | 40 | 37 TAF | nod 1 71| 72| 33 | 52 | 43 
DOY | gac 2 39| 47| 32 | 43 | 37 VOK | goy 3 42 | 43| 44 | 41 | 43 
CIP ux 3 75| 75| 38 | 37 | 37 BEQ | hun 1 41| 41] 36 | 49 | 43 
WOG | cek 3 38| 40| 38 | 37 | 37 ZEN | vax 2 38| 41] 41 | 43 | 43 
xis | juf © 14| 18| 32 | 43 | 37 NUH | peq 2 38| 44| 43 | 43 | 43 
PUQ | fu 1 42| 43| 29 | 45 | 37 KUB | ril 3 70| 69| 43 | 43 | 43 
BAZ | fo 2 38| 43| 33 | 44 | 37 YUK | fos 1 67| 68| 37 | 49 | 43 
a las | 7 | 8) 818/818 
Nì wi: 1 GuB | doy 1 
mola | t | 4a aal oe | 37 |p || wor |se | 2 | 68| 73] 39 | 48 | 43 
CEK | wog 1 40| 38| 35 | 40 | 38 XOF | xaf 1 6 8| 38 | 50 | 44 
Kaz | guv 2 39| 42| 39 | 36 | 38 VAM |rul 2 68| 76| 43 | 45 |44 
was |sin | 2 | 39| 47| 37] 38| 38 || BIP |vom | 1 | 71| 70) 42 | 47 | 44 
vox | ze; 1 46| 46| 37 | 38 | 38 
Gen |bew | 3 | 42| 42| 36 | 20 | 38 || tr [ioe | 1 | 46] 45] 42 | 48 | 45 
XAM | geb 2 38| 42| 40 | 36 | 38 SUT | huc 3 70| 69| 43 | 47 |45 
CEK | raj 2 40| 44] 33 | 43 | 38 cry | ruk 2 40| 44| 46 | 44 | 45 
HUK | vot 2 67| 72| 36 | 40 | 38 TIY | bez 2 40| 44] 39 | 51 | 45 
xuc | xid 3 10| 9] 40 | 36 | 38 JUK | vad 3 40| 39| 42 | 48 | 45 
xus | xek 3 6 8| 37 | 39 | 38 XEC | xun 1 9 9| 44 | 46 | 45 
BEH q 1 44| 44| 32 | 44 |38 JAT | gow 1 41| 42] 40 | 50 | 45 
TEZ | kag 2 38| 41] 33 | 43 | 38 PAG | fey 2 74| 68| 40 | 51 | 45 
ZON | yan 1 43| 43| 38 | 39 | 38 GUD | hg 1 42| 41| 41 | 50 | 45 
BIM | fey 1 68| 68| 32 | 45 | 3g || ZIV. | up 1 39| 39| 39 | 52 | 45 
REZ |wiy | 2 | 38| 42] 35 | 42 | 38 || VIB | son 2 | 40| 43| 44 | 47 | 45 
zm |xog | 2 6| 9] 33 | 44 | 38 || L0Q | rx 2 | 38| 43] 47 | 44 | 45 
NUG | hof 2 66| 74| 43 | 48 | 46 
xix xut | 1) 6) 8] 40} 38/39 |! ror |yaw | 2 | 75| 76| 41 | 51 | 46 
miy |gel | 1 | 43| 43) 36 | 42 | 39 
cay | hek 1 74| 74| 41 | 51 | 46 
PP pog | 1) 42) 41) 34) 43) 39 || Bie |ced | 2 | 71| 73| 45 | 47 | 46 
Rig |jat | 2 | 38| 41| 36 | 42 | 30. || Be |< pi laa ias] 4p |50, | 46 
ser |mof | 3 | 41| 40] 39} 40] 30 || we Za | 3 | 16| 44] 43 | 49 | 46 
or (oe EE we 2 43 | 39 || vos kas | 1 | 38] 30| 40 | 53 | 46 
my i | t | 10] 40) 33] 45/50 | Ser Jess | 2 | a2) asl 43 | #9 | 4s 
s 38| 41) 38 | 41 | 39 || iuw |hes | 2 | 39| 41] 43 50 | 47 
RUC | guc 1 47| 47| 36 | 43 | 40 cS ad > 30 17 2 52 | 47 
WEC [fas 2 | 38| 42] 32 | 48) 40 || wig | zen 3 38| 38| 43 | 51 | 47 
JOW | jek 3 39| 39| 38 | 43 | 40 LUB | fes 1 69| 69| 46 | 48 | 47 
voc |zap | 2 | 66| 70] 41 | 40} 40 |] ux |yat | 2 | 46] 46] 43 | 51 | 47 
DQ jji | 1 | 41| 41] 38} 43] 40 || tz |g | 2 | 75| 75) 43 | 51 | 47 
KAG |pex | 1 | 41| 40) 34] 47 | 40 || nem | fim | 1 | 41| 42| 43 | 52 | 47 
Fuw |dag | 2 | 38| 38| 37] 44} 40 || xac |das | 1 | 72| 72] 39 | 55 | 47 
siB_|mah) 2 | 74| 76) 37| 44140 || ciz |cah | 3 | 39| 40| 48 | 47 | 47 
XUD fee 1 i 8| 38 | 43 | 41 
MOF | se 41 | 38 48 
xev jas | 1 | 45] 45133 | 49 | 41 || cw [nee | 7 | 40| So| ae | 51 | 48 
FIC [wey | 2 | 68) 72| 36] 47] 41 || zen | wig 1 | 38| 38| 44 | 51 | 48 
TABLE 3 (continued)

CVC Pair (S | R) | AS Scaling No. | S AV | R AV | MAS | FAS | Mean AS || CVC Pair (S | R) | AS Scaling No. | S AV | R AV | MAS | FAS | Mean AS
HUP | vik 3 75| 74| 51 | 44 | 48 MAQ | jir 2 38| 41] 53 | 58 | 55 
NOF | wiv 2 38 | 43 | 43 | 53 | 48 DAS | nac 3 72| 721 59 |. 54 | 55 
DEG | sig 1 45| 47| 46 | 50 | 48 BIS | lav 3 72| 72) 58 | 53 | 55 
JUP | huw 2 39| 41| 38 | 58 |48 RIZ | wod 2 66| 70| 54 | 57 |55 
GOM | pov 1 47| 45| 48 | 48 | 48 CAK | neb 2 67| 71} 54 | 57 | 55 
VAX | suy 1 41| 41| 50 | 47 | 48 RAJ | wiv 1 44] 43| 49 | 62 | 55 
jag |huk | 1 | 68| 67] 48 | 50] 49 || Kor | nid 1 | 71| 72] 53 | 59 | 56 
GOY | vok 1 43 | 42| 43 | 54 | 49 TOB | hiz 2 66} 75| 50 | 63 | 56 
HOK | nuf 2 75| 69| 43 | 54 | 49 
VAD | juk 1 39| 40] 45 | 53 |49 MUY | lix 1 45| 46| 53 | 60 | 57 
PUH | wej 1 42 | 42| 45 | 53 | 49 SAR | hev 2 68| 70} 61 | 53 | 57 
CEN | yow 2 75| 73| 48 | 51 | 49 TIY | lug 1 40| 42| 53 | 61 | 57 
TUW | nem 2 38| 41| 44 | 54 | 49 WEH | gez 1 41| 42| 51 | 63 | 57 
HUC | sul 1 69| 70} 49 | 49 | 49 KOM | neg 1 69| 69| 50 | 63 | 57 
RAL | voc 1 67| 66] 50 | 63 | 57 
WIK | hes 2 67| 71| 46 | 54 | 50 DAP | lof 2 67| 70| 52 | 62 | 57 
DAP | wik 1 67| 67| 44 | 56 | 50 FOZ | fex 1 41| 42] 52 | 62 | 57 
HOF | fop 3 74| 75| 53 | 48 | 50 LIQ | daf 1 71| 74| 49 | 64 | 57 
WAP | pag 1 74| 74) 57 | 58 | 57 
TUJ | yop 1 10 9 | 48 | 53 | 51 WEX | wum| 1 38 | 38] 49 | 65 | 57 
VAG | dob 1 75| 75| 42 | 60 | 51 SEG | rud 2 68| 73| 53 | 62 | 58 
HEZ | gay 1 41| 41| 49 | 53 | 51 JOH | pem 1 45| 45] 63 | 52 | 58 
HOX | few 2 74| 76| 52 | 50 | 51 SEQ | deh 1 44| 44| 57 | 58 | 58 
LUQ | tiy 3 42| 40] 47 | 56 | 51 CAH | gis 2 40| 44] 55 | 61 | 58 
HUX | cip 1 75| 75| 49 | 53 |51 VER | lum 2 67| 73} 53 | 63 | 58 
NAK | hox 1 74| 74| 54 | 48 | 51 FIC | fum 3 68| 67| 58 | 58 | 58 
BIS | wuf 2 72| 67) 48 | 55 | 51 PIR | hok 1 74| 75| 51 | 65 | 58 
TEP | wix 1 45| 44/45] 8] 51 siy | gow 2 38| 42| 57 | 59 | 58 
pur | gid 1 | 44] 42] 42] 61 | 51 || sw |Ror | 1 | 74| 75| 52 | 65 | 58 
melee] i |e) asa 
Fos | mec 2 68| 67| 47 | 57 | 52 NEB | nav 
vis | kog 2 75| 76| 47 | 57 | 52 YUH | huz 1 45| 46] 57 | 61 | 59 
voy | wut 2 68| 69] 44 | 59 | 52 || Tav | hod 1 70) 69} 53 | 64 | 59 
HIQ | toh 1 47| 46] 54 | 49 | 52 HUS | pey 2 70| 67| 53 | 65 | 59 
CIP | nes 2 75| 72| 48 | 56 |52 || YAT | giz 1 46| 47| 52 | 67 | 59 
BEK | sot 2 66| 69| 46 | 58 | 52 
RAL | tid 2 67| 72| 47 | 58 | 52 TIZ | des 1 75] 75| 53 | 66 | 60 
Jos | nur 1 68| 67| 45 | 59 | 52 || FEZ | cil 3 n n 3 H eo 
VI, xo 52 LIB | vas 
x fee | act oe] at a A ee cide an] a Sage 
DIR |nac | 2 | 68} 72| 50] 55] 53 || QUT |yor | 1 
LAN | tif 2 75| 76| 49 | 56 |53 || vox | hev 1 68| 70| 53 | 67 | 60 
For | hof 1 75| 74| 45 | 60 | 53 GOH We 3 41| 40] 51 | 70 | 60 
cis | piy 1 39| 39| 51 | 54 | 53 VAM p 1 68| 70| 57 | 64 | 60 
JEK | jow 1 39 | 39| 51 | 55 | 53 JOH | tu 2 45| 47| 58 | 65 | 61 
VUN | guv 3 40| 42] 53 | 53 | 53 cos | ced 1 74| 73| 56 | 67 | 61 
HUY | bew 2 39| 42| 44 | 62 | 53 CIK | mec 1 67| 67| 58 | 66 | 61 
wos |qac | 1 | 45| 47| 41 | 66] 53 || xr |mep | 1 | 44| 46) 58 | 66 | 62 
FOH | bez 1 43| 44| 48 | 76 | 62 
SUG | heg 3 47| 45| 54 | 53 | 54 || wom | tir 2 67| 67| 53 | 72 | 62 
BEY | maj | 1 | 75| 74| 49 | 58 | 54 || oc |mog | 1 | 45| 45] 65 | 59 | 62 
per | maj | 2 | 69| 74| 50] 58|54 || Jor |mid | 1 | 44) 44) 55 | 09 | 62 
WEY | lum 1 72| 73| 45 | 63 | 54 CUN | dok 1 73| 72| 57 | 68 |63 
YUK | laz 2 | 66| 68| 50 |58 |54 || Kor |sib 3 | 75| 74] 63 | 63 | 63 
rum |wap | 2 | 67| 74] 47 | 62 | 54 || cow |jat | 3 | 42| 41] 63 | 62 | 63 
CAH | cig 1 40| 39] 49 | 59 | 54 KOG | jus 1 76| 76| 62 | 63 g 
vax |gis | 1. | 43| 44] 48 |61 |55 || wer |yir | 3 | 60) 44) 63 | 68 |63 
yor [wes | 1 | 39] 39| 51] 58| 55 |] po |gur | 1 | 67| 68) 58 | 68 | 6 
NEG | dup 2 69| 74| 55 | 54 |55 || FAS | woz 1 74| 68| 51 | 75 | 63 
TABLE 3 (continued)


CVC Pair (S | R) | AS Scaling No. | S AV | R AV | MAS | FAS | Mean AS || CVC Pair (S | R) | AS Scaling No. | S AV | R AV | MAS | FAS | Mean AS
TAV |nur 2 70| 67| 62 | 64 | 63 DAW | lig 2 67| 71| 66 | 80 | 73 
KAS | hek 2 67| 74] 59 | 67 | 63 HIC ow 2 99 | 100 | 74 | 72 | 73 
REK |suz 2 73| 68| 58 | 69 | 63 FIV ep 2 66| 70| 72 | 74 | 73 
DAL | you 1 74| 73| 62 | 65 | 63 RET | seg 1 68| 68| 72 | 74 | 73 
SAR | cul 1 68| 68| 59 | 68 | 63 RUD | jun 1 73| 73| 69 | 77 | 73 
TOB | mex 1 66| 67| 63 | 65 | 64 || TAM | sid 3 98| 98| 69 | 78 | 73 
PAG | wap 3 74| 74| 63 | 66 | 64 || cor | fay 3 98| 98| 68 | 79 | 73 
RIV | cos 2 69| 74| 58 | 70 | 64 || TEK | ma 1 75| 74| 70 | 77 | 73 
cut |maz | 2 68| 72| 66 | 63 | 65 || Kom | vik 2 69| 74| 73 | 74 | 74 
YUH | nis 2 45| 41| 58 | 71 | 65 FES |jun 2 69| 73| 66 | 82 | 74 
PEL | fak 2 75| 76| 59 | 70 |65 VAN n 1 100 | 100 | 68 | 80 | 74 
BEY | mos 2 75| 1751 59 | 70 | 65 DUZ | log 2 99 | 100 | 76 | 73 | 74 
DES |tok 2 75| 69| 65 | 64 | 65 || MUN | cer 2 67| 71| 68 | 80 | 74 
LAV | bis 1 72| 72| 64 | 65 | 65 MIX | lag 2 99 | 100 | 74 | 74 | 74 
XAM | los 1 38| 40] 60 | 70 | 65 || TUG | bin 2 99 | 100 | 74 | 76 | 76 
FUM | fic J 67| 68| 68 | 62 | 65 MEX | pir 2 67| 74| 70 | 80 | 75 
RIL | kub 1 69| 70| 61 | 69 | 65 suz | pey 1 68| 67| 73 | 78 | 75 
vik | hup 1 74| 75| 63 | 68 | 65 wir | dus 2 69| 73| 71 | 79 | 75 
CER | civ 1 741| 71| 61 | 71 | 66 LAR | daz 1 76| 76| 73 | 78 | 75 
NES | rek 1 72| 73| 64 | 68 | 66 MUN | tir 1 67| 67| 68 | 84 | 76 
TOG | bun 3 97 |100 | 72 | 61 | 66 DUS | maz 1 73| 72| 75 | 78 | 76 
coz | rel 1 74| 72| 62 | 72 | 67 || cop | hic 1 99| 99| 73 | 80 | 76 
FEN | cov 1 74| 73| 64 | 70 | 67 TOK | hus 1 69| 70| 73 | 80 | 76 
NAM | dup 1 73| 74| 63 | 73 | 68 TIL | bow 3 97| 99| 77 | 76 | 76 
PAZ | gel 2 38| 43| 67 | 68 | 68 KIM | yet 3 98| 97| 78 | 76 | 77 
PES | mul 2 68| 44| 62 | 73 | 68 || RIM | bun 2 99 | 100 | 73 | 80 | 77 
WUR | cet 2 69| 69| 68 | 68 | 68 JAQ | cun 2 68| 73| 73 | 81 | 77 
MUY | sig 2 45| 47] 70 | 66 | 68 || pir | wuf 1 68| 67| 74 | 80 | 77 
yus | nis 1 40| 41] 61 | 75 | 68 || ver | wur 1 70| 69} 71 | 86 | 78 
CET | vew 1 69| 68| 56 | 80 | 68 CIK | fas 2 67| 71| 79 | 78 | 78 
HUY | luw 1 39| 39) 64 | 72 | 68 KAS | cak 1 67| 67| 77 | 81 | 79 
YAK) | lum | 3 97| 98| 69 | 67 | 68 || BEL | mac | 1 99| 99| 76 | 82 | 79 
DUG | zip 1 | 100] 100} 62 | 74 | 68 NUF | lof 1 69| 70| 75 | 83 | 79 
PES |daw | 1 68| 67| 66 | 71 | 68 || maz | dus 3 72) 73) 73 | 85 | 79 
PAR | dab 1 99| 99] 64 | 73 | 68 SOF | lek 2 69| 74] 73 | 85 | 79 
LIB | mos 1 75| 75| 59 | 78 | 69 || wac | ben 1 99| 99| 74 | 84 | 79 
Jos | bif 2 68| 76| 64 | 73 | 69 
FOD | fez 1 69| 71| 61 | 78 | 69 || TAG | dip 1 100 | 100} 73 | 86 | 80 
KOL | rab 2 14| 76| 68 | 71 | 69 || air | yer 2 66| 70| 81 | 78 | 80 
JAD | bok 2 66| 69| 60 | 78 | 69 TAP | wig 1 100 | 100 | 75 | 84 | 80 
KER | nok 1 76| 75| 68 | 70 | 69 BOK | det 1 69| 69| 74 | 85 |80 
QAN | goh 1 40| 41| 61 | 78 | 69 || caB | dip 2 | 100} 100] 77 | 83 | 80 
YUM | pad 3 90 | 100 | 69 | 69 | 69 || LEG | rim 1 99| 99| 78 | 83 | 80 
GUV | wun 1 42| 40] 75 | 86 | 80 
NUR | Jos 3 67| 68| 68 | 72 | 70 SAF | dux 2 69| 74| 82 | 79 | 80 
HES | kod 1 71| 73| 63 | 77 | 70 POT | duz 3 99| 99| 78 | 84 |81 
VOT |dux 1 72| 74) 62 | 78 | 70 RIV | saf 1 69| 69] 78 | 84 | 81 
DUP |nam| 3 74| 73| 68 | 73 | 70 BOX | wel 2 99 | 100 | 84 | 78 | 81 
sor | wut 1 70| 69} 68 | 72 |70 || wac | lay 1 99| 99| 80 | 83 | 82 
WOM | ver 1 67| 67| 63 | 77 | 70 KIN | mal 2 99 | 100 | 79 | 84 | 82 
MUL | vs 1 74| 75| 64 | 76 | 70 RAW | hit 2 99 | 100 | 81 | 83 | 82 
FAK | hiz 1 76| 75) 58 | 83 | 71 BOW | vet 2 99 | 100 | 77 | 88 | 82 
VEW | lav 2 68| 72) 64 | 78 | 71 || BEG | fad 2 |100} 100} 80 | 84 | 82 
CIL | stv 1 712| 73) 68 | 75 | 71 DEY | laz 1 68| 68| 81 | 85 | 83 
LEF | rof 3 67| 67] 68 | 74 | 71 sip | fan 1 99| 99| 80 | 86 | 83 
BUT Juan | 2 | 99/100) 67 | 76 | 71 || row | od | 1 | 100] 100] 79 | 87 | 83 
ROF | lef 1 | 67) 67| 68 | 74 | 71 || Buz | tip 2 | 99] 100] 84 | 82 | 83 
LAN | kol 1 | 75| 74] 66 | 78 | 72 || cat |mid | 1 | 99| 99] 81 | 86 | 83 
Moy |gud | 2 | 45| 42) 71 | 74 | 73 || wrx | got 1 99| 99| 81 | 86 | 83 
cas |rop | 2 | 75| 76| 73 | 73) 73 || vis | mul | 3 | 75| 74] 82 | 85 | 83 
CVC Pair                                                       CVC Pair
S | R | No. of Scaling | AV (S) | AV (R) | MAS | FAS | Mean AS || S | R | No. of Scaling | AV (S) | AV (R) | MAS | FAS | Mean AS
KEG | pad 2 99 | 100 | 84 | 83 | 83 sow | buz 1 99| 99| 89 | 90 | 90 
RUT | him 2 99 | 100 | 84 | 83 | 83 BIT | liz 1 | 100/100] 88 | 92 | 90 
KIN | lug 3 99| 98] 85 | 83 | 84 RUG | dot 1 | 100} 100} 88 | 92 | 90 
pir | lab 2 99 | 100 | 82 | 86 | 84 KEY | nut 2 99 | 100 | 92 | 88 | 90 
CUB | tex 2 99 | 100 | 82 | 86 | 84 RID | hut 2 99 | 100 | 89 | 91 | 90 
CUB | jan 1 99| 99| 78 | 90 | 84 JAN | met 2 99 | 100 | 88 | 92 | 90 
BEL | jug 2 99 | 100 | 84 | 84 | 84 PIC | kay 2 99 | 100 | 88 | 93 | 90 
MUG | let 2 99 | 100 | 83 | 87 | 85 SAF | riv 3 69| 69] 90 | 90 | 90 
RAH | few 1 99| 99} 82 | 88 | 85 MID | gal 3 99| 99| 87 | 93 | 90 
MAC | her 2 99 | 100 | 84 | 85 | 85 HOW | map 1 100 | 100 | 88 | 92 | 90 
PEN |rig 3 99| 99] 84 | 85 | 85 NED | may | 2 99 | 100 | 90 | 91 | 90 
SOW | jet 2 99 | 100 | 85 | 85 | 85 MAY | jug 3 | 100] 100] 88 | 93 | 90 
vic | fun 2 99 | 100 | 79 | 91 | 85 JEW | far 1 | 100} 100] 90 | 91 | 90 
CON | rib 3 | 100] 100] 86 | 84 | 85 DON | sir 1 | 100} 100} 88 | 93 | 90 
SIN | dog 2 99 | 100 | 86 | 85 | 85 FUR | rut 1 99| 99] 88 | 93 | 90 
JUG | may 1 | 100} 100} 86 | 85 | 85 CAB | pet 1 | 100} 100] 92 | 89 | 90 
RAP | cow 2 99 | 100 | 87 | 84 | 85 PUT | hog 2 99 | 100 | 89 | 92 | 90 
CAS | pel 1 75| 75| 82 | 90 | 86 MOB | fix 2 99 | 100 | 89 | 92 | 90 
JAP | not 2 99 | 100 | 83 | 89 | 86 PUB | tab 1 | 100} 100] 89 | 93 | 91 
ROY | for 1 |100] 100] 85 | 88 | 86 SAP | tom 3 |100| 99} 90 | 92 | 91 
KEG | bus 3 99| 99| 93 | 80 | 87 Row | dug 2 99 | 100 | 89 | 93 | 91 
RAW | pul 3 99| 99| 91 | 82 | 86 WAD | boy 2 99 | 100 | 89 | 93 | 91 
RAN |nag | 1 | 99| 99| 88 | 86] 87 || BIG |pug | 1 | 99] 99] 90] 93 |91 
SEG |ret 3 68| 68| 90 | 83 | 87 HAL |ten 1 99| 99] 89 | 93 | 91 
Low | jam 2 99 | 100 | 88 | 86 | 87 Jon | dew 1 99 | 99| 89 | 93 | 91 
CIV | jus 2 71| 76| 86 | 88 | 87 FIT | ham 1 | 100] 100] 90 | 93 | 91 
BAG | wit 2 99 | 100 | 86 | 88 | 87 saD | bin 3 | 100/100} 91 | 92 | 91 
FUR | sag 2 99 | 100 | 87 | 87 | 87 GUN | tim 3 |100) 99] 90 | 93 u 
SUN | lug 1 99| 99| 89 | 84 | 87 JAM | bay 1 | 100] 100) 91 Ra a 
SEW |can | 2 | 99] 100] 86 | 88 | 87 || max | gun | 2 | 99/100) 93 oles 
KEN | rub 3 |100| 99| 84 | 90 | 87 sıx | hat 2 99 | 100 | 92 | 9 Bt 
PAY | but 1 99| 99] 88 | 87 | 88 || Lip |mud| 2 99 | 100 | 92 A Bi 
BUS | keg J 99| 99| 87 | 88 | 88 TOY | den 2 99 | 100 S E 
LEG | map 2 99 | 100 | 88 | 88 | 88 REX | was 3 98| 97 a alias 
BED |cut | 2 |100| 100] 88 | 88] 88 || Tus | fed | 3 | 100) 99 RAA 
Lor | rip 2 |100} 100] 88 | 88 | 88 || sat | hip 2 99 100 NGS ites 
FAN | sip 3 99| 99] 86 | 90 | 88 cop | tin 2 99 | 100 Soi abaa oe 
PUT |raw 1 99| 99| 86 | 90 | 88 FIG |saw 2 99 a AAA 
MAC | bel 3 99| 99] 88 | 88 | 88 || REL | coz 3 72 AEEA 
BAN | dot 2 | 100} 100} 88 | 89 | 88 GUT | jim 2 99 eH anuliael ios 
Lor | sag 1 | 100] 100} 89 | 88 | 89 GUM | rob 2 3 99. 89.1 95.1192 
GUM | pig 1 | 99] 99] 83 | 95] 89 || RID | pat | 1 Hoo iad [ost eels 
CIV | cer 3 71| 71] 89 | 88 | 89 wit | lab 3 10011193193193 
TUG | sun 3 99| 99] 89 | 88 | 89 GEM | wax 3 | 100 ERA 
PAT | rib 2 99 | 100 | 90 | 88 | 89 ROB | kit 1 10% a E 
HAG | pub | 2 | 99|100| 88 | 89 | 89 || war |kip | 1 o 100 921 94 193 
HEP |ray 2 99 | 100 | 87 | 91 | 89 CUT | tar 1 Ms 100102 | 9411/93 
DUZ | pot 1 99| 99] 90 | 88 | 89 Now | lis 2 100 |-91 | 96| 93 
GAY | lid 1 99| 99| 88 | 91 | 89 || yes | bit 2 | 99 EEEE 
POT |mad | 2 99 | 100 | 85 | 93 | 89 || RAP | six 1 99 39| 92 | 95 | 93 
wa, Jeep | 1 | 98] 99] 87 | 92189 | siz jeer | 2 | 99| 100] 93 | 94 | 98 
TAD [sod] (4) 90) 001 8 Oka) By E eed at | 00 an E E 
var fne | 1 [190100] as | 93 | 90 || van fed | 2/102) 100) 93 | 98 | 38 
con (fe | 3 | 99) 99] 9188|30 | Bem |s | 1 |300] 100] 93 | 95 | 94 
a [iad] 2 1984] E 85 E [gee || 29) 800 | 100) 9 |, 93,1 04 
Gur |jar | 1 | 99] 99] 89 | 90/90 || ver |fox | 3 99 |100| 94 | 93 | 94 
SAK | sin 3 | 94] 99] 90 | 89 | 90 || way | jet 3 100 | 100| 93 | 95 | 94 
pat fie | 1 | 99} 99] 90} 89 | 90 || oR S | 3 | ’99| 100] 93 | 95 | 94 
BED | lag 1 |100!100! 89 | 90 | 90 || suc | wax | 2 3 
TABLE 3 (continued)

CVC Pair                                                       CVC Pair
S | R | No. of Scaling | AV (S) | AV (R) | MAS | FAS | Mean AS || S | R | No. of Scaling | AV (S) | AV (R) | MAS | FAS | Mean AS

BIN |sad 1 | 100} 100} 95 | 93 | 94 HAY | hot 1 100 | 100 | 98 | 93 | 95 
HOG | sob 1 | 100} 100] 93 | 95 | 94 top | log 1 100 | 100 | 95 | 96 | 95 
DAM | cup 3 | 100/100} 95 | 93 | 94 TAN | hid 1 99] 99| 94 | 97 | 95 
LIT | son 3 96 | 100 | 93 | 96 | 94 JOB | saw 1 100 | 100 | 97 | 94 | 95 
TAN | fox 2 99 | 100 | 95 | 93 | 94 FAT | son 2 99 | 100 | 93 | 98 | 96 
HID | pet 2 99 | 100 | 93 | 95 | 94 suM | cop 1 | 100/100] 97 | 95 | 96 
BUZ | sow 3 99] 99| 94 | 94 | 94 MUD | wel 3 | 100] 100} 95 | 97 | 96 
ROD | box 1 99} 99) 92 | 97 | 94 HID | lan 3 99| 99| 94 | 98 | 96 
MOB | mug 1 99| 99| 93 | 96 | 95 PAR | kit 2 99 | 100} 95 | 97 | 96 
TIN | nul 1 | 100} 100} 93 | 96 | 95 PIG | ken 2 99 | 100 | 95 | 97 | 96 
MAR | lad 1 99| 96 | 93 | 95 FOG | bad 1 99 | 99| 94 | 98 | 96 
BAR | fit 2 | 100} 100} 94 | 95 | 95 DOC | bet 3 98 | 100 | 98 | 94 | 96 
Gop | ned 3 99| 99| 97 | 93 | 95 PEN | win 2 99 | 100 | 96 | 97 | 96 
CAR | sit 3 99| 99| 93 | 97 | 95 DOG | bar 1 | 100} 100} 96 | 98 | 97 
BOY | men 1 | 100/100) 95 | 95 | 95 Liz | bit 3 | 100] 100) 96 | 98 | 97 
sor | wod 1 69| 70| 94 | 95 | 95 JON | pal 2 99 | 100 | 95 | 98 | 97 
HOP | not 1 | 100) 100] 94 | 95 | 95 BEG | gin 1 100 | 100 | 96 | 98 | 97 
Gas | ban 1 | 100) 100) 94 | 95 | 95 DOT | rug 3 | 100] 100} 98 | 96 | 97 
LAZ | dey 3 68| 68| 93 | 97 | 95 BAD | sip 2 99 | 100 | 96 | 98 | 97 
NED | god 1 99| 99| 95 | 95 | 95 BUY | won 3 99| 97| 98 | 97 | 97 
WAX yn 1 | 100} 100} 95 | 95 | 95 WIN | fat 3 |100| 99| 97 | 98 | 97 
BUD 1 | 100) 100) 94 | 96 | 95 MAN | run 1 99| 99| 95 | 99 | 97 
TOY |max | 1 99| 99| 94 | 96 | 95 Low | sat 1 99| 99| 98 | 98 | 98 
SAG | lot 3 | 100} 100} 95 | 96 | 95 BIG | wed 2 99 | 100 | 98 | 99 | 98 

sik | don 3 | 100} 100) 95 | 96 | 95 
the variance. The reason(s) for this difference is unknown.

The test-retest reliability coefficients obtained by repeating 40 pairs
were high, r12 = .98 for both MAS and FAS in both AS-1 and AS-2. There
were 40 pairs common to AS-1 and AS-2; their scale values correlated
significantly (r = .98) for both MAS and FAS. Therefore, the scalings are
quite comparable and the pairs from the two scalings were combined and
are presented together with similar pairs from AS-3 in Table 3.

Table 3 contains the main results: the 320 pairs from AS-1, 280 different
pairs from AS-2, and 129³ pairs from AS-3. Table 3 is organized by
increasing mean AS for 120 males and 120 females. For each pair, the
scaling number (i.e., AS-1, AS-2, or AS-3), Archer’s (1960) AV for both
the stimulus and response, the MAS, FAS, and mean AS are shown. The AS
scores are percentages of 120 Ss in the case of MAS and FAS, and 240 Ss
for the combined mean value. For those items from AS-1 repeated in the
subsequent scalings the proportions are based on AS-1 Ss only. On 78% of
the pairs in Table 3 FAS exceeds MAS and the mean difference is 4.9%
(SD = 5.9%). Therefore, pairs where MAS scores are equal to (4%) or
exceed (18%) FAS scores are unusual and should perhaps be avoided for
use in groups mixed as to sex.

3 One pair too few was included.
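
The sex comparison reported for Table 3 (share of pairs on which FAS exceeds, equals, or falls below MAS, plus the mean and SD of the difference) can be sketched as follows. This is only an illustration: the ten rows are a legible excerpt copied from Table 3, the function name `sex_difference_summary` is hypothetical, and an excerpt this small is not expected to reproduce the 78%, 4%, 18%, or 4.9% figures, which are based on the full table.

```python
from statistics import mean, stdev

# (stimulus, response, MAS, FAS): ten legible rows copied from Table 3.
# An excerpt for illustration only, not the full set of scaled pairs.
TABLE3_EXCERPT = [
    ("KEG", "pad", 84, 83),
    ("RUT", "him", 84, 83),
    ("KIN", "lug", 85, 83),
    ("CUB", "tex", 82, 86),
    ("CUB", "jan", 78, 90),
    ("BEL", "jug", 84, 84),
    ("MUG", "let", 83, 87),
    ("RAH", "few", 82, 88),
    ("MAC", "her", 84, 85),
    ("SOW", "jet", 85, 85),
]


def sex_difference_summary(rows):
    """Summarize FAS minus MAS over (S, R, MAS, FAS) rows (hypothetical helper)."""
    diffs = [fas - mas for _s, _r, mas, fas in rows]
    n = len(diffs)
    return {
        "share_fas_higher": sum(d > 0 for d in diffs) / n,
        "share_equal": sum(d == 0 for d in diffs) / n,
        "share_mas_higher": sum(d < 0 for d in diffs) / n,
        "mean_diff": mean(diffs),
        "sd_diff": stdev(diffs),
    }


summary = sex_difference_summary(TABLE3_EXCERPT)
for name, value in summary.items():
    print(f"{name}: {value:.2f}")
```

Applied to every row of Table 3, the same three shares identify the pairs flagged above as unusual (MAS equal to or greater than FAS).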


Results AS-3 


The scoring of data for AS-3 was 
the same as that for the other scalings. 
Again, for each item an AS score was 
computed separately for males and 
females and for each S a mean AS
score was calculated for each of the
four series. These scores were en- 
tered into the Latin-square ANOVA 


TABLE 4

ASSOCIABILITY OF PAIRS OF MIXED STIMULUS AND RESPONSE AV SCALED IN AS-3,
ORDERED BY INCREASING MAS

High-Stimulus AV, Low-Response AV (HL) || Low-Stimulus AV, High-Response AV (LH)
CVC pair (S | R)  AV (S)  AV (R)  MAS  FAS || CVC pair (S | R)  AV (S)  AV (R)  MAS  FAS
POD | vuw 97 19 25 30 zuv | tad 13 97 28 35 
Pus | gob 97 17 25 35 vos | nip 19 98 | 33 38 
HEM | gig 98 15 28 30 Moy | web 13 | 100 37 43 
YEN | tij 97 13 28 43 QUW | mit 13 100 38 37 
YAP | gux 95 19 30 39 ZUH | vic 20 97 38 35 
TOD | vih 98 20 32 36 JIK gab 21 98 38 43 
NIL gez 98 17 33 39 LIJ tux 17 97 38 45 
SUB yif 98 16 33 40 soy lic 19 99 38 43 
NOB | weg 98 20 36 55 BUW | soc 21 97 39 48 
DEB | fuv 97 17 38 50 Qoz | red 19 99 39 61 
HUG | ziw 97 13 38 47 XED | jot 13 98 41 43 
MOP | gav 99 14 38 47 DAJ | soy 22 96 43 43 
MEL | vuy 97 18 39 46 FIQ vel 20 96 43 51 
LAW | qug 100 14 40 45 JEQ | sal 14 98 43 51 
caD | jig 97 13 41 47 MEJ | doz 17 98 43 S7 
DUB | keh 96 22 41 44 cuy dan 15 99 44 49 
JAW | zeg 98 15 42 48 HOJ | nix 22 97 45 38 
PEW | jiy 97 14 42 51 NAJ bid 18 99 45 48 
JAY | nuv 98 15 43 48 XEL | nay 21 98 46 44 
GIT vaf 97 22 43 45 YUQ | hot 16 | 100 46 51 
JAG qox 99 19 43 52 WUB | mon 21 97 48 48 
LAP | wux 100 18 43 35 XAV | ton 16 98 48 55 
HUT | riw 100 15 45 61 YIB now 16 99 48 55 
WED | qug 100 18 45 42 ZIQ met 17 | 100 48 53 
PAN | sij 99 14 46 56 xow | led 19 | 100 50 53 
FUN | vaw 100 16 47 55 KIW | med 10 | 100 51 51 
BOP | jic 98 20 | 48 48 zux | jap 14 99 a F 
FIB woj 99 13 48 55 QAH | guy 14 99 32 Be 
WiL | qax 97 22 48 53 ZEC | say 17 99 BS es 
JIG gec 98 16 49 49 MEF | ron 22 B a H 
PAL | yub 100 14 49 63 Qoc | fad 21 aM z Be 
SET | wij 99 13 49 60 QOK | cap aS iid Be ee 
Joy | gec 100 17 50 48 RIH | sax A a a 2 
HUB | gek 98 i v Ap BAE 56 VEQ | bug 1 BA z5 
ROW | qas 99 17 | 53 | 63 CEQ | jim 19 PERA 
HIM | guc 100 15 | 53 64 xiL | rag 48 eal Bg 
rat | niy | 100 | 22 | 54 | 59 zos | pit 13 EREE 
FIX qag 100 17 56 55 yox | mat i Y Re A 
sur | mw | 96 | 22:| 56 | 64 xiv | pun l aeaee ia 
HIS gol 98 19 58 63 yov | nab Ob oe 28 Sg 
PAM | yec 97 13 | 58 | 63 caQ | fir pul oR alone 7 
TOW | xas 97 14 58 58 Hw | led a 90 30 59 
SEW NIGP 99 en. TANO Reva ee ZENA 19 | 100 | 59 | ot 
HEY | jug 97 1% 4} R 61 YEX | dig 46 98 |C% 87 
RIP | yog | 100 | 18 | 59 | 52 coy | les alaoo diseni] am 
Ler | suj | 100 | 16 | 61 | 67 JEV | kad 
it 14 99 61 53 
Hac | yal 99 | 21 | 62 | 73 qe | yes Daler uiicie sl tet 
Day | gop | 100 21 | 63 73 woy | cal AISO een a 
Fe EA ORT AT | es al ree Meee mle E a E (acai irs 
BAG ed 99 20 64 66 LIW | rum > 
P EH | tip 13 | 100 | 63 | 6 
HIT vui 100 17 66 65 QEH 22 99 64 67 
TAX | vu 97 19 66 63 FIW | hep 16 | 100 | 64 64 
Ger | biw | 100 15 | 67 | 67 vun | den “te eel 72 
RAM || zix 100 21 69 72 XAT | peg 19 | 100 66 68 
Ray | gem | 100) 21 | 69 | 73 || QUE | he | tt | to | m1 | 66 
FIN | xes 99 14 71 68 yux | mad FN 13 66 
Se | ae | OO ore a AET E a (eet ove Se 78 
HEN | xay 98 17 | 73 | 69 Pry | hum aor nee tae 38 
MIs | kuy 97 19 73 73 WEF | sox 
with sex, subgroups of Ss, sequence of series, and the series as factors.
Significant Sex, F (1, 232) = 32.95, Subgroups, F (3, 232) = 49.70, and
Sex × Subgroup interaction, F (3, 232) = 13.37, effects were found. As had
been the case in AS-1 and AS-2, females again gave more associations
than males (means .55, .49). Although there was a trend toward
fewer associations in the later series, it was not significant,
F (3, 708) = 1.89.

Over all types of items the intercor- 
relations of MAS and FAS were gen- 
erally high, ranging between r = .74 
and .96. The test-retest reliability 
was high for the items given twice in 
this scaling, r12 = .98 and .99 for MAS
and FAS, respectively. 

The intercorrelations of AS values for the 40 pairs common to AS-1 and
AS-3 were very high for both males and females, r = .97 and .96,
respectively. The AS scores of the 40 common pairs were used in an ANOVA
for each sex separately. There was no significant difference between mean
AS value for the items in AS-1 and AS-3 for MAS, F (1, 78) = 1.32, or
FAS, F < 1. Therefore, the two scalings seem to be comparable. For
this reason items composed of CVCs of about equal AV (i.e., types H-H
and L-L) were included in Table 3 in the position determined by mean AS.

The 80 RS pairs were those scaled 
in AS-1, reversed in order, and re- 
scaled in AS-3. Correlations of the 
AS-1 and AS-3 scales for these pairs 
for both MAS and FAS are high. For
MAS the correlation was .94 and for 
FAS .93. ANOVAs comparing the 
AS-3 values against the corresponding 
AS-1 values showed for both males 
and females that the AS of a pair re- 
mained the same regardless of the 
order of the stimulus and response 
items, F (1, 159) < 1, for both FAS 
and MAS. Since the AS values are 


comparable from the two scalings, 
these pairs have been included in 
Table 3 in their proper order accord- 
ing to mean AS. 

The H-L and L-H pairs had un- 
equal stimulus and response AV. 
However, the average AV (of the 
stimulus and response) for these pairs 
was about equal for both types. 
These items were scaled for AS to 
determine whether the stimulus or re- 
sponse is more important in forming 
an association. Table 4 shows these 
pairs, their AV, MAS, and FAS values. 
ANOVA showed that although the 
mean AS value was higher for females, 
F (1, 232) = 6.31, it did not differ 
between the pair types, F <1. For 
the H-L pairs AS was significantly 
(p < .01), although not strongly, cor- 
related with stimulus AV (.39 for 
MAS, .35 for FAS). Response AV 
was not significantly related to AS 
value (.20 for MAS, .13 for FAS). 
On the other hand, for the L-H pairs 
response AV was significantly (p < .05) 
but not strongly related to AS (r = .22 
and .25 for MAS and FAS) while the 
relation between stimulus AV and AS 
was approximately zero (.08 for MAS, 
.15 for FAS). The AS score is, therefore,
more closely related to the item
of a pair higher in AV, but this rela- 
tionship does not account for a large 
part of the variance. 
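
To make the point about variance concrete, the sketch below squares the correlations reported above and also shows how such a coefficient is computed. It is an illustration under stated assumptions: `pearson_r` is a plain textbook implementation written for this example, and the ten H-L rows are an excerpt from the low-MAS end of Table 4, so the coefficient they yield is range-restricted and should not be expected to match the reported .39.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Correlations between AV and AS as reported in the text.
REPORTED_R = {
    ("H-L", "stimulus AV", "MAS"): 0.39,
    ("H-L", "stimulus AV", "FAS"): 0.35,
    ("L-H", "response AV", "MAS"): 0.22,
    ("L-H", "response AV", "FAS"): 0.25,
}

# Proportion of AS variance associated with AV is r squared.
variance_explained = {key: r ** 2 for key, r in REPORTED_R.items()}
for key, share in variance_explained.items():
    print(key, f"r^2 = {share:.2f}")

# First ten H-L rows of Table 4: (stimulus AV, MAS). Excerpt only.
HL_EXCERPT = [(97, 25), (97, 25), (98, 28), (97, 28), (95, 30),
              (98, 32), (98, 33), (98, 33), (98, 36), (97, 38)]
r_excerpt = pearson_r([av for av, _ in HL_EXCERPT],
                      [mas for _, mas in HL_EXCERPT])
print(f"r in excerpt = {r_excerpt:.2f}")
```

Even the largest reported coefficient (.39) corresponds to roughly 15% of the variance, which is the sense in which the relationship is significant but not strong.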

The RE pairs, taken from Richard- 
son and Erlebacher’s (1958) lists, 
their stimulus and response AV, their 
EL, CM, FAS, and MAS values are 
shown in Table 5. Goss and Nodine 
(1965, pp. 260-267) selected these 
items for an experiment. In their 
experiment the intent was to test 
Richardson and Erlebacher’s pro- 
posal that learning rate for PAs might 
be due to the strength of association 
between pair items rather than the 
meaningfulness of the items per se.
Pairs contained all combinations of 
TABLE 5

ITEMS PREVIOUSLY SCALED BY RICHARDSON AND ERLEBACHER (1958) AND USED IN AN
EXPERIMENT BY GOSS AND NODINE (1965), ORDERED IN TERMS OF INCREASING MAS

CVC (S) | CVC (R) | AV (S) | AV (R) | ELa | CMa | MAS | FAS

XEJ | FON 3 | 65 3.11 | 21 | 19 
VAF | QAP | 22 | 24 3.85 | 24 | 30 
LAJ | vux | 14 | 18 4| 26 | 16 
JAT | LEQ | 41 | 28 1| 28 | 30 
TEV | XEF | 40 3 28 | 27 


XEQ | GID 6 | 63 
YEG | MEC | 31 | 67 
BIP | QES | 71 | 26 
KEX | BEH | 39 | 44 
HOD | BOF | 69 | 31 
TES | J1Q | 80 | 13 
PAC | QIH | 93 6 
LAN | Kuc | 75 | 32 
GIC | wip | 19 | 86 
VAK | JEN | 37 | 88 
CIJ | DUL 8 | 91 
BES | CEH | 85 | 25 
BEP | Lis | 58 | 83 
REG | KIH | 91 | 15 
BEL | vir | 99 | 21 48 |58
Gov | NuB | 95 | 76 58 | 62 
NAV | RUQ | 72 | 24 58 | 72 


POH | SAV | 52 | 86 
VOL | DET | 82 | 69 
PAV | KOF | 76 | 71 
WIZ | SEC | 83 | 88 
TEL | cum | 99 | 92 
FEM | Hos | 90 | 81 
a Ease of learning (EL) and common meaning
(CM) values from Richardson and Erlebacher (1958).


either high or low stimulus or response 
AV and were of relatively low or high 
EL (ease of learning) or CM (com- 
mon meaning) as scaled by Richard- 
son and Erlebacher. Their results 
gave some support to Richardson and 
Erlebacher’s suggestion about the im- 
portance of the relationship between 
pairs in determining learning rate.
The ANOVA showed a strong relationship between EL and performance
but a subsequent correlational analysis, partialling out the effects of
AV, EL, and CM, revealed AV to be more important. The correlation
coefficients between the AS value and EL and CM values for the pairs
are quite high (FAS vs. EL: .83; FAS vs. CM: .83; MAS vs. EL: .81;
MAS vs. CM: .86) leading to the conclusion that
these scales measure very similar, if 
not the same, factors. Experiments 
to be described will explore the rela- 
tionship between AS and performance 
further. 


Discussion 


The major results of the scalings are a 
pool of equal AV CVC pairs with a reli- 
able difference in AS values between the 
sexes and a clearly defined relation between
the AS scale and the EL and CM scales
derived by Richardson and Erlebacher (1958).

The AS scale is reliable upon repeated 
measurements within (test-retest relia- 
bility) and between large groups of Ss. 
Although considerable variation in AS 
scores exists between pairs of given level 
of AV, there is, as expected, a significant 
relationship between AV and AS. How- 
ever, at best, this relationship accounts 
for only about 40-50% of the variance. 
This finding indicates that considerable 
differences may be observed in learning 
rate between pairs of the same AV level 
and this factor may account for differ- 
ences in interpair difficulty. 

The AS-3 scaling produced results 
somewhat at variance with data from 
learning experiments (Ekstrand, 1966). 
It was found that AS values for items re- 
versed in order from AS-1 correlated very 
highly with their AS-1 values. The AS 
measure is bidirectional for pairs of 
homogeneous AV. Since many verbal 
learning experiments have shown the 
importance of stimulus and/or response 
AV in directionality of association (Goss 
& Nodine, 1965), it is difficult to inter- 
pret the meaning of this finding for PA 
research. The result is probably due to 
the task requirements of the scaling 
where Ss had both items in front of them 
and 15 sec. in which to respond. This 
contrasts markedly with experimental 
conditions where a response must be 
retrieved from memory upon presentation
of its stimulus item within a few
seconds. 


RELATION OF AS TO PERFORMANCE


In order to demonstrate the usefulness 
of the AS measure several experiments 
were conducted. The main purpose of 
these studies was to demonstrate reliable 
differences in learning rate or in recall as 
a function of variations in AS value in- 
dependently of stimulus-response AV. 
The primary problem in implementing 
this research was the presence of the cor- 
relation between AS and AV. Orthog- 
onal manipulation of the two variables 
is not possible as the AS distributions of 
pairs at the various AV levels show only 
partial overlap. Therefore, less direct 
procedures were used to examine the re- 
lationship between AS and learning. 


EXPERIMENT I 


In this experiment four mixed lists 
varying in mean AS but of approxi- 
mately constant mean AV were con- 
structed, given to four independent 
groups to learn, and were tested after 
24 hr. 


Method 


Materials —Four 12-pair lists were con- 
structed from among items scaled in AS-1. 
Each list contained 3 pairs from each 
level of AV, but different lists were con- 
structed of items taken from either the high, 
high-middle, low-middle, or low parts of the 
FAS distribution at each level of AV. The 
lists with their mean AV and FAS values are 
shown in Table 6. The mean AV values for 
the lists were all approximately 55 while the 
mean AS values for the four lists were 43, 55, 
63, and 77. 
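
The list means quoted above can be checked against the item values of Table 6. The sketch below does this for Group 1 (lowest AS) only. It rests on two labeled assumptions: the AV of CAY-hek is read as 74 and the FAS of PAR-dab as 73, because those digits are unclear in this copy (73 agrees with the Table 3 entry for PAR-dab). The helper name `list_means` is hypothetical.

```python
from statistics import mean

# (stimulus, response, mean AV of the pair, FAS) for Group 1 of Table 6.
GROUP_1 = [
    ("QEF", "wuq", 8, 16),
    ("ZIJ", "xiy", 4, 17),
    ("XUB", "zuf", 8, 13),
    ("HIB", "zan", 43.5, 26),
    ("RAX", "yoc", 45.5, 29),
    ("MOY", "cib", 46, 35),
    ("CAY", "hek", 74, 51),    # AV read as 74: an assumption, digit unclear
    ("YUK", "fos", 67.5, 49),
    ("BM", "fey", 68, 45),     # stimulus is garbled in this copy; kept as printed
    ("PAR", "dab", 99, 73),    # FAS read as 73: an assumption, matches Table 3
    ("VAN", "pin", 100, 80),
    ("BEL", "mac", 99, 82),
]


def list_means(rows):
    """Return (mean AV, mean FAS) for a list of (S, R, AV, FAS) rows."""
    return mean(av for _s, _r, av, _f in rows), mean(f for _s, _r, _av, f in rows)


mean_av, mean_fas = list_means(GROUP_1)
print(f"Group 1: mean AV = {mean_av:.1f}, mean FAS = {mean_fas:.0f}")
```

Under these readings the list reproduces the tabled means (AV about 55.2, FAS 43), which is what the design requires: AV held roughly constant across lists while AS varies.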


TABLE 6
PAIRS USED IN EXP. I, II, AND III, THEIR MEAN AV AND FAS VALUES

Groups                    Mean AV    FAS

Group 1 (Lowest AS)
QEF wuq                      8        16
ZIJ xiy                      4        17
XUB zuf                      8        13
HIB zan                     43.5      26
RAX yoc                     45.5      29
MOY cib                     46        35
CAY hek                     74        51
YUK fos                     67.5      49
BM fey                      68        45
PAR dab                     99        73
VAN pin                    100        80
BEL mac                     99        82
M                           55.2      43

Group 2 (Low-Middle AS)
YIG luj                     10        22
XUP qy                      15        23
QEV zf                      11.5      24
DE jir                      41        43
JAH gam                     44        43
BEH peg                     44        44
DAL a                       73.5      65
TAV [illegible]             69.5      64
LIQ daf                     72.5      64
DIM tel                     99        89
PUT raw                     99        90
SOW buz                     99        90
M                           55.9      55

Group 3 (High-Middle AS)
GUQ foj                     11        30
XET qaj               [illegible]     30
YUY xog                      9        30
VAD juk                     39.5      53
TEP wix                     44.5      58
PUH wej                     42        53
PES daw                     67.5      71
NAM dup                     73.5      73
ROF lef                     67        74
RAP six                     99        93
JOB saw                    100        94
ROB kit                    100        94
M                           55.3      63

Group 4 (High AS)
VIJ xon                      9        57
ZAJ xiz               [illegible]     41
TUJ yoj                      9.5      53
YUS nis                     40.5      75
QAN goh                     40.5      78
XAM loz                     39        70
CAS pel                     75        90
RIV saf                     69        84
BOK det                     69        85
DOG bar                    100        98
TAN hid                     99        97
MAN run                     99        99
M                           54.7      77
Procedure—Four independent groups of
16 Ss each learned one of the lists by Battig’s 
(1965) modification of the recall or study-test 
method which he calls a “correction-adjusted- 
learning” procedure. On any trial, after 
every pair has been presented, the stimuli are 
each presented alone during a test trial. Then 
pairs for which S had recalled the correct re- 
sponse are dropped temporarily, incorrect 
pairs are re-presented and retested, and so on 
until all items have been correctly recalled 
once. Then another trial begins with the 
presentation of all pairs and their testing. 
The Ss learned to a criterion of 11 out of 12 
correct on an initial test when all stimuli were 
presented. Pairs were presented for 2 sec. 
during the learning and stimuli were presented 
alone for 6 sec. during testing. Different 
random orders were given during learning 
and testing for all Ss over all trials. The 
groups were run on the University of Illinois’ 
PLATO system (Bitzer, Lyman, & Easley, 
1966). Each S sat at a booth with a TV dis- 
play on which the instructions and pairs were 
presented. The S typed his response using 
one finger on a typewriter and all responses 
were automatically recorded. 
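
The logic of the correction-adjusted-learning procedure described above can be sketched as a small simulation. This is a sketch under stated assumptions, not Battig's or the authors' implementation: the learner model (`make_learner`, with recall probability rising with cumulative exposures) is entirely hypothetical, the pair labels are placeholders, and the sketch assumes the session stops as soon as the initial test of a trial reaches criterion. What it does take from the text is the trial structure: present all pairs, test all stimuli, drop correct pairs, re-present and retest the misses until every pair has been recalled once, and count one exposure per study presentation.

```python
import random


def make_learner(rate):
    """Hypothetical learner: recall probability grows with cumulative exposures."""
    def recall(exposures, rng):
        return rng.random() < 1.0 - (1.0 - rate) ** exposures
    return recall


def learn_to_criterion(pairs, recall, criterion=11, rng=None, max_trials=200):
    """Run trials until `criterion` pairs are correct on a trial's initial test.

    Returns (trials run, exposures per pair on Trial 1,
             total exposures per pair, number correct on the last initial test).
    """
    rng = rng or random.Random()
    totals = {p: 0 for p in pairs}
    trial1 = {}
    n_correct = 0
    trials = 0
    for trials in range(1, max_trials + 1):
        remaining = list(pairs)
        first_test = True
        reached = False
        while remaining:
            rng.shuffle(remaining)              # new random order each pass
            for p in remaining:
                totals[p] += 1                  # one study presentation = one exposure
            missed = [p for p in remaining if not recall(totals[p], rng)]
            if first_test:
                n_correct = len(pairs) - len(missed)
                first_test = False
                if n_correct >= criterion:      # assumed stopping rule
                    reached = True
                    break
            remaining = missed                  # correct pairs are dropped for this trial
        if trials == 1:
            trial1 = dict(totals)
        if reached:
            break
    return trials, trial1, totals, n_correct


pairs = [f"pair{i}" for i in range(12)]         # placeholder labels for a 12-pair list
trials, trial1, totals, n_ok = learn_to_criterion(
    pairs, make_learner(0.2), rng=random.Random(0))
print("trials:", trials)
print("mean exposures on Trial 1:", sum(trial1.values()) / len(pairs))
print("mean exposures to criterion:", sum(totals.values()) / len(pairs))
```

Because missed pairs are re-presented within a trial, harder pairs accumulate more exposures than easier ones in the same number of trials, which is why exposures rather than trials serve as the dependent measure.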

Following criterion attainment, Ss were 
given a questionnaire in which each stimulus 
was presented alone for S to write the re- 
sponse and means of associating the pair. 
After this a final test was given and Ss were 
dismissed for 24 hr. 

The recall session began with brief instruc- 
tions reminding them of the testing procedures 
and then five test sequences were given in 
which each stimulus was presented for 6 sec. 
Response items were never shown. After the 
fifth test sequence the questionnaire was given 
again and Ss wrote the response, if they re- 
called it, and the means they had used to asso- 
ciate the pair, if they could recall that. 


Subjects —The Ss were 64 paid volunteer 
undergraduates. Twelve females and four 
males were run in each group. Males were 
run in each group because females were not
available. 


Results 


Acquisition—When using the correction-adjusted-learning
method a
measure that most adequately reveals 
differences in rate of learning is the 
number of “exposures” to criterion 
averaged over pairs. An exposure is 
counted each time a pair is presented 


for learning. Different pairs may 
have differing numbers of exposures 
within a trial depending on how 
quickly the correct response is given 
within a trial. The number of ex- 
posures on Trial 1 and the total num- 
ber of exposures to criterion for all 
groups are presented in Table 7. 
Means are shown for each AV level 
within each list. The expected differ- 
ence in the number of exposures on 
Trial 1 or until criterion as a function 
of AS was not found by ANOVA. 
The AV, which was varied within 
each list and thereby within Ss, was 
inversely related to exposures on 
Trial 1, F (3, 180) = 108.28, and to 
criterion, F (3, 180) = 82.48. Table 
7 shows that at all levels of AV, Group 
4 has fewer exposures to criterion than 
any of the other groups. Groups 1, 2, 
and 3 overlap and do not differ. A 


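TABLE 7

AVERAGE NUMBER OF EXPOSURES PER ITEM ON TRIAL 1 AND TO CRITERION, AND
PROPORTION OF NLMs REPORTED PER ITEM IN EXP. I

                              Lists Learned by Independent Groups
AV Levels within Lists | Group 1 (43)a | Group 2 (55)a | Group 3 (63)a | Group 4 (77)a | M

Low
Trial 1                | [illegible] | [illegible] | [illegible] | [illegible] | [illegible]
To Criterion           | 19.51 | [illegible] | 15.73 | [illegible] | 15.10
Proportion of NLMs     | .06 | .21 | .29 | .33 | .22

Low-Middle
Trial 1                | 5.48 | 6.00 | 5.14 | 4.98 | 5.40
To Criterion           | 11.15 | 12.08 | 11.81 | 9.06 | 11.03
Proportion of NLMs     | .25 | .48 | .35 | .58 | .42

High-Middle
Trial 1                | 4.31 | 4.77 | 3.29 | 3.10 | 3.87
To Criterion           | 9.35 | 10.40 | 8.60 | 6.33 | 8.67
Proportion of NLMs     | .35 | .56 | .58 | .63 | .53

High
Trial 1                | [illegible] | [illegible] | [illegible] | [illegible] | [illegible]
To Criterion           | [illegible] | [illegible] | [illegible] | [illegible] | [illegible]
Proportion of NLMs     | .42 | .67 | .79 | .81 | .67

M
Trial 1                | [illegible] | [illegible] | [illegible] | [illegible] |
To Criterion           | [illegible] | 10.78 | 10.84 | 7.91 |
Proportion of NLMs     | .27 | .48 | .51 | .59 |

a Mean AS.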
post hoc comparison revealed that on
the total number of exposures Group
4 differed reliably from Group 1,
F (1, 30) = 4.36, p < .05. The
groups did not differ on the number of 
trials to criterion, number correct on 
the criterion trial, or number correct 
on the postquestionnaire test. The 
questionnaire data taken at the con- 
clusion of Session 1 were divided into 
categories to ascertain if AS affected 
verbal reports given by Ss. The 
categories employed were (a) natural 
language mediators (NLMs) defined 
as any learning method reported 
which transformed the pair into a 
word, phrase, or sentence; (b) in- 
stances where S reported he learned 
the pair by rote repetition or could re- 
port no associative device. 

The proportion of NLMs per item 
given by each group out of the total 
possible is presented in Table 7. The 
proportion of NLMs given increased 
with both AS and AV, F (3, 60) 
= 5.47; F (3, 180) = 31.67, respec- 
tively, both p's < .01. 

Recall—There was a significant decrease
from the number correct on the
criterion trial to the number correct
on the recall trial Tr, F (1, 60)
= 75.04, but the groups did not differ 
(F <1). Recall on the repeated test 
trials following Tr improved slightly 
but significantly, F (4, 240) = 7.75, 
p < .01, but again the groups did not 
differ (F < 1). 


Discussion 


The results provide only weak support 
for the validity of the AS scale. High-
and low-AS groups (4 and 1) differed in
number of exposures needed for acquisi- 
tion but the difference was not large and 
the intermediate AS groups did not align 
themselves consistently as predicted. 
The finding concerning number of NLMs 
used is also anomalous; number of NLMs 
reported increased with AS but had no 
strong effect on acquisition rate. 


The failure to find differences in recall 
among groups may be attributable to the 
use of the correction-adjusted-learning 
method. When employing a list crite- 
rion this procedure brings all items to 
about the same level of associative 
strength regardless of learning method re- 
ported (Montague & Kiess, 1966). 
Thus, associability does not seem to have 
an effect on retention. 


EXPERIMENT II 


The primary purpose of this study 
was to obtain additional data from Ss 
run under the conditions of Groups 1 
and 4 in the previous experiment. 
Since no differences in recall were 
found in Exp. I, in this study only the 
acquisition data were obtained. 


Method 


Procedure—The procedures were the same 
as the acquisition phase of Exp. I except that 
Ss learned only the high- or low-AS lists. The 
two intermediate AS groups of Exp. I were 
not run. 

Subjects —The Ss were 30 undergraduate 
females from the University of Illinois, naive 
to verbal learning experiments. Participa- 
tion in the experiment was part of a course 
requirement and Ss were not paid. Fifteen 
Ss were run in each of the two groups. 


Results 


The mean total number of exposures 
to criterion, the mean number of ex- 
posures on Trial 1, and the mean num- 
ber of NLMs per item are presented 
in Table 8. In the ANOVA, AS had 
no appreciable effect on Trial 1 or 
total number of exposures. Number 
of exposures did decrease, however, 
with increasing AV both on Trial 1 
and to criterion, F (3, 84) = 61.33, 
p < .01; F (3, 84) = 49.92, p < .01, 
respectively. On Trial 1 there was
also a significant AS × AV interaction,
F (3, 84) = 3.11, p < .05.
Number of NLMs given in the postacquisition
reports increased with both
AS and AV, F (1, 28) = 15.18,
p < .01; F (3, 84) = 10.14, p < .01,
respectively.


Discussion 


The significant difference in number 
of exposures to acquisition of low- and 
high-AS items found in Exp. I was not 
replicated in Exp. II. The trend in both 
experiments follows that predicted by the 
authors’ hypothesis but the effect is not 
very strong. But in both experiments AS
has a strong effect on number of NLMs 
given in the postacquisition test. In 
view of other experimental evidence for 
performance variations due to NLMs 
(e.g., Dallett, 1964; Kiess & Montague, 
1965; Underwood & Schulz, 1960, p. 297) 
why did we find AS effects only in re- 
ported NLMs and not in performance? 

A plausible explanation of this differ- 
ence between number of NLMs in Exp. I 
and II and rate of acquisition as a func- 
tion of AS led to the third experiment. 
It is reasonable to assume that genera- 
tion of a NLM in paired-associate learn- 
ing is not instantaneous but requires a 
certain period of time. That is, time is 
required for S to integrate the pairs into 
his established language structures 
(Mandler, 1967). It seems possible that 
the 2-sec. presentation interval employed
was not long enough for S to apply 
his language habits effectively to the 
materials to be learned on any one trial. 
It may have taken several trials for S to 
form a NLM for a pair. Consequently, 
AS would have had little opportunity to 
affect rate of acquisition but may have 
influenced the number of NLMs reported 
at criterion. If this is the case, a longer 
presentation period should permit S to
effectively apply his language skills on 
the first trial and AS should affect rate of 
learning. 


EXPERIMENT III 


The purpose of Exp. III was to in- 
vestigate the effect of a lengthened 
presentation time on the speed of acquisition
of low- and high-AS items.
Experiment III was concerned only 
with acquisition and not with recall. 


TABLE 8

AVERAGE NUMBER OF EXPOSURES PER ITEM ON TRIAL 1 AND TO CRITERION, AND
PROPORTION OF NLMs REPORTED PER ITEM IN EXP. II AND III

                                       Exp. II                          Exp. III
AV Levels within Lists | List 1 (43)a | List 4 (77)a | M     | List 1 (43)a | List 4 (77)a | M

Low
Trial 1                | 7.36  | 7.51  | 7.43  | 4.84 | 4.28 | 4.56
To Criterion           | 16.09 | 14.98 | 15.54 | 9.01 | 7.69 | 8.35
Proportion of NLMs     | .02   | .20   | .11   | .09  | .26  | .17

Low-Middle
Trial 1                | 5.73  | 5.31  | 5.52  | 3.95 | 3.28 | 3.61
To Criterion           | 13.31 | 10.42 | 11.87 | 8.21 | 6.20 | 7.20
Proportion of NLMs     | .16   | .29   | .23   | .32  | .54  | .43

High-Middle
Trial 1                | 5.22  | 3.13  | 4.18  | 3.44 | 2.32 | 2.88
To Criterion           | 11.91 | 7.42  | 9.67  | 7.00 | 4.91 | 5.95
Proportion of NLMs     | .16   | .47   | .32   | .49  | .67  | .58

High
Trial 1                | 2.67  | 2.13  | 2.40  | 1.96 | 1.42 | 1.69
To Criterion           | 7.87  | 6.33  | 7.10  | 5.04 | 3.93 | 4.48
Proportion of NLMs     | .27   | .60   | .44   | .56  | .59  | .57

M
Trial 1                | 5.25  | 4.52  |       | 3.54 | 2.82 |
To Criterion           | 12.30 | 9.79  |       | 7.31 | 5.68 |
Proportion of NLMs     | .15   | .39   |       | .36  | .51  |

a Mean AS.
Method 


Procedure.—The method was the same as 
the acquisition phase of Exp. I and II. The 
Ss were run only in the high- and low-AS 
groups and the pair presentation time was 
lengthened from 2 sec. to 5 sec. The recall 
time remained at 6 sec. 

Materials.—The items were the same as the 
low- and high-AS items of Exp. I. 

Subjects.—The Ss were 54 undergraduate 
females who took part as a course requirement 
and were not paid. Twenty-seven Ss were 
run in each of the two groups. 


Results 


The average total number of ex- 
posures to criterion, the average num- 
ber of exposures on Trial 1, and the 
average number of NLMs per item 
are presented in Table 8. ANOVA revealed 
significant effects of both AS 
and AV on Trial 1 and total number 
of exposures. Number of exposures 
decreased with increasing AS both on 
Trial 1 and to criterion, F (1, 52) 
= 9.27, p < .01; F (1, 52) = 5.17, 
p < .05, respectively. Similar effects 
were found for increasing AV, F (3, 
156) = 93.73, p < .01; F (3, 156) 
= 89.54, p < .01, respectively. The 
number of NLMs given increased with 
AS and AV, F (1, 52) = 5.74, p < .05; 
F (3, 156) = 30.00, p < .01, respec- 
tively. 


Discussion 


Although the results tend to confirm 
the hypothesis that time is an important 
factor for the generation of NLMs, the 
performance difference between groups 
learning Lists 1 and 4 is not large and is 
of about the same magnitude in Exp. II 
and III. The longer time may have 
merely produced a small change in 
variance. 

Taken together, the results of these 
three experiments show a consistent, but 
minor, effect of AS value when it is ma- 
nipulated in lists composed of a mixture 
of various levels of AS and AV. The AS 
value does not have much predictive 
power over and above variation of AV. 
It seems possible that such mixed lists 
coupled with the use of the correction- 
adjusted-learning method allow Ss to 
select learning strategies which would at- 
tenuate or mask the effects of AS. Since 
the list gets progressively shorter as items 
are learned, the last few items may be 
learned differently than the others, e.g., 
by holding them in short-term memory. 
Therefore, a more straightforward test 
of the effects of AS variation was under- 
taken in the next experiment. Both AS 
and AV values were varied within a 
single long list of pairs presented for a 
single trial. 


EXPERIMENT IV⁴ 


The purpose of this study was to 
examine the effect of AS on recall by 
partially replicating an experiment by 


4 Alexander Wearing and Clinton Walker 
executed this study. 


Montague, Adams, and Kiess (1966) 
using AS as an independent measure 
of NLM-formation probability. In 
their study, pairs of high or low mean- 
ingfulness were presented to Ss once 
for a period of 15 or 30 sec. during 
which time S wrote down his NLM 
for the pair, if he had any. One day 
later S was shown each stimulus and 
asked to recall the appropriate re- 
sponse and his NLM or other learning 
method. The reported use of NLMs 
was strongly associated with a high 
level of recall. 

In the present study CVC pairs 
were either of high or relatively low 
AV and within each level of AV there 
were two levels of AS. It was ex- 
pected that AS would be related to 
the number of NLMs used and 
thereby to the number of correct re- 
calls independently of AV. 


Method 


Materials—Sixty pairs of CVCs were se- 
lected from Table 3. Thirty of the pairs 
contained items with a mean AV of 42 (range 
38-47), while the other 30 items had a mean 
AV of 99 (range 99-100). Within each AV 
level were two levels of FAS, 15 pairs per level. 
Within low AV the FAS means were 37 (range 
28-43) and 63 (range 52-78), and within high 
AV: 78 (range 61-86) and 96 (range 95-99). 
The 60 pairs were assigned randomly to four 
blocks of 15 with the restriction that no FAS 
level occurred more than four times per block. 
Sixteen different lists were constructed by 
varying the order of the blocks so that each 
block followed each other block exactly four 
times across lists. One novel sequence of 
blocks was used to test recall for all Ss. 

Subjects.—Forty-eight paid female undergraduates 
served as Ss. 

Procedure.—Groups of 10-16 Ss were run on 
the University’s PLATO system (Bitzer et al., 
1966). The Ss were instructed that 60 pairs 
would be presented once each for 15 sec., that 
they were to learn as many as possible during 
that trial, and that their recall would be tested 
24 hr. later. A 5¢ bonus per correct recall 
was offered as an incentive to learn. 

After a practice series (with letter pairs) 
used to familiarize them with computer-con- 
trolled PA learning, the experimental list was 
presented. The Ss returned 24 hr. later and 
recall was tested at the same presentation rate. 
To reduce variability in the data due to differ- 
ences in typing skill, Ss were required to type 
with their preferred hand, hunt-and-peck 
fashion. 


Results 


Since AS and AV are correlated it 
was not possible to cross completely 
levels of both factors. Therefore, we 
used a Ss × Levels ANOVA with repeated 
measures on the number of correct 
recalls for all four AS-level means 
and planned comparisons providing 
specific tests of the effect of AS with 
AV controlled. From the lowest AS 
level to the highest the mean number 
of recalls were .94, 1.77, 3.79, and 
7.48. The overall effect was highly 
significant, F (3, 141) = 119.74, 
p < .01. The planned comparison 
between AS-level means within low 
AV was significant also, F (3, 47) 
= 4.93, p < .01, as was the comparison 
of AS-level means within high AV, 
F (3, 47) = 96.71, p < .01. A third 
orthogonal comparison provided a test 
of the interaction of AS and AV. The 
differences between the AS means for 
low and high AV were taken and tested 
(7.48-3.79) — (1.77-.94). The dif- 
ference between the AS-level means 
was significantly larger at high AV 
than at low, F (3, 47) = 29.0, 
p < .01.⁵ 
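
As a check on the interaction contrast, the arithmetic can be worked out directly from the four mean recall values reported in the text (no other data are assumed):

```latex
\[
(7.48 - 3.79) - (1.77 - 0.94) = 3.69 - 0.83 = 2.86
\]
```

That is, the AS difference at high AV (3.69 recalls) is more than four times the AS difference at low AV (0.83 recalls).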


Discussion 


Not only was recall shown to be a func- 
tion of AS value, but more importantly, 
recall varied as AS value within AV level. 
However, the significant interaction be- 


5 Recently, this experiment was replicated 
using an independent group of 61 Ss. Mean 
recall for the four AS levels was .85, 2.09, 5.82, 
and 9.21, values quite close to those obtained 
in Exp. IV. Once again ANOVA revealed 
highly significant effects of AS within AV 
levels and a significant interaction between 
AV and AS. 


tween AS and AV indicates that the re- 
lationship is not simple. The magnitude 
of the effect of AS seems to be dependent 
upon the level of AV. Other data from 
our laboratory (including Exp. II above) 
and those reported by Montague et al. 
(1966) have shown similar trends. At 
low-AV levels the effect of AS differences 
is much less than it is at high-AV levels. 
The reason(s) for this finding are un- 
known. It seems likely that it may arise 
in the need for Ss to integrate the items 
into their verbal repertoires (Mandler, 
1954). This process may involve considerable 
complexity in the form of trans- 
formations or encoding pair items before 
they can be associated. For low-AV 
items many transformations or letter 
additions, etc., are necessary to make a 
meaningful word. Within a list of such 
pairs a large set of encoding rules might 
be used by S. Therefore, for each pair 
at recall, S must remember how a re- 
sponse was encoded and how it was as- 
sociated with the stimulus. It seems 
likely that the complexity of such en- 
coding would differ only slightly between 
lists of pairs of low-AV items differing in 
AS value. On the other hand, the much 
larger effect due to AS differences among 
pairs composed of high-AV items may be 
produced by a relatively much larger 
difference in encoding complexity. For 
high-AS pairs a single (or a very few) rule 
for generating NLMs may be used re- 
ducing the need to remember a rule for 
each pair. More rules for generating 
NLMs may be needed for the low-AS 
pairs thereby raising the amount to be 
retained. Although we have no data 
bearing directly on the validity of this 
idea, recent research on encoding in free 
recall lends some support. Mueller, 
Edmonds, and Evans (1967) report that 
the number of trigrams recalled by Ss is 
inversely related to the number of en- 
coding rules used in learning. 


SUMMARY AND CONCLUSIONS 


The main objectives of the research 
were achieved. A reliable measure of 
the probability of NLM formation, AS, 
was scaled and it was found to be related 
both to learning rate and to the frequency 
of NLMs in postacquisition reports. 

In PA-learning research it has long 
been fashionable to regard pair learning 
as a complicated analogue of conditioning 
where specifiable stimuli become dis- 
criminative cues for particular responses. 
The S is generally considered to be a 
passive vehicle for the demonstration of 
interference or transfer effects due to 
manipulations of stimulus and response 
similarity or meaningfulness. However, 
the fact that S searches his past language 
associations, transforms the items when- 
ever necessary, and generates appropriate 
NLMs to link them may mask or con- 
found variables of interest. For ex- 
ample, the relatively unique NLM 
formed for letter pairs may attenuate or 
eliminate interference from extraexperi- 
mental associations (Montague & Wear- 
ing, 1967b). Similarly, NLMs formed 
for pairs in an original list may protect 
those pairs from interference from an 
interpolated list (Adams & Montague, 
1967). Perhaps the paired-associates 
task is too complicated a vehicle for the 
demonstration of simple interference and 
transfer effects. 

The present research, along with some 
discussed by Mandler (1967), has made 
a modest beginning in illuminating cer- 
tain aspects of techniques Ss use to en- 
code paired associates. Furthermore, 
although a NLM may represent S's 
unique encoding for a pair, the pres- 
ent data demonstrate that there is con- 
sistency across Ss in the likelihood of 
forming a NLM. It remains for further 
investigation to uncover variables deter- 
mining this commonality among Ss and 
to investigate the full range of complexity 
of Ss’ transformations of paired-associate 
lists. 
3 groups of Ss were run in a human eyelid conditioning experiment, 
with the independent variable being UCS intensity. The resulting data 
were evaluated against expectations from a version of Norman's 
2-phase model. Expectations were obtained by estimating best-fit 
parameters separately for each S and generating a series of Monte 
Carlo (MC) data matrices which were then compared with the 
empirical data. While there were discrepancies between model and 
data, the overall fit was good, particularly in view of the fact that the 
expectations had generally small standard errors. 


The purpose of this paper is to report 
an experiment in human classical conditioning 
and to evaluate the extent to 
which a mathematical model provides 
an adequate summary of the data. 
The experiment is a straightforward 
one: UCS intensity was varied across 
three groups of Ss. This independent 
variable was selected because it is well 
known to affect conditioning performance, 
thereby providing an opportunity 
to investigate a model across differentially 
effective treatment levels. 


1 The data for this study were gathered 
as part of the second author’s undergraduate 
honor’s thesis submitted in partial fulfillment 
of an honor’s degree in psychology at the 
Pennsylvania State University. The authors 
acknowledge the technical assistance provided 
by Louis Misantone and thank the 
consulting editor of this journal for his helpful 
review. This study was supported from 
funds provided by National Science Foundation 
Grants GB-3875 and GB-6750 to the 
senior author. 

2 An undergraduate research participant at 
Pennsylvania State University, supported by 
National Science Foundation Grant GY-975. 

The model is a mild variant of the 
two-phase model first investigated by 
Norman (1964). In the first phase 
Ss respond with a constant likelihood. 
In the second phase, after learning be- 
gins, changes in response probability 
are described by a single-operator, 
linear-operator model. The present 
equations for the two phases are: 


$$P_t = P_0, \qquad t = 1 \text{ to } K, \qquad [1]$$


$$P_t = P_\infty - (P_\infty - P_0)(1 - \theta)^{t-K}, \qquad t = K + 1 \text{ to } N, \qquad [2]$$


where $P_0$ is the initial rate of responding, 
$P_\infty$ is the asymptote, N is the 
number of trials, $\theta$ is the growth rate, 
and K is the trial on which the shift 
from the first to the second phase occurs. 
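
The two phases can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function and argument names are hypothetical, and the exponent in the second phase is taken to be t − K, which makes the model reduce to the ordinary single-operator model when K = 1.

```python
def two_phase_probabilities(p0, p_inf, theta, k, n_trials=280):
    """Response probability on each of trials 1..n_trials.

    Assumed form: P_t = p0 for t <= k, and
    P_t = p_inf - (p_inf - p0) * (1 - theta) ** (t - k) for t > k.
    """
    probs = []
    for t in range(1, n_trials + 1):
        if t <= k:
            # Phase 1: constant operant level.
            probs.append(p0)
        else:
            # Phase 2: linear-operator growth toward the asymptote.
            probs.append(p_inf - (p_inf - p0) * (1.0 - theta) ** (t - k))
    return probs
```

For example, `two_phase_probabilities(0.1, 0.8, 0.2, 10, 20)` stays at 0.1 for the first 10 trials and then rises toward 0.8.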

The differences between Equations 1 
and 2 and the model proposed by Nor- 
man (1964) are two in number. First, 
Norman assumed an asymptotic re- 
sponse probability of 1.0, an assump- 
tion quite appropriate in the verbal 
discrimination learning situation he in- 
vestigated. In human conditioning, 
however, this assumption is not tenable 
and $P_\infty$ was assumed to be a free 
parameter to be determined by the data. 
Second, Norman assumed that the shift 
from first to second phase was the re- 
sult of a probabilistic system in which 
there exists a constant likelihood, c, on 
any given trial prior to the shift that 
the shift will occur on that trial. In the 
present study it was assumed simply 
that there is a trial, K, on which the 
shift occurs, with no specification of 
the shift mechanism. The K, then, rep- 
resents a kind of threshold before which 
S responds at an operant level and after 
which there are incremental changes in 
response probability. 

The distinctive feature of this study 
is that high-speed computer techniques 
were used to obtain best-fit parameters 
separately for each S. Once obtained, 
these parameters were employed to gen- 
erate sets of Monte Carlo (MC) data 
which, in turn, served as the basis for 
comparing model and data. Particular 
emphasis was placed on comparative 
statistics sensitive to the individual dif- 
ferences which exist in human condi- 
tioning. 


METHOD 


Apparatus.—The apparatus for picking up 
and recording eyelid responses is described 
elsewhere (Prokasy, Carlton, & Higgins, 
1967; Prokasy & Whaley, 1961). 

The CS was a 500-msec. increase in bright- 
ness of a .75-in. diameter standard translucent 


red cut glass jewel. It was illuminated from 
behind by a 7.5-w. bulb. The CS source was 
mounted on a 1 X 2 ft. panel attached to the 
cubicle wall at approximately 60 in. in front 
of S at eye level. The UCS was a 50-msec. 
puff of nitrogen delivered to S’s right cornea 
500 msec. after CS onset. It was delivered 
through a 4.5-mm. diameter jet located ap- 
proximately .5 in. from S’s eyelash and was 
of sufficient pressure to support either a 
100-, 150-, or 200-mm. column of mercury, 
respectively, for the three experimental 
groups. 

Interstimulus interval, CS, and UCS dura- 
tions were controlled by Tektronix timers. 
Equipment cues were masked by a combina- 
tion of white noise and a tape recording of 
the dynograph paper drive, the combination 
being 77 db. (re .0002 dynes/cm²). 

Subjects.—The Ss were 138 volunteers 
from introductory psychology courses at the 
Pennsylvania State University who partici- 
pated in the study to fulfill requirements for 
optional course credits. 

Experimental design.—Forty-six Ss were 
assigned unsystematically to each of three 
groups of Ss. Group 100 received a UCS 
intensity of 100 mm., Group 150 received a 
UCS intensity of 150 mm., and Group 200 
received a UCS intensity of 200 mm. 

After being placed in the cubicle, S re- 
ceived a set of instructions by tape recorder 
in which he was instructed to sit quietly, not 
to try to figure out the experiment, and to 
let his reactions take care of themselves. 
Following the instructions, he received a 
series of 90 adaptation trials, these trials 
composed of a random ordering of 40 CS 
and 50 UCS trials at an average intertrial 
interval of 8 sec. The adaptation trials were 
followed immediately by 280 CS-UCS paired 
trials presented at an average intertrial 
interval of 10 sec. 

A CR was defined as a 1-mm., or greater, 
deflection of the recording pen between 148 
and 524 msec. after CS onset. A 1-mm. 
pen deflection corresponded to an eyelid 
movement of approximately .9 mm. 

When the records were scored, it was 
found that two Ss, both in Group 150, re- 
sponded during adaptation trials with likeli- 
hoods greater than .8. Also, their response 
probabilities during conditioning did not in- 
crease; in fact, they decreased as condition- 
ing progressed. This made it a practical 
impossibility to estimate parameters, so their 
records were considered no further, the re- 
sult being that the number of Ss for Group 
150 was reduced from 46 to 44. 
RESULTS 


Adaptation trials.—$P_0$ was defined 
for each S as his response probability 
to the CS during adaptation trials. 
The overall means for Groups 100, 150, 
and 200 were .090, .086, and .105, 
respectively, these means not differing 
significantly from each other, F(2, 
133) =.28. As a further check for 
stability, the data were also examined 
for any changes in probability over 
blocks of adaptation trials and across 
sequences of CS trials. No reliable 
effects were obtained.³ 

Parameter estimation.—The model 
requires an estimation of four parameters 
for each S. Since there was an 
empirical estimate of $P_0$ from the adaptation 
trials, only three remained to be 
estimated. It is to be noted that $P_0$ 
was not permitted to vary while the 
best-fit values of the other three param- 
eters were determined. 

Parameter values were obtained for 
each S separately through a maximum 
likelihood method similar to that em- 
ployed by Theios and Brelsford (1966). 
Equations 1 and 2 were written into 
a computer program, S’s $P_0$ was provided 
in the input, and then successive 
values of K and $P_\infty$ were plugged into 
the equation. $\theta$ was estimated for 
all K and $P_\infty$ pairs from: 


$$\theta = (P_\infty - P_0)/(N P_\infty - S), \qquad [3]$$


where N is the total number of trials 
between K + 1 and 280, and S is the 
total number of CRs in that trial range. 
Test values of K were permitted to 


3 Though not of direct relevance for this 
study, it should be noted that overall response 
probability to the CS during adaptation trials 
(.105) was reliably less than that measured 
in a comparable period just before UCS 
occurrence (.174), F (1, 135) = 59.07. It 
would appear that S’s orienting reaction to 
the CS results in a substantial reduction from 
the average spontaneous rate of slightly less 
than one blink every 2 sec. 
vary, in steps of 1, from Trial 1 to a 
trial number predetermined to be 
clearly in a range of trials in which 
response probability was greater than 
$P_0$. Test values of $P_\infty$ were permitted 
to vary from .03 below the probability 
obtained in the last 100 conditioning 
trials to 1.0 in steps of .01. All possible 
pairs of K and $P_\infty$ in that two-dimensional 
array were examined. 

Once the test values were plugged 
into Equations 1 and 2, the computer, 
a UNIVAC 1108, calculated response 
probability for each of the 280 trials. 
For any given S, this provided a se- 
quence of assumed probabilities to be 
compared with the empirical data. A 
likelihood for S’s 280-trial protocol for 
each set of estimated parameters was 
obtained by multiplying together, across 
the 280 trials, the likelihoods of a re- 
sponse occurring, when a response oc- 
curred, and the likelihoods of a response 
not occurring when one failed to occur. 
Thus, the likelihood function for a 
single S is 


$$L = \prod_{i=1}^{280} P_i^{X_i} (1 - P_i)^{1 - X_i},$$


where X; equals 1 if S makes a CR on 
Trial i and is zero otherwise. The 
best-fit parameters were defined as that 
combination which yielded the highest 
probability for S’s data sequence. 
Group likelihoods were obtained by 
multiplying across Ss the maximum 
likelihoods obtained for each S in the 
group. 
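
The search described above can be sketched as follows. This is a minimal illustration, not the authors' program: all names are hypothetical, the second-phase exponent is assumed to be t − K, the estimator for θ is assumed to be (P∞ − P0)/(N·P∞ − S), and log-likelihoods are summed instead of multiplying raw likelihoods.

```python
import math


def two_phase_probabilities(p0, p_inf, theta, k, n_trials):
    # Assumed model: constant p0 through trial k, then linear-operator growth.
    return [p0 if t <= k else p_inf - (p_inf - p0) * (1.0 - theta) ** (t - k)
            for t in range(1, n_trials + 1)]


def log_likelihood(responses, probs):
    # Sum of log P_i when a CR occurred and log(1 - P_i) when it did not.
    eps = 1e-12
    total = 0.0
    for x, p in zip(responses, probs):
        p = min(max(p, eps), 1.0 - eps)  # guard against log(0)
        total += math.log(p) if x else math.log(1.0 - p)
    return total


def estimate_theta(responses, p0, p_inf, k):
    # Assumed estimator: theta = (p_inf - p0) / (N * p_inf - S),
    # with N trials and S CRs between trial k + 1 and the last trial.
    n = len(responses) - k
    s = sum(responses[k:])
    denom = n * p_inf - s
    if denom <= 0:
        return None
    theta = (p_inf - p0) / denom
    return theta if 0.0 < theta <= 1.0 else None


def fit_subject(responses, p0, k_max, p_inf_grid):
    """Return (log-likelihood, K, P_inf, theta) for the best grid point."""
    best = None
    for k in range(1, k_max + 1):
        for p_inf in p_inf_grid:
            theta = estimate_theta(responses, p0, p_inf, k)
            if theta is None:
                continue  # skip parameter pairs that give an invalid theta
            probs = two_phase_probabilities(p0, p_inf, theta, k, len(responses))
            ll = log_likelihood(responses, probs)
            if best is None or ll > best[0]:
                best = (ll, k, p_inf, theta)
    return best
```

A group log-likelihood would then be the sum of the per-S maxima, which corresponds to multiplying the maximum likelihoods across Ss.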

The means and standard deviations 
of the best-fit parameters retained for 
analysis are provided in Table 1. Also 
included is the probability of each data 
matrix, given the best-fit parameters. 
Such probabilities are small, of course, 
simply because a set of 280 numbers 
between 0 and 1 were multiplied to- 
gether for each S and then multiplied 
across Ss. 
TABLE 1 
MEANS AND SDs OF BEST-FIT PARAMETERS 

Group |    | $P_0$ |   K   | $\theta$ | $P_\infty$ | Probability of data matrix 
100   | M  | .090  | 25.67 |  .161    |  .687      | 10^[exponent illegible] 
      | SD | .110  | 38.45 |  .217    |  .288      | 
150   | M  | .086  | 10.64 |  .196    |  .742      | 10^[exponent illegible] 
      | SD | .146  | 16.49 |  .248    |  .254      | 
200   | M  | .105  | 10.00 |  .170    |  .840      | 10^[exponent illegible] 
      | SD | .128  | 13.84 |  .217    |  .168      | 


Random number generator.—Because 
individual differences were permitted 
in obtaining the parameters, 
and because the population distribution 
of parameter values is unknown, there 
is no practical way short of MC meth- 
ods to assess the goodness of fit between 
model and data. The MC methods, to 
be of value, rely on a random number 
generator which provides a reasonable 
approximation to randomness. The 
generator employed in this study is de- 
scribed by Barnett (1962). It yields a 
rectilinear distribution of values from 
0 to 1, but, as is the case with most 
generators, it was found to contain sequential 
dependencies in which the 
value of the number on Trial T + 1 was 
a function of the value on Trial T. 
Since the effect seemed to be confined 
largely to adjacent trials, an analysis of 
dependencies between alternate num- 
bers was made. There was a mild, 
though not significant, bias toward low 
numbers being followed by low num- 
bers and high numbers by high when 
12,000 alternate numbers were drawn. 
This was deemed close enough for 
present purposes, and only every other 


TABLE 2 
OBSERVED AND EXPECTED MEANS AND SEs OF EXPECTATIONS 
OVER BLOCKS OF 10 TRIALS 


Obtained 

Group | Blocks | Means 
100   | 1-14   | 200* 328 422 420* 496 570 576 557 570 591 628 663 630 672 
100   | 15-28  | 646 650 663 659 641 624* 635 663 659 650 650 707 665 689 
150   | 1-14   | 257* 427 489 545 566 598 607 645 664 691 670 748* 711 730 
150   | 15-28  | 757 711 691* 720 732 752 732 720 736 700* 698 718 718 711 
200   | 1-14   | 304* 478 550 609 678 689 711 743* 737* 785 830 839* 776 813 
200   | 15-28  | 783 763* 804 802 830 785* 787 841 841 837 837 817 839 833 


Expected 

Group | Blocks | Means 
100   | 1-14   | 163 305 409 470 512 547 571 585 598 606 618 624 634 645 
100   | 15-28  | 652 652 661 665 673 671 672 678 676 677 679 675 684 682 
150   | 1-14   | 225 399 480 550 583 621 640 662 676 691 703 704 712 713 
150   | 15-28  | 725 719 728 724 729 726 730 735 732 737 731 737 734 733 
200   | 1-14   | 263 455 534 619 669 [illegible] 734 761 773 785 798 799 801 801 
200   | 15-28  | 808 816 822 828 827 832 822 825 834 833 824 833 832 833 


SE of Expectation 


100 018 | 018 | 021 020 | 016 
014 | 018 | 018 | 015 019 oie 


150 012 | 018 021 ois 
022 017 017 016 O19 020 


200 015 013 016 | 019 | o1; 
019 016 | 017 015 012 ois 


020 ois 021 018 016 021 018 022 
024 017 018 014 019 019 019 016 


018 015 014 i 019 019 015 019 016 
017 018 019 017 017 014 017 016 


021 | 017 | o15 | 014 | 017 | 017 | 018 | 016 
020 | 014 | 016 | 018 | 015 | 015 | 017 | 013 


Note.—Decimal points omitted. 
* Departs from expectation by greater than 2 SEs. 
number selected by the generator was 
used to obtain the MC data. 
Comparative statistics—The best-fit 
parameters were employed to determine 
trial-by-trial response probabilities for 
each S, and then the random number 
generator was used in conjunction with 
these probabilities to determine for each 
trial whether or not a response “oc- 
curred.” If, e.g., the response proba- 
bility yielded by Equations 1 and 2 with 
the best-fit parameters was .375 on 
Trial 18, then a random number of 
.375 or less, if drawn, would mean that 
S gave a response on that trial. If the 
random number were greater than .375, 
then no response would be recorded. 
In this manner a MC data matrix was 
generated for all three groups, and the 
procedure was repeated until 25 sets of 
data had been obtained. The reason 
that 25 matrices were obtained for each 
group is that the authors wanted to ob- 
tain an expectation and a standard 
error of expectation for a variety of 
statistics. 
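
The generation step can be sketched as below. The names are hypothetical, and Python's built-in generator is substituted for the Barnett (1962) generator used in the study; the sketch keeps the two procedural details given in the text, namely that only every other random number is used and that a response is recorded when the number drawn is less than or equal to the model probability.

```python
import random


def simulate_subject(probs, rng):
    """Generate one S's simulated 0/1 response protocol from model probabilities."""
    responses = []
    for p in probs:
        rng.random()      # discard one draw so only alternate numbers are used
        u = rng.random()  # the number actually compared with the probability
        responses.append(1 if u <= p else 0)
    return responses


def simulate_matrices(prob_rows, n_sets=25, seed=0):
    """Return n_sets MC data matrices, one row of responses per S."""
    rng = random.Random(seed)
    return [[simulate_subject(probs, rng) for probs in prob_rows]
            for _ in range(n_sets)]
```

Each element of the returned list plays the role of one MC data matrix, from which trial-block means, SDs, and the other comparative statistics can be computed.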

The trials were blocked into 28 
blocks of 10 trials each and averaged 
over the 25 MC sets. Table 2 includes 
the empirical data, the expectations 
from MC runs, and standard errors 
(SEs) of the 25 MC means for each 
trial block. A total of 13 data points, 
out of 84, departed more than 2 SEs 
from expectation. Two additional 
facts are worth noting. First, the only 
systematic departure is that the initial 
data point in all three groups is reliably 
above expectation. Second, the vari- 
ance in the system is quite small, with 


TABLE 3 


OBTAINED AND EXPECTED BETWEEN-Ss SDs AND SEs OF EXPECTATIONS 
OVER BLOCKS OF 10 TRIALS 


Group Obtained 
100 | 227| 318° | 322 | 330 | 363 | 351 | 369 | 366 | 366 | 355 | 349 | 323 | 352 | 351 
337 | 329 | 320 | 319 | 347 | 350» | 317 | 315 | 319 | 363" | 319 | 309 | 321 | 290 
150 | 256 | 329 s | 332 | 329 | 310 | 314 | 317 | 324 | 259 | 309 | 208 | 294 | 307 
256 | 323 | 338 | 334 | 200 | 273 | 318°] 277 | 307 | 310 | 298 | 276 | 298 | 303 
200 5 39 | 307 | 288 | 295 | 299 | 294%] 220 | 207 | 203 | 227 | 264 
261 | 246 23 $33 | 199 | 242 | 232 | 244 | 228 | 225 | 195 | 212 | 228 | 215 
Expected 
1 : 349 | 354 | 352 | 347 | 350 | 350 | 350 | 343 | 339 | 331 
°° | 330 | 324 | 327 37 | 317 | 31s | 310 | 318 | 312 | 316 | 318 | 317 | 318 | 317 
7 | 286 | 285 
150 | 262 | 3. 333 | 313 | 307 | 301 | 302 | 290 | 287 | 291 | 288 | 28 
362 | 337 | 333 | 300 | 283 | 283 | 283 | 283 | 282 | 283 | 281 | 283 | 280 | 283 
220 
200 313 | 202 | 275 | 258 | 241 | 243 | 233 | 231 | 224 
2i 2 306 322 | 303 | 211 | 205 | 205 | 207 | 202 | 204 | 205 | 199 | 203 
SE of Expectation 
18 | 017 
10 o12 | o14 | o11 | 013 | 013 | O11 | 013 | 0 
5 oie | ois | O13 O12 O12 | ote | o12 | ota | 012 | 016 | 013 | 013 | O14 | 012 
ois | 016 
a 1 | os | ois | o17 | o15 | 012 | O13 | 012 
5 ois | Old O12 Ot ots | o12 | 013 | o15 | o17 | 012 | O15 | ors | O14 | ors 
019 
013 | 024 | 022 | o18 | 014 | 017 | O19 | 022 
ae 020 | oat | 017 019 ors | Gis | 022 | o16 | 023 | 020 | O15 | ois | 017 | 019 


Note.—Decimal points omitted. 
a Departs from expectation by greater than 2 SEs. 
TABLE 4 
EXPECTED AND OBTAINED MEANS AND SDs OF SELECTED STATISTICS 

                               |            Ms                |            SDs 
                               | Obtained | Expected |   SE   | Obtained | Expected |   SE 
Group 100 
  Trial Ms                     |  .590    |  .596    | .0034  |  .118    |  .126    | .0040 
  Trial SDs                    |  .332ᶜ   |  .324    | .0035  |  .029    |  .035    | .0035 
  Trial of First Response      | 16.283   | 13.568   | 1.5988 | 24.834   | 20.212   | 4.2982 
  P(R_{n+1} | \bar{R}_n)ᵃ      |  .458    |  .470    | .0079  |  .234    |  .222    | .0075 
  P(R_{n+1} | R_n)ᵇ            |  .661ᶜ   |  .645    | .0063  |  .268ᶜ   |  .287    | .0073 
  P(R_{n+2} | \bar{R}_n)       |  .485    |  .488    | .0079  |  .248    |  .235    | .0083 
  P(R_{n+2} | R_n)             |  .649    |  .642    | .0052  |  .275ᶜ   |  .289    | .0062 
Group 150 
  Trial Ms                     |  .659    |  .664    | .0028  |  .114ᶜ   |  .121    | .0029 
  Trial SDs                    |  .301ᶜ   |  .290    | .0034  |  .022    |  .021    | .0025 
  Trial of First Response      |  8.295   | 10.819   | 1.2669 |  7.697   | 11.359   | 2.5284 
  P(R_{n+1} | \bar{R}_n)       |  .543    |  .546    | .0098  |  .229    |  .221    | .0091 
  P(R_{n+1} | R_n)             |  .709    |  .705    | .0044  |  .244    |  .251    | .0057 
  P(R_{n+2} | \bar{R}_n)       |  .545ᶜ   |  .566    | .0078  |  .232    |  .230    | .0072 
  P(R_{n+2} | R_n)             | [illegible]ᶜ | .701 | .0047  |  .234ᶜ   |  .253    | .0063 
Group 200 
  Trial Ms                     |  .747    |  .752    | .0037  |  .126ᶜ   |  .136    | .0030 
  Trial SDs                    |  .253ᶜ   |  .240    | .0037  |  .047    |  .051    | .0022 
  Trial of First Response      |  9.239   | 11.371   | 1.1328 | 10.277ᶜ  | 13.136   | 1.4171 
  P(R_{n+1} | \bar{R}_n)       |  .570    |  .591    | .0111  |  .218    |  .202    | .0087 
  P(R_{n+1} | R_n)             |  .812ᶜ   |  .800    | .0045  |  .166ᶜ   |  .179    | .0061 
  P(R_{n+2} | \bar{R}_n)       |  .608    |  .613    | .0102  |  .224ᶜ   |  .208    | .0078 
  P(R_{n+2} | R_n)             |  .802    |  .797    | .0041  |  .170    |  .183    | .0071 


ᵃ Read “probability of response on Trial n + 1 given no response on Trial n.” 
ᵇ Read “probability of response on Trial n + 1 given a response on Trial n.” 
ᶜ Departs from expectation by more than 2 SEs. 


the SEs of the MC estimates being .024 
or less. 

Since Ss were not assumed to be 
homogeneous, another test of the ade- 
quacy of the model is to examine indi- 
vidual differences over trial blocks. 
Table 3 contains the between-Ss stand- 
ard deviations (SD), the average of 
the SDs obtained from the 25 sets of 
MC data, and the SE of the expecta- 
tions. Of the 84 comparisons, a total 
of only 8 departed by more than 2 
SEs from expectation, and there ap- 
pears to be no particular trend in these 
departures. As with the means, the 
model predicts standard deviations 
with relatively high precision: the SEs 
are no greater than .024. 
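
The block-by-block comparison can be sketched as follows. The names are hypothetical, and the SE of expectation is assumed here to be the standard deviation of a block statistic across the MC sets; the text does not spell out the exact formula.

```python
import statistics


def block_means(matrix, block_size=10):
    """Mean response probability per trial block, pooled over Ss (rows)."""
    n_trials = len(matrix[0])
    means = []
    for start in range(0, n_trials, block_size):
        total = sum(sum(row[start:start + block_size]) for row in matrix)
        means.append(total / (len(matrix) * block_size))
    return means


def expectation_and_se(mc_matrices, block_size=10):
    """Expectation and SE for each block across the MC data matrices."""
    per_set = [block_means(m, block_size) for m in mc_matrices]
    expected, se = [], []
    for values in zip(*per_set):
        expected.append(statistics.mean(values))
        se.append(statistics.stdev(values))  # assumed definition of the SE
    return expected, se


def flag_departures(observed, expected, se, criterion=2.0):
    """True wherever the observed value departs by more than criterion SEs."""
    return [abs(o - e) > criterion * s
            for o, e, s in zip(observed, expected, se)]
```

The same pattern applies to the between-Ss SDs: replace `block_means` with a function that returns the SD across Ss for each block.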


Seven additional statistics are com- 
pared in Table 4. For each compari- 
son, the data mean, the mean of 25 MC 
sets, and the SE of the MC sets are 
provided. In no instance is the numer- 
ical difference between data and expec- 
tation large, although 15 of the 42 data 
points depart by more than 2 SEs from 
expectation. The only systematic de- 
parture was the model’s underestima- 
tion of the overall means of the trial 
SDs. 
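
Two of the per-S statistics compared in Table 4 can be computed as sketched below. The function names are hypothetical; `lag=1` corresponds to the Trial n + 1 probabilities and `lag=2` to the Trial n + 2 probabilities.

```python
def conditional_response_probs(responses, lag=1):
    """Return (P(R on n+lag | R on n), P(R on n+lag | no R on n)) for one S.

    Either value is None when the conditioning event never occurs.
    """
    after_r = after_no = n_r = n_no = 0
    for i in range(len(responses) - lag):
        if responses[i]:
            n_r += 1
            after_r += responses[i + lag]
        else:
            n_no += 1
            after_no += responses[i + lag]
    given_r = after_r / n_r if n_r else None
    given_no = after_no / n_no if n_no else None
    return given_r, given_no


def trial_of_first_response(responses):
    """1-based trial number of the first CR, or None if no CR occurred."""
    for trial, x in enumerate(responses, start=1):
        if x:
            return trial
    return None
```

Applying these functions to each S in the empirical matrix and in each MC matrix gives the values whose means and SDs are compared.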

A further assessment of the model is 
provided in Table 5. This table in- 
cludes the product-moment correlations 
between the expected and obtained val- 
ues of the numbers contributing to the 
means of Table 4. The correlations 
TABLE 5 
OBTAINED AND EXPECTED CORRELATIONS BETWEEN SELECTED STATISTICS 

                          |        Group 100          |        Group 150          |        Group 200 
                          | Obtained | Expected | SE   | Obtained | Expected | SE   | Obtained | Expected | SE 
Trial Ms                  |   971    |   978    | 0070 |   972    |   975    | 0080 |   976    |   982    | 0046 
Trial SDs                 |   771    |   838    | 0488 |   514    |   510    | 1283 |   827    |   875    | 0469 
Trial of First Response   |   570    |   635    | 1415 |   441    |   474    | 2020 |   743    |   734    | 1006 
P(R_{n+1} | \bar{R}_n)    |   927    |   928    | 0182 |   898    |   908    | 0306 |   894    |   850    | 0464 
P(R_{n+1} | R_n)          |   980    |   962    | 0150 |   986    |   970    | 0096 |   981    |   973    | 0071 
P(R_{n+2} | \bar{R}_n)    |   938    |   941    | 0168 |   943    |   930    | 0174 |   906    |   876    | 0353 
P(R_{n+2} | R_n)          |   985    |   976    | 0102 |   988    |   981    | 0053 |   980    |   976    | 0051 

Note.—Decimal points omitted. 

obtained with trial block means and 
SDs have 28 pairs of numbers (i.e., the 
number of trial blocks), while the correlations 
obtained with the remaining 
five statistics have the number of Ss 
in a group as the number of pairs. The 
column labeled “Expected” refers to 
the correlations obtained when 25 pairs 
of MC data sets were analyzed. These 
values, rather than 1, constitute the 
expectations for the correlations since 
the probabilistic system being investigated 
will always have a certain amount 
of variance that cannot be accounted 
for. The bulk of the individual differences 
is predicted by the model. Eleven 
of the obtained correlations are higher 
than their corresponding expected values, 
an outcome which is possible in 
view of the fact that the predicted value 
is an expectation rather than a deterministic 
solution. 

Additional comparisons.—Because 
parameters were estimated separately 
for each S, it was possible to obtain 
intercorrelations among the best-fit 
parameters for the three groups. These 
are provided in Table 6. Only 2 of 
the 18 correlations differ significantly 
from zero, and 1 of these accounts for 
only 11% of the variance. Thus, there 
is no systematic departure from zero 
correlation across the three groups, the 
net result being little redundancy 
among the parameters. 

A comparison was made among the means of the best-fit parameters across
groups (see Table 1). As summarized earlier, there were no
between-group differences in P0. The Duncan range test was used with a
rejection region of .05 to analyze the differences in the other three
group parameter means as a function of UCS intensity. No differences
were found in θ. Two clusters emerged from the three Pa's: the
asymptotes of Groups 100 and 150 formed one cluster and those of Groups
150 and 200 formed another cluster. This outcome is consistent with the
empirical observation that asymptotic performance increases with UCS
intensity. Two clusters emerged from the three Ks. One contained the
values from Groups 200 and 150, while the other was defined by Group
100, i.e., K decreased from Group 100 to Group 150, with no additional
decrease from Group 150 to Group 200.


TABLE 6
INTERCORRELATIONS AMONG PARAMETERS

Group         K         θ         Pa

100
  P0       -.116      .161      .210
  K                   .135     -.519*
  θ                            -.021
150
  P0        .097      .041      .245
  K                   .084     -.237
  θ                            -.107
200
  P0       -.327*     .176      .258
  K                  -.206     -.019
  θ                             .051

* Differs significantly from 0 (p < .05).

Since the single-operator, linear-op- 
erator model is a special case or a sub- 
model of the two-phase model, i.e., they 
are the same when K = 1, it is possible 
to determine with likelihood ratio tests 
(Theios & Brelsford, 1966) whether 
the two-phase model is accounting for 
the data reliably better than is the sim- 
ple linear-operator model. Best-fit 
parameter values for θ and Pa were ob-
tained with P0 given from the adapta-
tion data and K held constant at 1 for
all Ss. The group maximum likeli- 
hoods with these best-fit parameters 
were 107489, 107388, and 10-2338, respec- 
tively, for Groups 100, 150, and 200. 
Dividing these likelihoods by the rele- 
vant likelihoods for the two-phase 
model given in Table 1, the obtained 
likelihood ratios are 10^-202, 10^-36, and
10^-92. Under the hypothesis that the
two-phase model is doing no better 
than the simple linear-operator model, 
—2 times the log to the base e of the 
likelihood ratios is distributed approxi- 
mately as chi-square with the number 
of Ss in each group as the degrees of 
freedom. The three chi-squares for, 
respectively, Groups 100, 150, and 200 
are 930, 166, and 424, all significant 
beyond the .001 level. From this it may 
be concluded that the two-phase model, 
in fact, accounts for the data reliably 
better than does the single-operator, 
linear-operator model. The substan- 
tially higher value for Group 100 re- 
flects that group’s larger mean K. 
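
The step from a likelihood ratio to the chi-square statistic can be checked numerically. The following is a minimal sketch, not the authors' computation. It assumes the three likelihood ratios are 10^-202, 10^-36, and 10^-92 (a reading of the printed exponents that reproduces the reported chi-squares of 930, 166, and 424); the function and constant names are illustrative only.

```python
import math


def chi_square_from_log10_ratio(log10_ratio: float) -> float:
    """Return -2 * ln(likelihood ratio), given log10 of that ratio."""
    return -2.0 * math.log(10.0) * log10_ratio


# Assumed log10 likelihood ratios (linear-operator model / two-phase model).
ASSUMED_LOG10_RATIOS = {"Group 100": -202, "Group 150": -36, "Group 200": -92}

for group, exponent in ASSUMED_LOG10_RATIOS.items():
    # Rounded to the nearest integer for comparison with the reported values.
    print(group, round(chi_square_from_log10_ratio(exponent)))
```

Working from the base-10 exponent avoids forming the likelihoods themselves, which are far too small to represent as ordinary floating-point numbers.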

The final comparison involved parameter recovery from MC data. If the
parameter search technique is successful, and if the generation of MC
data from the best-fit parameters is satisfactory, it should be
possible to recover the original best-fit parameters from the MC data.
To this end, one set of MC data from each of the three experimental
groups was subjected to the parameter search described earlier. While
exact recovery of parameters cannot be expected for individual Ss, the
averages for each group should approximate the averages provided in
Table 1. The means and standard deviations of the recovered parameters
are shown in Table 7. Since P0's were empirically fixed, they are not
included. The means of the three recovered parameters fall within the
.95 confidence limits of the means in Table 1, thus indicating
satisfactory search and recovery techniques.


TABLE 7
MEANS AND SDs OF PARAMETERS RECOVERED FROM ONE SET OF MONTE CARLO (MC) DATA

Group         K         θ         Pa

100
  M         26.80     .182      .692
  SD        40.30     .250      .294
150
  M         12.36     .203      .742
  SD        16.30     .256      .258
200
  M         11.54     .224      .845
  SD        13.93     .301       77


Discussion 


The major purpose of this study was to 
determine whether or not the two-phase 
model provides a reasonable description 
of human classical conditioning perform- 
ance. There were two systematic de- 
partures from expectation. The model 
slightly underestimated between-Ss vari- 
ability and underestimated mean response 
probability on the initial block of trials. 
An explanation of these departures is not 
readily available, though one possibility 
for the latter is that the switch from un- 
paired to paired trials slightly increases 
the base rate of responding to the CS;
in other words, response rate to the CS
during adaptation trials may not be an
unbiased estimate of P0.

Those two departures notwithstanding, 
the overall fit of model to data is suffi- 
ciently good to warrant its further de- 
velopment and application. None of the 
departures from expectation was numer- 
ically large, the significant deviations re- 
flecting, in general, a high degree of pre- 
cision. Since the model has been applied 
successfully to a verbal discrimination 
learning situation (Norman, 1964), there 
is good reason to investigate its appro- 
priateness in other learning situations in 
order to determine what generality it has 
for human and animal learning. 

Since the Hull-Spence theory has been 
applied to human conditioning studies, it 
may be useful to explore briefly two mat- 
ters of relevance both to that theory and 
to the two-phase model. First, according 
to Spence, Haggard, and Ross (1958), 
an increase in UCS intensity should re- 
sult in an increase in both H and D, the 
net result of such manipulations being, 
for two groups differing in UCS in- 
tensity, a continued divergence between 
groups across trial blocks to asymptote. 
The present experiment and the two-phase 
model provide a somewhat different in- 
terpretation of UCS intensity. Groups 
100 and 150 differed in that the latter had 
a higher mean Pa and a lower K. Groups
150 and 200 differed only in that the latter
had a higher mean Pa. It appears, more-
over, that neither θ nor P0 is a function
of UCS intensity. This means that gross
differences in learning curves are a sim-
ple function of Pa at higher intensities
and are a joint function of K and Pa at
lower intensities. The model, therefore, 
does not provide a uniform prediction of 
effects with changes in UCS intensities: 
At higher intensities different groups 
would show continued divergence across 
trial blocks because they differ only in 
Pa, while at lower intensities different 
groups would show an initial divergence 
followed by some convergence due to the 
additional influence of K. Not only are 
the present theoretical curves (Table 2) 
consistent with this interpretation, but in a
review of past research, Prokasy (1967)
concluded that there was an initial di- 
vergence followed by convergence, the 
reviewed studies incorporating compar- 
isons between the effects of low UCS 
intensities. 

The second point in common to the 
Hull-Spence theory and the two-phase
model is a threshold. In the Hull- 
Spence theory, if H is initially below 
threshold some number of reinforcements 
is required before an increase in per- 
formance is observed. Spence (e.g., 1956, 
p. 61, ff.) points out that it is not uncom- 
mon for the acquisition curves in human 
eyelid conditioning to be S-shaped, i.e., an 
initially slow rate of rise followed by a 
more rapid one, when Ss are homo- 
geneously classed. The two-phase model 
yields the identical prediction, the ideal- 
ized form being an initial constant rate of 
responding followed by an increase de- 
scribed by the linear-operator model. If 
Ss were selected to be homogeneous in K, 
the curve anticipated by Spence would 
be observed. 
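
The idealized two-phase curve described above can be sketched numerically. This is an illustration under stated assumptions, not the authors' fitting program: it assumes response probability stays at P0 through trial K and thereafter moves toward the asymptote Pa by the linear operator p <- p + θ(Pa - p) on every trial. The exact rule for applying the operator is not given in this section, and the parameter values in the example are hypothetical.

```python
def two_phase_curve(p0, pa, theta, k, n_trials):
    """Expected response probability on trials 1..n_trials.

    Assumption: probability equals p0 on trials 1..k (phase 1); on each
    later trial it moves toward pa by the linear operator
    p <- p + theta * (pa - p) (phase 2). With k = 1 this reduces to the
    single-operator curve that starts at p0 on trial 1.
    """
    probs = []
    p = p0
    for trial in range(1, n_trials + 1):
        if trial > k:
            p = p + theta * (pa - p)
        probs.append(p)
    return probs


# Hypothetical parameter values, chosen only to make the shape visible.
curve = two_phase_curve(p0=0.1, pa=0.9, theta=0.5, k=3, n_trials=5)
print(curve)
```

For a single S the curve is flat and then negatively accelerated; averaging such curves over Ss with similar K values gives the initially slow, then more rapid, rise discussed above.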

It would be premature to speculate on 
the “meaning” of the model’s parameters. 
Should the model continue to provide a 
satisfactory description of human eyelid 
conditioning data over a variety of ex- 
perimental situations, the “meaning” of 
the parameters will emerge in the context 
of independent variable manipulation. 
Similarly, the continued successful ap- 
plication of the model should provide a 
framework from which the role of such 
variables as interstimulus interval, CS in- 
tensity, intertrial interval, and instruc- 
tional sets in classical conditioning can 
be clarified. One attractive possibility is 
that the use of a threshold, i.e., a shift 
between phases, will provide at least a 
partial solution to the problem of discon- 
tinuities in the conditioning process (see, 
e.g., Grant, Levy, Thompson, Hickok, &
Bunde, 1967). 

In any event, the relative success of the 
two-phase model in this study should not 
obscure the fact that there remains a 
variety of information in human eyelid 
conditioning which it is not designed to 
address. For example, response prob- 
ability on Trial N is known to be a func-
tion of the amount of time separating
Trials N-1 and N (Prokasy, 1965). 
Similarly, the latency changes over trials 
as well as across sequences of intertrial 
intervals (Prokasy, 1965) make clear that 
the model is incomplete. 

The use of MC techniques in conjunc- 
tion with the estimation of parameters on 
an individual-S basis merits further com- 
ment. This procedure avoids several ma- 
jor obstacles encountered with most other 
techniques for evaluating how well a 
model describes a set of data. First, it 
allows for individual differences, one of 
the facts of most experiments, but a fact 
frequently ignored in assessment. 

Second, it makes use of individual dif- 
ference statistics to help evaluate how well 
the model works. Data matrices are 
generated by MC techniques from which 
a variety of statistical comparisons can 
be made, and the extent to which the 
model can account for variability intro- 
duced by individual differences constitutes 
an index of how well it works. 

Finally, extremely precise predictions 
can be made, the data from Table 4 illus- 
trating that probability differences of less 
than .01 can reliably distinguish obtained 
and expected values. This serves to 
emphasize the utility of calculating stand- 
ard errors of expectation, which are 
typically difficult to determine analytically 
even when Ss are assumed to be homo- 
geneous with respect to parameters. 

There are cautions which should be ob-
served. One is that it is necessary to
have a random number generator which 
provides a reasonable approximation to 
randomness. The property that can be 
most damaging is dependence between the 
values of successive numbers, and this 
should be investigated before any gener- 
ator is employed. A second is that ob- 
taining individual-S parameters requires 
that the learning not be rapid, and that 
there are enough data per S to provide
for a sensible parameter search. In the 
long run, however, it is doubtful that 
any model can be assessed properly with- 
out meeting these two conditions: Rapid 
learning makes it difficult to distinguish
between learning models regardless of 
what assumptions are made about indi- 
vidual differences. 
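
The dependence check recommended above can be sketched as a lag-1 serial correlation. This is a minimal illustration rather than a full test battery; it uses Python's standard `random` module as a stand-in generator, and the seed and sample size are arbitrary choices.

```python
import random


def lag1_correlation(values):
    """Pearson correlation between each value and its successor."""
    x = values[:-1]
    y = values[1:]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


rng = random.Random(12345)  # arbitrary seed, for reproducibility only
sample = [rng.random() for _ in range(10_000)]
print(lag1_correlation(sample))  # should be close to zero
```

A value near zero is necessary but not sufficient evidence of independence; a generator can pass this check and still show dependence at longer lags.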

15 Ss received 16 adaptation, 16 conditioning, and 16 extinction trials
with an apparatus designed to eliminate reflex ORs to stimulus onset. 
The apparatus consisted of a dial with markings for CS intervals 
and a moving pointer. The UCS, an electric shock, occurred at CS 
offset. That the procedure effectively eliminated reflex ORs was 
exhibited by an extremely low rate of responses in the absence of 
threat of shock. Threat of shock increased responses to stimulus 
onset. During conditioning there was an early rise in responses for 
the conditioning interval only followed by a protracted decline. It
was concluded that habituation of the CR during conditioning and 
implicit expectancies associated with the stimulus situation continue to 
complicate GSR conditioning when reflex ORs to the CS are eliminated. 


A problem arises in GSR condition- 
ing in that almost any stimulus which 
can serve as a CS elicits the GSR 
prior to being paired with the UCS. 
Thus, identical responses are referred 
to as orienting or conditioned responses 
depending on whether they have oc- 
curred before or after the conditioning 
procedure. Unfortunately, adaptation 
to the CS, although widely resorted to, 
does not provide a sufficient control, as 
almost any environmental change, in- 
cluding the conditioning procedure it- 
self, can reinstate the orienting re- 
sponse (Stewart, Stern, Winokur, & 
Fredman, 1961). The problem is fur- 
ther exaggerated in studies of gen- 
eralization of conditioning where the 
presentation of test stimuli that differ 
from the CS introduces an additional 
source of orienting responses (see 
Burstein, Epstein, & Smith, 1967; Ep-
stein & Burstein, 1966; van Olst &
Orlebeke, 1965), very likely contrib- 
uting to the inconsistent findings on 
generalization gradients of the GSR. 


1 This research was supported by Grant 
MH-01293 from the National Institute of 
Mental Health, United States Public Health 
Service, to the second author. 


It is not our intent to deny that GSR 
conditioning has been demonstrated, 
as despite the complications introduced 
by orienting reactions, stimulus dis- 
parity effects, and sensitization effects, 
a few adequately controlled studies on 
differential conditioning would appear 
to support a genuine conditioning ef- 
fect. The establishment of differential 
GSR conditioning does not, however, 
eliminate the need for demonstrating 
the occurrence of nondifferential GSR 
conditioning, or for investigating the
influence of orienting reactions pro- 
duced by the CS upon the course of 
both differential and nondifferential 
conditioning. An approach that war- 
rants exploration is one that, unlike 
the usual approach with differential 
conditioning which attempts to control 
ORs by distributing them equally over 
positive and negative CSs, directly al- 
ters the contribution of the OR. 

A major source of orienting re- 
sponses is the sudden onset of the CS, 
i.e., the typical light or tone CS con-
sists of a sudden “burst” of light or
sound energy which produces a “reflex 
OR.” We refer to such ORs as reflex 
ORs to distinguish them from ORs 
produced by variations in qualitative
features of the stimulus attended to 
only because of past experience with 
the stimulus. The present study intro- 
duces a technique for investigating 
GSR conditioning in the absence of 
reflex ORs produced by the CS. Thus, 
the only way the CS can evoke a re- 
sponse is through its meaning, or ac- 
quired signal value, to S. 


METHOD 


Apparatus.—Figure 1 presents a photo-
graph of the apparatus. The display panel
was painted flat white and had a diameter 
of 36 in. The pointer, painted flat black,
was attached to a shaft which was rotated 
by a Syncron motor (115 v., 60 cycles, 3
rpm). Thirty-two orange rectangles, 1 in.
long and 1/2 in. wide, served as CSs. These
rectangles, which were held in place by metal 
clips painted flat white to match the panel 
background, could be placed at any point on 
the perimeter of the panel. The distances 
between rectangles established the inter- 
trial interval. A circuit, controlling the 
movement of a marker pen and the delivery 
of the UCS, was closed when a hidden shim 
brass arm, attached to the pointer shaft, 
made contact with the rear of the metal clip 


Fig. 1. The conditioning apparatus.


holding each CS. Thus, each CS had its
own contacts. A system of Grason-Stadler 
timers and relays was programmed so that 
presentation of the UCS was simultaneous 
with the termination of the CS interval.
The CS-UCS interval was set at 5 sec.—the 
time it took the pointer moving at 6 in/min 
to traverse the 1/2 in. width of the orange
rectangles. The UCS, delivered when the 
pointer broke contact with the orange rec- 
tangle, had a duration of .1 sec. The inter- 
trial intervals, corresponding to the dis- 
tances between the CSs, were randomly 
varied between 15 and 60 sec. with a mean 
of 37.5 sec. Previous work (Burstein et al., 
1967) revealed that 15 sec. was ample time 
for the recovery of the GSR. Further de- 
tails concerning the recording apparatus and 
procedure are provided elsewhere (Burstein 
et al., 1967). 
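
Because time on this apparatus is set by distance along the dial perimeter, the stated intervals can be converted to distances as a consistency check. The sketch below assumes only the pointer speed of 6 in/min given in the text; the function names are illustrative. The 0.5-in. figure it produces for the CS is derived here from the 5-sec. interval and the pointer speed.

```python
POINTER_SPEED_IN_PER_MIN = 6.0  # pointer speed stated in the text


def traversal_time_sec(distance_in, speed_in_per_min=POINTER_SPEED_IN_PER_MIN):
    """Seconds needed for the pointer to cover a distance in inches."""
    return distance_in * 60.0 / speed_in_per_min


def distance_for_interval_in(seconds, speed_in_per_min=POINTER_SPEED_IN_PER_MIN):
    """Inches of perimeter the pointer covers in the given number of seconds."""
    return speed_in_per_min * seconds / 60.0


# CS-UCS interval of 5 sec. and intertrial intervals of 15, 37.5, and 60 sec.
for seconds in (5.0, 15.0, 37.5, 60.0):
    print(seconds, "sec ->", distance_for_interval_in(seconds), "in.")
```

On these figures the intertrial spacing between rectangles ranges from 1.5 to 6 in., with a mean of 3.75 in.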

Subjects.—The Ss were 25 students in an
introductory psychology course at the Uni- 
versity of Massachusetts, 10 of whom con- 
stituted a control group. They were in- 
structed to keep their eyes on the pointer 
under all circumstances and were told 
that they would receive some shocks. They 
were tested in an IAC soundproof chamber 
and observed by E in a control room 
through a one-way screen. 

Procedure.—For the 15 Ss in the experi-
mental group, UCS intensity was set at a
level that each S described as “unpleas-
ant, but not painful.” After reminding Ss
to watch the pointer, E left the booth. The 
pointer was started haphazardly at any 
point on the perimeter so long as it was not 
on an orange rectangle, the CS. For trials 
during the initial, or adaptation, phase, the 
pointer traversed 16 CSs without a shock 
being delivered. The next 16 CS presenta- 
tions were accompanied by shock, after 
which 16 more shockless trials were pre-
sented. These 48 trials were presented
without interruption, i.e., no cues were pro-
vided to separate the three phases of the
experiment. 

The 10 Ss comprising the control group 
were treated identically to those in the ex- 
perimental group except that, immediately 
prior to leaving the booth, E removed the 
snap leads attached to the shock electrode 
and told S that he would not be given any 
shocks, and that it was, in fact, impossible 
to receive any without the leads to the 
electrodes. The control group was added 
to demonstrate that the CS by itself, i.e.,
independent of the experimental conditions
including threat of shock, does not pro-
duce ORs, i.e., it provides neither reflex
ORs nor ORs as a function of its past sig-
nificance to S. The control Ss were run
through the preshock, or adaptation, phase 
of the experiment only. 


RESULTS 


Any visible deflection of the record- 
ing pen in the 3.2-sec. interval prior 
to the contact of the pointer with the 
CS was scored as a first interval or 
anticipatory response; any deflection 
within the next 3.2 sec. was scored as 
a second interval or orienting response;
any deflection in the next, or third, 
3.2-sec. interval was scored as a third 
interval or conditioned response. The 
3.2-sec. intervals were chosen on the 
basis of findings in previous studies 
(Burstein et al., 1967; Epstein & Bur- 
stein, 1966) where it had been ob- 
served that the GSR response almost 
invariably had a latency of less than 
3.2 sec. Although the UCS (presented 
5 sec. after CS onset) was delivered 
during the third scoring interval (3.2- 
6.4 sec. after CS onset), because the 
latency of the UCR is rarely less than 
1.4 sec., there was little chance that a 
UCR would obscure a third interval 
response. In those rare, and easily 
identifiable, instances when short-la- 
tency UCRs did occur, only CRs which 
preceded them were scored. In order 
to assess the frequency of “nonspecific” 
responses, a 3.2-sec. interval midway 
between the CSs was also scored. A
maximum of one response per interval 
was scored. 
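
The scoring rule for the three intervals around CS onset can be expressed as a small classifier. This is a sketch for illustration only: time is measured from pointer contact with the CS, the treatment of deflections falling exactly on an interval boundary is an assumption, and the separate "nonspecific" interval midway between CSs is not modeled.

```python
INTERVAL_SEC = 3.2  # length of each scoring interval, as stated in the text


def score_interval(latency_from_cs_onset_sec):
    """Label a pen deflection by its time relative to CS onset.

    Negative times precede pointer contact with the CS. Returns None for
    deflections outside the three scored intervals.
    """
    t = latency_from_cs_onset_sec
    if -INTERVAL_SEC <= t < 0.0:
        return "first (anticipatory)"
    if 0.0 <= t < INTERVAL_SEC:
        return "second (orienting)"
    if INTERVAL_SEC <= t < 2 * INTERVAL_SEC:
        return "third (conditioned)"
    return None


print(score_interval(-1.0))
print(score_interval(1.0))
print(score_interval(5.0))
```

At most one response per interval was scored, so in practice only the first deflection in each interval would be passed to such a function.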

The data were scored for both mag- 
nitude and frequency of response. Be- 
cause of the large number of zero re- 
sponses, far in excess of that normally 
found in conditioning studies in which 
there is either a stimulus of sudden 
onset or S is instructed to produce 
some response, if only a verbal label, 
it was decided that the most appro- 
priate analysis was in terms of fre- 


quencies grouped into blocks of trials. 
The magnitude data produced ex-
tremely skewed distributions, the cell
means of which were markedly dis- 
torted by one or two deviant responses. 
When the magnitude data were cor- 
rected for individual variability in 
range, the results closely paralleled the 
frequency data. Accordingly, only the 
frequency data are reported. For the 
experimental group an analysis of 
variance was performed on the fre- 
quency of responses arranged in blocks 
of four trials for the three scoring in- 
tervals within the three experimental 
conditions. For the adaptation phase 
Trials 1-16 were used. For the con- 
ditioning phase Trials 18-33 were 
used. Trial 17 was omitted, as the re- 
sponses scored for it preceded the first 
UCS presentation, and Trial 33 was 
included, as the responses scored for 
it occurred before a UCS had been 
omitted. For the extinction phase 
Trials 34-48 were used, and the final 
block of 4 trials was completed by 
extrapolating from the data on 3 trials. 
An alternate analysis performed on the 
data blocked in four groups of 4 trials 
within the initial division of three 16- 
trial phases yielded virtually identical 
F ratios. 
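
The assignment of trials to blocks described above can be laid out explicitly. The sketch below only partitions the stated trial ranges into consecutive blocks of four; the names are illustrative. How the value for the missing fourth trial of the last extinction block was extrapolated is not described in the text, so the sketch stops at identifying that short block.

```python
# Trial ranges used in the primary analysis, as stated in the text.
PHASE_TRIALS = {
    "adaptation": range(1, 17),     # Trials 1-16
    "conditioning": range(18, 34),  # Trials 18-33 (Trial 17 omitted)
    "extinction": range(34, 49),    # Trials 34-48
}


def blocks_of_four(trials):
    """Split a sequence of trial numbers into consecutive blocks of four."""
    trials = list(trials)
    return [trials[i:i + 4] for i in range(0, len(trials), 4)]


for phase, trials in PHASE_TRIALS.items():
    print(phase, blocks_of_four(trials))
```

Each phase yields four blocks; only the final extinction block (Trials 46-48) contains three trials rather than four.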

The results are presented in Fig. 2. 
Included in Fig. 2, although not in the 
analyses because of the high frequency 
of zero responses, are data on the non- 
specific GSRs. 

The Phase effect was significant at 
the .01 level, F (2, 28) = 6.79. The
mean number of responses in the adap- 
tation, conditioning, and extinction 
phases, respectively, was 3.07, 4.33, and 
2.24 per block. The nonspecifics varied 
in the same direction, but to a lesser 
extent, with corresponding means of 
1.93, 2.26, and 1.60. The Block effect 
was significant at the .01 level, F (3, 
42) = 5.55, reflecting the weakening of 
the responses over blocks pooled over 

Fig. 2. Frequency of GSRs as a function of blocks of trials for adaptation,
conditioning, and extinction phases.

tion was significant at the .05 level, F
(6, 84) = 2.82. While there was a de- 
crease over blocks in the first two 
phases, there was an increase from the 
first to the second half of the extinction 
phase, with means in order of Blocks 
1-4 as follows: .51, .44, .67, and .62
responses per block. The Interval 
effect was not significant, nor was the 
Phase x Interval interaction. How- 
ever, inspection of Fig. 2 reveals that 
there was a tendency in the adapta- 
tion phase for the second interval to 
have the highest response strength, 
and in the conditioning phase for the 
third interval to have the highest re-
sponse strength. These tendencies
were more marked in the first block 
of trials than in later blocks.

Some distortion was introduced by 
the blocking of the first few trials in 


the adaptation and conditioning phases 
due to rapid, early changes. For this 
reason, the first five single trials are in- 
cluded as an inset in Fig. 2. It is 
noteworthy that overall responsivity 
to the first presentation of the CS in 
the adaptation phase was greater than 
following the first shock in the condi- 
tioning phase. However, in the adap- 
tation phase there was an overall de- 
crease in responsivity from the first to 
the fifth trial, while in the conditioning 
phase the third interval response 
showed a steady increase from the first 
to the fourth trial followed by a de- 
crease on the fifth trial. The peak re- 
sponse during the third scoring inter- 
val in the conditioning phase was 
greater than the peak response for any 
other scoring interval in any other 
phase. 

In order to assess the effect of 
threat of shock upon reactions to the
CS in the absence of conditioning, the 
control group was compared to the 
experimental group during the adapta- 
tion phase only. Because of the ex- 
tremely high number of zero responses 
in the control group, the data were 
analyzed by chi-square applied to a 
median split for total responses per S. 
A value of 6.83, df = 1, indicated that 
the experimental group was signifi- 
cantly more responsive than the con- 
trol group at the .01 level. It was 
further observed that the reactivity of 
the control group to the CS was less 
than the nonspecific reactivity of the 
experimental group. 


Discussion 


The technique used in this study would 
appear to have an advantage over stan- 
dard GSR conditioning procedures in 
that it eliminates reflex orienting re- 
actions to CS onset. Support for the 
view that the apparatus used in this study 
effectively reduced orienting responses 
to the CS comes from an inspection of 
response frequencies obtained in another 
study conducted in this laboratory in- 
volving a procedure identical to that used 
in this study, but employing a CS which 
had a sudden onset (1,000-cps tone, 40-db. 
SPL). While 71% of Ss responded 
within 3.2 sec. after the first CS onset in 
that study, only 40% responded within 
3.2 sec. after the first CS onset in the 
present study. 

Although the procedure used in the 
present study eliminated reflex orienting 
reactions, response frequency to stimulus 
onset during the adaptation phase was 
nevertheless higher than the nonspecific 
rate. Given the threat of shock, appar- 
ently any stimulus which can be inter- 
preted as possibly signaling the occur- 
rence of a shock, including the nonsudden 
onset of a CS, can be expected to evoke 
a response. Such implicit expectancies 
associated with stimulus onsets can be 
viewed as similar, if not identical, to 
conditioned responses, the difference 


being that they were not directly manip- 
ulated by E. One way of viewing 
the results of the present study is to con- 
sider that during the adaptation phase, 
in the absence of confirmation, the initial 
expectancy that shocks would be received 
at stimulus onset was extinguished, and a 
new expectancy was established with re- 
spect to stimulus offset. It is noteworthy 
that during extinction, responses to 
stimulus onset steadily increased and 
tended to reassert their dominance, indi- 
cating that intrinsic expectancies based 
on preexperimental biases may recover 
despite a period of experimental dis- 
confirmation. 

The conclusion that implicit expectan- 
cies must be taken into consideration 
when interpreting results from classical 
conditioning procedures gains further 
support from a finding in this study 
which, although anecdotal, seems worthy 
of mention. For four Ss the recording 
apparatus and the motor driving the 
pointer were inadvertently left on after 
completion of the 48th experimental trial, 
marking the end of the experiment. As a
result, a 17th extinction trial was pre- 
sented which could have been construed 
to begin a third cycle of 16 trials. In 
each case, an unusually large response 
was obtained. Each of the four Ss, when 
interviewed, stated that he thought that 
another series of shock trials might be 
forthcoming. 

Expectancies associated with stimulus 
onset further complicate the question of 
whether a response is an OR or a CR. 
Stewart et al. (1961) conclude that re- 
actions to stimulus onset in typical con- 
ditioning studies with a CS of sudden 
onset are more apt to be orienting re- 
actions than conditioned reactions. The 
data of the present study suggest that 
another complicating factor enters into 
the picture, i.e., that reactions to stimulus 
onsets may be influenced by the extinc- 
tion and spontaneous recovery of initial 
implicit expectancies. 

Was conditioning illustrated? By the 
usual criterion of an increase in response 
strength beyond that at the end of the 
adaptation trials, it was. However, this 
criterion leads to the view that the non-
specifics were also conditioned. The fact
that there were fewer responses in the 
conditioning trial than in the first adapta- 
tion trial might be taken as evidence that 
the conditioning procedure did no more 
than cause the partial recovery of 
adapted-out orienting reactions (Ste- 
wart et al., 1961). 

However, there is also evidence that 
supports an interpretation that condition- 
ing was demonstrated. It is not critical 
in establishing conditioning to demon- 
strate that response strength for a CR 
is greater than for an orienting reaction 
which can be very strong. What is nec- 
essary is to demonstrate some degree of 
independence between the two. As noted 
earlier, while necessary for parametric 
analysis, the presentation of trials in 
blocks introduced a measure of distortion 
for the early trials. Inspection of the 
data from single trials indicated that 
during conditioning, the number of GSRs 
to the third, or conditioning, interval in- 
creased from the first to the fourth trial 
before decreasing. Neither the other in- 
tervals nor the nonspecifics exhibited a 
similar initial increase. Moreover, the 
peak of the inverted U-shaped curve of 
the GSR for the third interval during the 
conditioning phase of the experiment 
was higher than any other point for any 
interval during any phase. 

Assuming that conditioning did occur, 
the data indicate a serious problem that 
studies on GSR conditioning must con- 
tend with. Namely, GSR conditioning 
occurs in a very few trials and is then 
followed by a decrease in response 
strength during further conditioning
trials. The finding of an inverted U-
shaped acquisition function is in agree-
ment with data reported by Grings, Lock-
hart, and Dameron (1963), Kimmel 
(1959, 1966), Meryman (1953), and 
Stewart et al. (1961). Since response 
strength may be no greater at the end 
than at the beginning of an extended 
conditioning period, it follows that if 
GSR conditioning is to be efficiently in- 
vestigated, the number of conditioning 
trials must either be appropriately limited, 


or the conditioning process must be 
monitored, and an appropriate cutoff 
point selected for each individual. 

The question may be raised as to 
whether the underlying conditioning 
process is indeed rapid in the sense of 
being completed in a few trials, or ap- 
pears so only because of the masking 
effect of some other factor, such as a 
transient inhibitory effect. To examine 
this possibility, work is required on the 
effect of number of reinforcement trials 
upon response strength with time be- 
tween conditioning and test trials treated 
as a parameter. 

During a prefamiliarization period, 20 Ss responded (Group R) and
20 Ss did not respond (Group NR) at offset of a 5-sec. light. The 2 
groups failed to show differential rates of GSR response decrement, 
but Group R did exhibit larger intertrial GSRs than Group NR. 
Response speeds (RS) then were obtained from all Ss to presenta- 
tions of a prefamiliarized or a nonfamiliarized light. The hypothesis 
that faster RS would accompany the nonfamiliarized stimulus was not 
supported. The expectation that Ss differing in GSR responsivity 
would show differential RS performance was confirmed; high- as 
opposed to low-GSR responders showed an increment in RS across 
trials. This finding was discussed according to an associative inter-
pretation and related to the GSR results.


If a warning signal precedes the 
execution stimulus in the reaction-time 
(RT) experiment, the situation can be 
described in terms of the classical con- 
ditioning paradigm (Behar & Adams, 
1966; Grant, 1964). Thus, the follow- 
ing equations can be made: the warn- 
ing signal with the CS, the reaction 
stimulus with the UCS, and the inter- 
val between warning-signal onset and 
the reaction stimulus, usually referred 
to as the foreperiod, with the CS-UCS 
interval.

Beecroft (1966) reviewed evidence 
that UCS prefamiliarization hinders 
classical conditioning; it is also the 
case that prefamiliarization of the UCS 
has an adverse effect in the RT con- 
text. In a series of studies with chil- 
dren, Cantor and Cantor (1964, 1965) 
have observed faster response speed 
(RS) to a nonfamiliarized stimulus as 
compared to a stimulus preexposed 
during an initial familiarization period 
when no response was required. This 
finding of relatively slower RS to a 

1 The authors wish to thank C. Clifton and
C. Spiker for their suggestions and comments
on an earlier version of this paper.


familiarized UCS appears to be quite 
general, having been obtained for a 
lever-pulling response and a button- 
pressing response. Moreover, this ef- 
fect of UCS prefamiliarization has 
been replicated by Bogartz and Witte 
(1966) with children in two experi- 
ments, one with a simple RT procedure 
and the other involving discriminative 
RT. Grim and White (1965) re- 
corded simple RT of adults and re- 
ported a finding apparently opposite to 
those just cited. Slower RS occurred 
when color of the UCS was changed 
on a test trial after 16 initial familiari- 
zation presentations. It should be 
noted, however, that the latter study 
departed from the Cantor and Cantor 
procedure in two ways: Grim and 
White had Ss respond during the fa- 
miliarization trials and they did not 
employ a warning signal. 

Little work has been done regarding 
CS prefamiliarization in the RT situa- 
tion, although this manipulation has 
been investigated in the conditioning 
context. Both Lubow (1965) and 
Carlton and Vogel (1967), for example,
found that preexposure of the
future CS resulted in a performance
decrement in animals. Sokolov (1963, 
p. 249) has commented that prefamili- 
arization of the CS retards the appear- 
ance of GSR responses to the CS in 
a conditioning paradigm with an in- 
structed response. 

The present study employed a sim- 
ple RT task and two types of CS pre- 
familiarization procedures. Onset of 
a 5-sec. warning light served as the 
CS, whereas its offset was the UCS 
for a button depression. Prior to the 
RT trials all Ss were preexposed to 
the light; during this familiarization 
phase, half the Ss (Group R) re- 
sponded to light offset while the re- 
maining Ss (Group NR) only ob- 
served the light. This manipulation 
was introduced to assess the influence 
of responding vs. not responding on 
RS during the test phase; positive 
transfer was anticipated for Group R 
resulting in faster RS. Also of inter- 
est was the possibility that differential 
response decrements would occur for 
Groups R and NR in the familiariza- 
tion phase. This question stemmed 
from the proposition (Lynn, 1966, p.
30) that stimuli, given cue properties
(i.e., the light for Group R), are more 
resistant to habituation-like effects. 

During the test phase following the 
familiarization period, while RS data 
were recorded from all Ss, half the 
trials involved presentation of the fa- 
miliarized light and the remainder were 
exposures of a different colored light. 
The first objective here was to assess 
the effect of CS prefamiliarization on 
(a) magnitude of the GSR to stimulus 
onset and (b) RS to stimulus offset. 
It was expected that the nonfamiliar- 
ized CS as compared to the familiarized 
CS would be accompanied by both 
larger GSRs and faster RSs. A final
question dealt with the possibility that 
significant variability in RS perform- 


ance is attributable to the organismic 
variable of GSR responsivity. Maltz- 
man and Raskin (1965) concluded 
that Ss showing high “orienting-re- 
flex” activity (defined by GSR respon- 
sivity) show better semantic condition- 
ing and paired-associate learning than 
Ss scoring low on this dimension.
Here, the question was whether dif- 
ferential RS performance would be 
shown by Ss exhibiting high vs. low 
GSR responsivity. 


METHOD 


Subjects —Forty-three female undergrad- 
uate students at the University of Iowa 
served as Ss. Three Ss were eliminated because
of equipment malfunction. Participation
in the experiment was part of the elementary
psychology course requirement. The
median age of Ss was 19 years. 

Apparatus—A black box (10 X 10X44 
in.) containing a 24-in. diameter opal glass 
window displayed the stimuli. Blue and yellow
6-v. bulbs mounted behind the window
were adjusted to be equal in brightness as 
judged by three Os. Hunter interval timers 
controlled stimulus duration, which was 5 
sec., and intertrial interval randomly varied 
at 6, 7, or 8 sec. The response button was 
mounted over a microswitch in a box (194 
X5X3 in.) which S held in her lap. Re- 
sponse latencies, the interval from offset of 
the stimulus light to button depression, were 
recorded in milliseconds by a Hunter Kloc- 
kounter and reciprocalized to speed scores. 
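The conversion of latencies to speed scores can be sketched as follows. This is a minimal illustration, not the authors' scoring routine: the function names are hypothetical, and the unit (reciprocal seconds) is an assumption. It is at least consistent with the mean RS values near 3.85 reported in the test phase, which would correspond to latencies of roughly 260 msec.

```python
def speed_score(latency_ms: float) -> float:
    """Return response speed as the reciprocal of latency in seconds.

    Assumption: latency is recorded in milliseconds and speed is
    expressed per second, so speed = 1000 / latency_ms.
    """
    if latency_ms <= 0:
        raise ValueError("latency must be positive")
    return 1000.0 / latency_ms


def mean_speed(latencies_ms):
    """Average the speed scores (not the latencies) over a set of trials."""
    scores = [speed_score(x) for x in latencies_ms]
    return sum(scores) / len(scores)
```

Note that the mean of the speed scores generally differs from the reciprocal of the mean latency, so the order of averaging and reciprocalizing matters.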

Skin resistance readings from a Fels 
dermohmmeter were recorded on a Grass 
Model 7 polygraph. Zinc cup electrodes with 
a surface area of 28.5 cm², containing zinc
sulfate paste, were attached to the palm and
dorsal surfaces of S’s left hand. A subject
current of 40 µa. gave a current density of
12.7 µa/cm².

Procedure—The electrodes were attached, 
and instructions appropriate to S’s group af- 
filiation were given. Group NR was told 
only to watch the lights in the stimulus box 
located approximately 36 in. from their eyes. 
Group R also was asked to rest their index 
fingers on the response button and press it 
as fast as they could after the light went off. 
Sixteen presentations, of either a blue or yel- 
low light, then were given, this constituting 
the familiarization phase; half the Ss in each 
group received one of the two counterbalancing
conditions for light color. Following the
familiarization phase, Group R was asked to 
continue to respond to light offset, while 
Group NR now was given the response box 
and the instruction to respond. During the 
32 test-phase trials, the familiarized stimulus
and a nonfamiliarized stimulus (i.e.,
whichever light had not been exposed during 
the familiarization phase) were each pre- 
sented 16 times in a random sequence with 
the restriction that equal numbers of the 
two types of stimuli occur in a block of 
8 trials. 
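The restricted randomization of the test-phase trials can be sketched as below. This is only one way to satisfy the stated restriction (equal numbers of the two stimulus types within each block of 8 trials); the function name, the labels, and the use of a seeded generator are assumptions made for illustration.

```python
import random


def make_test_phase_sequence(n_blocks=4, block_size=8, seed=None):
    """Build a 32-trial order with 4 familiarized and 4 nonfamiliarized
    stimuli in every block of 8, shuffled independently within blocks."""
    rng = random.Random(seed)
    half = block_size // 2
    sequence = []
    for _ in range(n_blocks):
        block = ["familiarized"] * half + ["nonfamiliarized"] * half
        rng.shuffle(block)  # random order inside the block only
        sequence.extend(block)
    return sequence
```

With the default arguments each stimulus type appears 16 times overall, as in the procedure described above.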

Data scoring—Decreases in skin resist- 
ance of at least 200 ohms were defined as 
GSRs. The response was quantified by 
measuring two points: the resistance value 
where the deflection began (prelevel) and 
the minimum resistance value reached before 
the pen changed direction (peak level). 
These readings were reciprocalized and the 
final GSR score was the difference between 
the pre- and peak measures or change in log 
conductance in micromhos. GSRs begin- 
ning during the CS-UCS interval were 
classified as “stimulus GSRs” and scored sep- 
arately from those which occurred during 
the intertrial interval termed “intertrial 
GSRs.” The classification of Ss into GSR- 
responsivity groups was based on stimulus 
GSRs to the initial light of the test phase 
which was the familiarized stimulus. These
GSRs were divided at the median, separately 
for Groups R and NR, into groups of high- 
and low-GSR responders. 
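The scoring rule just described can be sketched in code. The sketch rests on several assumptions that the text does not settle: that the score is the change in log conductance, that the logarithm is base 10, that responses below the 200-ohm criterion are scored as zero, and that Ss tied at the median are placed in the low group. All function and variable names are hypothetical.

```python
import math
from statistics import median

MIN_DROP_OHMS = 200.0  # criterion for counting a deflection as a GSR


def gsr_score(pre_ohms: float, peak_ohms: float) -> float:
    """Change in log10 conductance (micromhos) from prelevel to peak level.

    Returns 0.0 when the resistance drop is smaller than the criterion.
    """
    if pre_ohms - peak_ohms < MIN_DROP_OHMS:
        return 0.0
    pre_conductance = 1e6 / pre_ohms    # reciprocal of resistance, in micromhos
    peak_conductance = 1e6 / peak_ohms
    return math.log10(peak_conductance) - math.log10(pre_conductance)


def median_split(scores: dict) -> dict:
    """Label each S 'high' or 'low' relative to the median of one group.

    Call once per instruction group (R and NR) so that the split is made
    separately within each group.
    """
    cut = median(scores.values())
    return {s: ("high" if value > cut else "low") for s, value in scores.items()}
```

For example, a drop from 50,000 to 49,000 ohms gives a score of about .009 under these assumptions, which is of the same order as the group means reported in the results.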


RESULTS 


Familiarization phase—The GSR 
scores in four-trial blocks for Groups 
R and NR are presented in Fig. 1, 
separately for stimulus GSRs and intertrial
GSRs. These data were analyzed
with a 2 X 2 X 2 X 4 factorial
analysis of variance, which included 
two between-Ss and two within-Ss 
factors. The between-Ss factors were 
counterbalancing (blue or yellow 
light prefamiliarized) and instruction 
(Groups R and NR). The within-Ss 
factors included blocks (four blocks 
of four trials) and period (i.e., stimu- 
lus vs. intertrial GSR). 
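The degrees of freedom implied by this design can be checked with a short sketch. It assumes 40 Ss (20 per instruction group), equal cell sizes, and the conventional mixed-design partition in which each within-Ss factor is tested against its own Ss-within-groups interaction; the function name is hypothetical.

```python
from math import prod


def split_plot_df(n_subjects, between, within):
    """Return (effect df, error df) for main effects and for
    between x within interactions in a mixed factorial design.

    between, within: dicts mapping factor name -> number of levels.
    """
    error_between = n_subjects - prod(between.values())
    df = {}
    for name, levels in between.items():
        df[name] = (levels - 1, error_between)
    for w_name, w_levels in within.items():
        error_within = (w_levels - 1) * error_between
        df[w_name] = (w_levels - 1, error_within)
        for b_name, b_levels in between.items():
            df[f"{b_name} x {w_name}"] = ((b_levels - 1) * (w_levels - 1), error_within)
    return df


familiarization = split_plot_df(
    40,
    {"counterbalancing": 2, "instruction": 2},
    {"blocks": 4, "period": 2},
)
```

Under these assumptions the sketch gives 3 and 108 df for Blocks and for Instruction X Blocks, and 1 and 36 df for Instruction, Period, and Instruction X Period, which agrees with the F ratios reported in this section.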

A significant main effect for Blocks, 
F (3, 108) = 15.31, p < .001, indi- 
cated that the GSR scores showed a 


FIG. 1. Stimulus-GSR and intertrial-GSR
means for Groups R and NR on trial blocks
of the familiarization phase. (Panels: stimulus
GSR and intertrial GSR; abscissa: four-trial blocks.)


response decrement across trials, thus 
pointing to the effectiveness of stimulus 
repetition in producing what may be 
termed an intrastimulus-familiarization 
effect. The expected difference be- 
tween Groups R and NR in rate of 
response decrement failed to appear as 
revealed by a nonsignificant Instruc- 
tion X Blocks interaction, F (3, 108) 
= 1.48, p > .05. As seen in Fig. 1, 
Group R exhibited larger GSRs than 
Group NR, this difference being most 
pronounced for intertrial GSRs. This 
observation was supported by the oc- 
currence of a significant Instruction X 
Period interaction, F (1, 36) = 15.90,
p < .001, in addition to significant
main effects for Instruction, F (1, 36)
= 6.51, p < .05, and Period, F (1,
36) = 7.04, p < .05. Follow-up t tests
(.05 level) comparing the means involved
in the significant Instruction X
Period interaction confirmed the conclusion
that the primary reason for
this effect was the occurrence in Group
R of large intertrial GSRs. Group R
showed an intertrial-GSR level (X̄ =
.010) that was significantly higher than
its stimulus-GSR level (X̄ = .005)
and both the stimulus-GSR (X̄ =
.002) and intertrial-GSR (X̄ = .001)
levels of Group NR.

Test phase. —The CS-prefamiliariza- 
tion manipulation was ineffective in
that presentations of the familiarized 
CS (X̄ = 3.85) and nonfamiliarized
CS (X̄ = 3.86) resulted in nearly identical
mean RS. The two types of CS-prefamiliarization
procedures, involving
respond vs. no-respond instructions,
also had little differential effect
on RS during the test phase as shown
by similar mean RS for Group R (X̄ =
3.82) and Group NR (X̄ = 3.89).
These data were analyzed with a 2 X
2 X 2 X 4 factorial analysis of variance.
Between-Ss factors were instruction
and GSR-responsivity groups.
Within-Ss factors included blocks 
(four blocks of eight trials) and 
familiarity (familiarized vs. nonfamili- 
arized CS). This analysis confirmed 
the above conclusions regarding the 
absence of prefamiliarization effects. 
The familiarity and instruction fac- 
tors were both nonsignificant main ef- 
fects, F <1 in each case, and neither 
factor entered into any significant in- 
teractions. 

Mean RSs are plotted in Fig. 2 for 
Ss responding with high vs. low stimulus
GSRs on the first trial of the test

FIG. 2. Test phase RS for high- and
low-GSR-responsivity groups.


phase. The two groups showed mark- 
edly dissimilar temporal patterns in 
RS; whereas low-GSR responders
showed little change across trials, the
high-GSR responders exhibited a pro- 
nounced increment in RS on the last 
two trial blocks. These conclusions 
were verified by follow-up comparisons 
(t tests at the .05 level) done on the 
mean values for each trial block after 
the RS analysis revealed the main 
effect for GSR-Responsivity Groups, 
F (1, 36) =4.39, p< .05, and the 
Blocks x GSR-Responsivity Groups 
interaction, F (3,108) = 4.60, p < .01, 
to be significant. None of the trial 
block means were found to differ for 
the low GSR responders indicating the 
absence of reliable across-trial changes 
in this group. There was a significant 
RS increment for the high-GSR re- 
sponders ; the means for Trial Blocks 3 
and 4 each differed from the means for 
Trial Blocks 1 and 2. In addition, the 
means for the high vs. low GSR-Re- 
sponsivity Groups were found to differ 
significantly at both Trial Blocks 3 
and 4.

The GSR data were analyzed like 
the RS data, separate analyses being 
done on stimulus GSRs and intertrial 
GSRs. Dissimilar GSR levels were 
not associated with familiarized and 
nonfamiliarized stimuli, the familiarity 
main effect being nonsignificant, F < 1, 
for both types of GSR score. For 
stimulus GSRs three effects were sta- 
tistically reliable: the GSR-Responsiv- 
ity Groups main effect, F (1, 36) = 
14.88, p < .001; the Blocks main effect,
F (3, 108) = 10.61, p < .001;
and the GSR-Responsivity Groups X 
Blocks interaction, F (3, 108) = 6.78, 
p < .001. 

The means involved in this inter- 
action are portrayed in the left side of 
Fig. 3. No significant differences were 
found among the mean GSR values on

FIG. 3. Stimulus-GSR and intertrial-GSR
means for high- and low-GSR-responsivity
groups on trial blocks of the test phase.


the four trial blocks for low-GSR re- 
sponders indicating that this group 
showed no response decrement. A re- 
sponse decrement in stimulus GSRs 
was found for high-GSR responders 
(i.e., Block 2 and all subsequent blocks 
had significantly lower means than Block
1). The high-GSR responders also 
showed mean stimulus-GSR levels sig- 
nificantly higher than those for low- 
GSR responders on the first two trial 
blocks of the test phase. 

The analysis of intertrial GSRs 
yielded results comparable to those just 
cited, significant sources of variance 
being: the GSR-responsivity groups 
main effect, F (1, 36) = 11.12, p <
.01; the Blocks main effect, F (3,
108) = 6.90, p < .001; and the GSR- 
Responsivity Groups X Blocks interac- 
tion, F (3, 108) = 3.71, p< .01. The 
mean values involved in this interaction 
are portrayed in the right side of Fig. 
3. Follow-up t tests indicated that this
interaction can be attributed to a response
decrement for high-GSR responders
but not for low-GSR responders.
Significant mean differences
were found in the former group be- 
tween Block 1 vs. Blocks 2, 3, and 4, 
and also between Block 2 vs. Blocks 3 
and 4. The two groups also were 
found to have dissimilar intertrial-GSR 
levels: At each block of the test phase 


the high GSR responders showed a 
significantly higher mean than the low 
GSR responders. 

The above analyses, showing that 
CS prefamiliarization had little influ- 
ence on RS and GSRs during the test 
phase, used trial-blocked data repre- 
senting rather gross time intervals. 
Additional analyses of variance were 
used, comparing the familiarized vs. 
nonfamiliarized stimuli on their first 
and second presentations, to determine 
whether transient effects might be de- 
tected. In the analyses, neither RS, 
stimulus GSRs, nor intertrial GSRs 
had significant main effects or interac- 
tions involving the familiarity factor 
thus indicating the absence of even an 
initial prefamiliarization effect. 


DISCUSSION


The significant GSR response decre- 
ment during the familiarization phase 
indicated that stimulus repetition resulted 
in an habituation-like or intrastimulus-fa- 
miliarization effect. Differential response 
decrements were not associated with in- 
structions to respond or not respond at 
stimulus offset (i.e., the Instruction X
Blocks interactions were nonsignificant). 
This result thus failed to confirm Lynn’s 
(1966) generalization that habituation 
(or “extinction” in Sokolov’s, 1963, 
terms) is prolonged to stimuli with cue or 
signal properties. Two aspects of this 
generalization require clarification. First, 
the type of response used as an index of 
the effects of stimulus repetition must be 
considered. It should be noted that Soko- 
lov reaches different conclusions regard- 
ing vasomotor and GSR responses as in- 
dicators of the process of habituation of 
the orientation reaction (OR). Sokolov 
(1963, p. 164) concludes that habituation 
of the OR, indexed by the vasomotor re- 
sponse, is “much slower” with signal than 
with nonsignal stimuli. In contrast, concerning
the GSR as an OR component,
Sokolov (1963, pp. 194-195) maintains 
that “orientation reactions are stronger 
to signal than to nonsignal stimuli” but
the processes of GSR habituation are
“similar” for the two types of stimuli. 
The latter statement coincides with the 
present results showing that Group R 
had larger intertrial GSRs than Group 
NR during the familiarization phase but 
that the two groups did not exhibit dif- 
ferential response decrements. A second 
issue concerns the criterion for conclud- 
ing that a manipulation, intended to give 
cue properties to a stimulus, is effective. 
A significant mean difference between 
signal vs. nonsignal conditions would be 
sufficient to demonstrate a difference in 
overall effectiveness between two types of 
instruction. The significant Instruction 
X Period interaction obtained here for 
GSRs in the familiarization phase indi- 
cated that such an effect occurred, at 
least for intertrial GSRs. The more spe- 
cific question raised in this study, how- 
ever, was whether different rates of re- 
sponse decrement would accompany the 
two instructions, a positive answer re- 
quiring a significant Instruction X Blocks 
interaction. According to this criterion, 
no evidence was obtained of differential 
habituation under the instructions. Thus, 
although the rate of GSR habituation may 
not be altered by instructions presumably 
giving a visual stimulus a UCS function, 
it appears that such a manipulation does 
amplify the GSR subsequent to the occur- 
rence of the UCS. 

It is not clear whether the failure to
find faster RSs to the nonfamiliarized 
stimuli, as found in situations similar to 
the present one where they were employed 
as a UCS (Bogartz & Witte, 1966; Can- 
tor & Cantor, 1964), is due to the ineffec- 
tiveness of the CS-prefamiliarization pro- 
cedure or other innovations in this study. 
Two factors, duration of stimulus ex- 
posure and age of Ss, might account for 
the failure to obtain a CS-prefamiliariza- 
tion effect. The relatively long stimulus 
exposure, resulting in rapid intrastimulus 
familiarization, could conceivably have 
minimized interstimulus-familiarity dif- 
ferences. In addition, Elliot (1964) con- 
cluded that children are less able than 
adults to maintain attention to motor 
tasks. It does appear that adults perform 
much faster in this type of situation since
mean RS found here were approximately
twice as fast as those obtained by Bo- 
gartz and Witte (1966), for example, 
who studied kindergarten children. Thus, 
it might be speculated that adults, being 
more task oriented, are less susceptible 
than children to the effects of familiariza- 
tion procedures which would interfere 
with optimal RS. 

Support was obtained for the Maltzman 
and Raskin (1965) belief regarding the 
importance of between-Ss differences in 
OR activity as measured by GSR mag- 
nitude to an initial stimulus. The fact 
that high-GSR responders (“high orient- 
ers” in the Maltzman and Raskin termi- 
nology) showed a significant RS incre- 
ment in the test phase could be interpreted 
by two alternative hypotheses. The first 
hypothesis attributes the relative RS su- 
periority of high-GSR responders to a 
decremental process on the part of the 
low-GSR responders. The CS-induced 
ORs, which could initially have facilitated 
processing of the UCS via mechanisms 
outlined by Lynn (1966), might have 
habituated, this leading to relatively 
slower RS in the low-GSR-responsivity 
group. This decremental hypothesis was 
not supported, if stimulus GSRs are 
equated with occurrence of the OR, since 
the low-GSR responders failed to exhibit 
a response decrement in stimulus GSRs 
during the test phase. The second hy- 
pothesis attributes the superior RS per- 
formance of high-GSR responders to an 
associative process, in agreement with the 
view of Maltzman and Raskin (1965) 
that “high orienters” are more sensitive 
to conditioning procedures. Learning of 
the CS-UCS contingency could have been 
facilitated by an elevated level of arousal, 
of increased OR activity, or both, induced 
by the CS. Either conception would 
imply greater GSR activity on the part of 
the high-GSR responders during the CS- 
UCS interval. The finding that this
group showed an initially elevated stimu- 
lus-GSR level during the test phase thus 
appears to support the interpretation that 
the RS increment on later trials could 
have been due to learning of the CS-UCS 
contingency by high-GSR responders. 
Replication of this phenomenon obviously
is needed with a finer temporal analysis
of GSRs and appropriate control for 
pseudoconditioning. 

Congruent with recent observations 
(Andreassi & Whalen, 1967; Germana 
& Pavlik, 1964) regarding electrodermal
correlates of verbal learning, it appears 
that the significant GSR response decre- 
ments, shown by high-GSR responders, 
may have signaled the consolidation point 
in learning. It was only after the occur- 
rence of these response decrements in both 
stimulus GSRs and intertrial GSRs that 
a significant increment appeared in the 
RS curve. Some question can be raised, 
however, concerning the status of the 
intertrial GSR, particularly the greater 
degree of such activity on the part of 
high-GSR responders. These responses 
simply might be due to motor artifacts 
from key pressing, or perhaps they could 
be correlated with a higher motivational 
state in the high-GSR-responsivity group. 
Interestingly, Maltzman and Raskin 
(1965) have asserted that the OR, in- 
dexed by the GSR, serves as a reinforcer. 
Accepting their view would implicate in- 
strumental conditioning in the observed 
RS increment by attributing it to dif- 
ferential reinforcement on the part of 
the two GSR-responsivity groups.


FUNCTIONAL STIMULUS SELECTION AS RELATED
TO COLOR VERSUS VERBAL STIMULI 1


ROBERT L. SOLSO 2 


Saint Louis University 


This study tested functional stimulus selection among 3 learning 
paradigms and 5 stimulus conditions. The paradigms conformed to 
the A-B, A-Br, and A-C transfer prototypes with the A terms repre- 
senting a color stimulus within a compound stimulus and the B, Br, 
and C terms representing preexperimentally associated, preexperimentally
associated and re-paired, and preexperimentally nonassociated
verbal responses to the color terms, respectively. In addition, a
within factor of color meaningfulness was considered in the analysis.
The stimulus conditions consisted of colors alone, colors + low-meaningful
trigrams (LMT), colors + high-meaningful words (HMW),
LMT alone, and HMW alone. Significant main effects and selected
comparisons were interpreted in terms of a stimulus-competition
hypothesis.


The tendency to select a functional
stimulus from a compound stimulus has
been well documented within the current
verbal learning literature. Some
investigators have used the technique
of presenting a compound stimulus
composed of a verbal unit and a color
(e.g., Underwood, Ham, & Ekstrand,
1962; Weiss & Margolius, 1954).
Stimulus selection in these studies is
generally thought to be a function of
the relative meaningfulness of each
of the two elements within a compound
stimulus, that is, the higher the meaningfulness
of one element, the higher
the probability that it will be selected
as a functional stimulus. Despite the
active research in this field, no objective
measure of the meaningfulness of
color elements has been reported in the
psychological literature, and interpretations
of color meaningfulness in compound-stimulus
experiments have been
largely on the basis of conjecture. The
present research retests the basic hypothesis
that stimulus selection is a
function of the relative meaningfulness
of the stimulus elements.

1 This article is based on a dissertation
submitted to the graduate school of St.
Louis University in partial fulfillment of the
requirements for the PhD degree. Special
thanks are due to Donald H. Kausler who
served as chairman of the author’s doctoral
committee and generously assisted throughout
this research.

2 Now at Loyola University.

In addition, previous research re- 
garding compound stimuli has gener- 
ally avoided the problem of functional 
stimulus selection as a function of response
characteristics. Some researchers
(e.g., Underwood et al.,
1962) have tended to neutralize this 
effect by using ubiquitous responses in 
which the assumed preexperimental re- 
lationship between stimuli and re- 
sponses is equal for all pairs. In one 
phase of the present study quite the 
opposite technique was used. That is, 
high preexperimentally determined re- 
sponses were used to test the influence 
of response factors on cue selection. It 
was hypothesized that stimulus selec- 
tion would be determined, in part, as 
a function of responses. More specific- 
ally, if a response is preexperimentally 
associated with one component of a 
compound stimulus, then that preexperimentally
associated stimulus would
have a higher probability of being selected
as the functional stimulus.
Finally, little concern has been re- 
ported on the different acquisition rates 
which may exist between compound 
and simple stimuli. It may be that Ss 
acquire paired associates faster under 
conditions of simple stimuli than under 
conditions of compound stimuli. This
hypothesis holds that the two stimuli 
within a compound stimulus may com- 
pete with each other for the role of 
functional stimulus. The influence of 
this hypothesized stimulus competition 
may be manifested in a slower acquisi- 
tion rate as contrasted with a noncom- 
petitive simple stimulus condition. 
These hypotheses were tested in the
present study. 


METHOD 


General design and procedure. —Two dis- 
tinct phases were completed. In Phase 1, 
meaningfulness of colors was measured in 
terms of S’s production of verbal responses 
and S’s ratings. In addition, responses to 
colors were collected and arranged in a 
frequency hierarchy. Thus it was possible 
to isolate common responses to these stimuli. 
A general description of the method and 
results of this phase appears elsewhere 
(cf. Solso, 1967). 

An underlying assumption of Phase 2 
was that high associates to stimuli, in this 
case colors, can be used as preexperimental 
associates. Let A represent the color green 
and let B represent grass. In Phase 1, B 
was given as a response to A by approxi- 
mately 75% of the standardization sample 
and appeared as the primary response about 
40% of the time. It is plausible, then, to 
consider the A-B relationship as being fairly 
well established by prelaboratory experi- 
ences. If A is then experimentally paired 
with B, assuming a previously established 
A-B relationship, then the paradigm repre- 
sents the classical maximum positive-transfer 
paradigm, A-B, A-B (first-stage preexperi- 
mental learning, second-stage experimental 
learning). Assuming the validity of the 
preexperimental relationship, it would be 
possible to test a wide range of transfer con- 
ditions, such as an A-B (preexperimental), 
A-Br paradigm in which colors and primary
responses to colors are re-paired. Or,
one may test an A-B (preexperimental),
A-C paradigm (same stimulus, new re- 
sponse). 

The stimulus elements of the lists em- 
ployed in Phase 2 consisted of colors, low- 
meaningful trigrams (LMT), and high- 
meaningful words (HMW). From these 
elements five different list conditions were 
prepared for which the intralist stimuli were 
composed of either single elements or com- 
pound elements. These conditions were: 
color alone, color + LMT, color + HMW, 
LMT alone, and HMW alone. In addition, 
three transfer paradigms were investigated.
These paradigms were identified with re- 
spect to the three stimulus conditions contain- 
ing color elements. For each paradigm, 
the first stage, A-B, consisted of preexperi- 
mental associations to colors as determined 
by the normative data of Phase 1, and the 
second or experimental stages were A-B, 
A-Br, and A-C (hereafter the paradigms 
will be identified by the second stage sym- 
bols only). For the remaining two stimu- 
lus conditions (LMT alone and HMW 
alone) there were three subgroups for each 
condition, varying in terms of the responses 
paired with the verbal elements. One subgroup
had the B responses of the color
conditions, a second the Br responses, and 
the third the C responses (cf. Table 1). In 
effect, the transfer paradigm represented by
each subgroup was the standard C-D (new 
stimuli and new responses with respect to the 
first stage) control paradigm, but the re- 
sponse terms varied across subgroups as a 
control for the differential responses neces- 
sarily employed in the lists containing color 
elements. 

Finally, the meaningfulness of the color 
elements on cue selection was also investi- 
gated by dichotomizing at the median the 
eight intralist color stimuli into high- and 
low-meaningful sets. The high-meaningful 
colors, as determined by both the produc- 
tion and rating values of Phase 1, were BLUE, 
GREEN, RED, and YELLOW, and the low-mean- 
ingful colors were BROWN, ORANGE, PINK, and 
PURPLE.

Thus the experimental design was basically 
a 3X5 factorial, with three transfer para- 
digms and five stimulus conditions, and there 
were 15 groups corresponding to the factorial 
combinations. In addition, for the color con- 
ditions there were two within-group levels of 
color meaningfulness, thus yielding a 3 X 3 X 
2 mixed factorial design. 

Lists—The  stimulus-response elements 
making up the eight-pair lists are given in

TABLE 1

STIMULI AND RESPONSES EMPLOYED
IN THE LISTS

          Stimuli                        Responses
Colors    Words    Trigrams    B          Br         C

BLUE      GAS      GWS         sky        blood      hat
BROWN     NEW      NXQ         dirt       pumpkin    rain
GREEN     BED      BXD         grass      sun        city
ORANGE    BAD      BWD         pumpkin    sky        door
PINK      GOT      GVS         baby       dirt       train
PURPLE    NOW      NHQ         grapes     baby       yord
RED       DAY      DWK         blood      grapes     hall
YELLOW    DIE      DHX         sun        grass      glass
RED DAY | DWK | blood grapes hall 
YELLOW | DIE | DHX | sun grass glass 


Table 1. The HMW and LMT components 
are identical to those used by Underwood 
et al. (1962) with the exception of the word NOW
and the trigram NHQ, since the present list
contained eight pairs as contrasted with 
seven pairs for the Underwood et al. list. 
The LMT stimulus elements have low asso- 
ciative connections between letters as deter- 
mined by Underwood and Schulz (1960), 
but they have high formal similarity to 
reduce the possibility that a single letter may 
serve as the functional stimulus. 
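The relation between the B and Br lists of Table 1 can be verified with a short sketch: the Br list should use exactly the same responses as the B list, with no color keeping its preexperimental associate. The dictionaries below are transcribed from Table 1, reading the Br entry for BROWN as "pumpkin" (an assumption about a garbled entry); the function name is hypothetical.

```python
# Responses paired with each color under the A-B and A-Br paradigms.
B = {
    "BLUE": "sky", "BROWN": "dirt", "GREEN": "grass", "ORANGE": "pumpkin",
    "PINK": "baby", "PURPLE": "grapes", "RED": "blood", "YELLOW": "sun",
}
BR = {
    "BLUE": "blood", "BROWN": "pumpkin", "GREEN": "sun", "ORANGE": "sky",
    "PINK": "dirt", "PURPLE": "baby", "RED": "grapes", "YELLOW": "grass",
}


def is_repairing(b: dict, br: dict) -> bool:
    """True when br re-pairs the same stimuli and responses as b
    without leaving any original stimulus-response pair intact."""
    same_stimuli = b.keys() == br.keys()
    same_responses = sorted(b.values()) == sorted(br.values())
    no_pair_kept = all(b[color] != br[color] for color in b)
    return same_stimuli and same_responses and no_pair_kept
```

A check of this kind confirms that every Br response is a high associate of some other color in the list, which is what distinguishes the A-Br paradigm from the A-C paradigm with its nonassociated responses.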

The color stimuli were composed of the 
same materials used for the standardization 
of meaningfulness and word associates. 
They were originally described by Brown 
and Lenneberg (1954). The color cards 
used in Phase 1 were cut in 6/8 X 7/8 in. 
rectangles and pasted onto memory drum 
tapes. The colors used, expressed in terms 
of Munsell notations, were: 5PB 4/10, 5YR
3/4, 2.5G 5/8, 2.5YR 6/14, 2.5R 7/8, 10P
3/10, 5R 4/14, 5Y 8/12. For compound-stimulus
conditions the verbal elements were
typed (pica type) on 2/8 X 3/8 in. white
paper and pasted in the center of the color
chips. Thus, the color comprised an equally
distributed 1/4-in. frame around the verbal element.
The B and Br responses were high
word associates to colors having similar
word-count frequency as the C terms. 

Procedure—A Lafayette memory drum was modified by enlarging
the viewing aperture to 4 X 5 in. to accommodate the entire
stimulus. Anticipation learning was used with a 2:2-sec.
rate of exposure and a 4-sec. intertrial interval. Three
different serial orders were presented.

In each of the 15 groups Ss continued to practice until the
criterion of one perfect trial was attained. Under
conditions in which a compound stimulus was used, Ss were
simply informed that the stimuli consisted of words (or
letters) surrounded by a color and that the word (or
letters) and the color would consistently be paired with
the same response.

Subjects —The Ss were 150 undergraduate 
students at St. Louis University who were 
alternately assigned to the 15 groups (five 
men and five women per group). Following 
acquisition of the conditions in which colors 
were used, Ss were asked if they had any 
known color deficiencies. 


RESULTS 


The means and standard deviations 
for all 15 groups are given in Table 2. 
The primary analysis of these data in- 
volved the 3 x 5 between-group fac- 
torial design, with all 15 groups repre- 
sented. The main effect of paradigms 
and the main effect of stimulus condi- 
tions were significant, F (2, 135) = 
16.48, p < .001, and F (4, 135) = 
34.50, p < .001, respectively. 
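
The error degrees of freedom reported above follow from the group structure described under Subjects (15 groups of 10 Ss). The short check below is only an illustrative sketch: the function name `between_groups_df` is hypothetical, and it assumes equal cell sizes and a fully crossed between-group design.

```python
def between_groups_df(levels_a: int, levels_b: int, n_per_cell: int) -> dict:
    """Degrees of freedom for a two-way between-group factorial ANOVA
    with equal cell sizes (an assumption of this sketch)."""
    cells = levels_a * levels_b
    total_n = cells * n_per_cell
    return {
        "A": levels_a - 1,                       # paradigms
        "B": levels_b - 1,                       # stimulus conditions
        "AxB": (levels_a - 1) * (levels_b - 1),  # interaction
        "error": total_n - cells,                # within-cell error
    }

# 3 paradigms x 5 stimulus conditions, 10 Ss per group (150 Ss in all)
df = between_groups_df(3, 5, 10)
print(df)
```

With 3 paradigms, 5 stimulus conditions, and 10 Ss per cell, the paradigm and stimulus-condition effects carry 2 and 4 degrees of freedom against 135 for error, which agrees with the F ratios cited in the text.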

Separate analyses of variance and 
subsequent multiple comparisons 
(Duncan’s) between paradigms were 
conducted, when appropriate, for each 
stimulus condition. The between-para- 
digm effect was significant for the color 
alone, color + LMT, and color + 
HMW conditions, p’s < .01, but not 
for the LMT and HMW conditions. 

The failure to find a significant between-paradigm effect
for the LMT and HMW conditions suggests both that the B and
C response words were comparable in difficulty and that
associations between the verbal elements and the B and C
responses were comparable in difficulty with the
associations between the verbal elements and the Br
responses. Such comparability is very important in
interpreting the between-paradigm effects for the
conditions containing color elements.


TABLE 2

MEANS AND STANDARD DEVIATIONS FOR 15 ACQUISITION CONDITIONS

Acquisition |                     Condition
paradigm    | Color | Color + LMT | Color + HMW | HMW   | LMT
A-B    M    |  2.50 |  3.00       |  4.00       | 15.30 | 23.10
       SD   |   .50 |   .89       |  1.70       |  4.07 |  5.59
A-Br   M    | 10.80 | 15.80       | 13.70       | 10.45 | 22.70
       SD   |  5.79 |  6.27       |  6.45       |  5.39 |  7.37
A-C    M    |  9.20 | 14.00       | 16.10       | 14.10 | 21.80
       SD   |  4.17 |  3.26       |  8.57       |  5.15 |  4.53

The A-B paradigm was superior to the A-Br and A-C paradigms
for the color alone, color + LMT, and color + HMW
conditions, with p's of <.001, <.01, and <.001,
respectively.

A similar analysis was conducted be- 
tween stimulus conditions for each 
transfer paradigm. The effect of stim- 
ulus conditions was significant, p < 
.001, and Duncan’s multiple range test 
was employed to contrast means be- 
tween specific stimulus conditions. 
Considering the A-B paradigm first, the color alone
condition was superior to all of the other conditions.
Although the color alone vs. color + LMT comparison fell
short of significance, p > .10, the color alone vs. color +
HMW comparison approached significance, p < .10, and the
color alone vs. HMW alone and LMT alone comparisons were
clearly significant, p's < .001. For the A-C paradigm the
color alone condition was significantly superior to all
other conditions, with color vs. color + LMT, p < .01,
color vs. color + HMW, p < .01, color vs. HMW, p < .01,
and color vs. LMT, p < .001.

As an additional analysis, the data for the groups receiving
color as a component of the stimulus (excluding all HMW and
LMT groups) were subjected to a 3 X 3 X 2 mixed analysis of
variance, with color meaningfulness as the within-group
factor. The main effects of both paradigms and stimulus
conditions were again significant, F (2, 81) = 37.02,
p < .001, and F (2, 81) = 4.42, p < .05, respectively, as
was the main effect of meaningfulness, F (1, 81) = 6.77,
p < .05. However, none of the interactions approached
significance, Fs < 1.56.
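
The 81 error degrees of freedom in the mixed analysis can be checked in the same way. The sketch below assumes, as in the Subjects section, 10 Ss in each of the 9 color-containing groups; the function name `mixed_design_df` is hypothetical and covers only one within-group factor.

```python
def mixed_design_df(levels_a: int, levels_b: int,
                    within_levels: int, n_per_group: int) -> dict:
    """Degrees of freedom for an A x B (between) x W (within) mixed
    ANOVA with equal group sizes (an assumption of this sketch)."""
    groups = levels_a * levels_b
    between_error = groups * n_per_group - groups
    return {
        "A": levels_a - 1,
        "B": levels_b - 1,
        "between_error": between_error,
        "W": within_levels - 1,
        # the within-group effect is tested against Ss-within-groups x W
        "within_error": (within_levels - 1) * between_error,
    }

# 3 paradigms x 3 color-containing conditions x 2 meaningfulness levels
mixed_df = mixed_design_df(3, 3, 2, 10)
print(mixed_df)
```

Both error terms come to 81, matching the F (2, 81) and F (1, 81) values reported for the between-group and within-group effects.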


Discussion 


The paired associates in which high- 
meaningful color stimuli appeared were 
learned faster than the paired associates 
in which low-meaningful color stimuli 
appeared. The possibility that colors may differ in
meaningfulness and that this factor may influence
acquisition has been largely ignored in the contemporary
research using colors as stimuli. But even with the
evidence presented in this study, a definitive statement
regarding the influence of various colors’ meaningfulness
on acquisition is impossible.

The capacity of colors alone to func- 
tion as effective stimuli was demonstrated. 
The superiority of colors as stimuli over
verbal elements was found in the A-B
and A-C paradigms. In the A-C paradigm
the verbal responses were selected from 
high-frequency words specifically not 
associated with the stimulus colors. 
Learning of the eight paired associates 
was accomplished with significantly fewer 
trials when colors alone served as stimuli 
than when LMTs or HMWs served as 
stimuli. The superiority of colors as 
stimuli over that of HMWs as stimuli is 
somewhat surprising. It may be that 
color meaningfulness is significantly 
greater than for HMW and that this 
higher meaningfulness facilitated acquisi- 
tion. Or, it may be that colors as stimuli 
are more easily differentiated than 
HMWs. The HMWs with which color stimuli were contrasted did
have high formal similarity (e.g., NEW, NOW, BED, BAD). It
should be noted, however, that such an argument does not
exclude the problem of stimulus differentiation between
color stimuli but rather that stimulus differentiation may
have been easier between color stimuli than between HMW
stimuli.

The tendency for Ss to select a functional stimulus from
compound stimuli was evident in the A-B paradigm for the
three conditions in which color was part of the stimulus.
Acquisition under the two compound conditions was similar
to that under color alone and significantly superior to the
two conditions in which the verbal elements alone were
presented. These results lend considerable support to the
hypothesis of cue selection. However, the selection of
color as the functional stimulus in the A-B condition was
probably not exclusively determined by the relative
meaningfulness of the two elements in the compound
stimulus. That is, the nature of the response terms
probably played an important role in that the responses
were high word associates to colors. For example, Ss
confronted with the task of associating the verbal stimulus
DAY surrounded by a red color with the response blood may
select the color as a cue on the basis of attempting to
find the most appropriate cue for the response. In this
sense, forward associative learning may be closely related
to R-S or “backward” learning.

A compound stimulus composed of 
stimuli of nearly equal meaningfulness 
may actually retard acquisition. This 
phenomenon may be a result of stimulus 
competition. In the present study con- 
siderable support was found for this 
argument. In the A-B paradigm acquisi- 
tion was superior if the stimuli consisted 
of colors alone. A decrement in learning 
was noted in this paradigm with the addi- 
tion of LMTs and HMWs to the color 
stimuli. These results only approached 
statistical significance, but this may be 
due to the rapid acquisition in the A-B 
paradigm which masked the effects of 
stimulus competition. However, in the 
A-C paradigm, where acquisition was 
generally slower than in the A-B para- 
digm, the effects of stimulus competition 
are more apparent. 

In the A-C paradigm colors alone were 
again the most effective stimulus condi- 
tion. A significant decrement in acquisi- 
tion was noted if colors + LMT served 
as stimuli, and an even greater decrement 
occurred if colors + HMW appeared as 
compound stimuli. In addition, the compound stimulus of
color + LMT tended to lead to more rapid acquisition than
the compound stimulus of color + HMW.

The slightly retarding effects of the 
compound-stimulus condition of colors 
+ LMT, as contrasted with the relatively 
rapid acquisition with only color stimuli, 
may be accounted for in terms of Ss’ dif- 
ficulty in the selection of a functional 
stimulus. Colors alone as stimuli allow 
little possibility for cue selection in that 
the stimulus is unitary. But colors + 
LMT do at least present an opportunity 
for cue selection. It may be that Ss 
tentatively select LMTs as functional 
stimuli in at least some of the eight pairs 
but find associative acquisition somewhat 
difficult. Very early in practice, Ss may 
abandon the use of the LMT and attend 
to the colors as functional stimuli. The 
net effect of this early vacillation between 
stimuli may manifest itself in retarded 
acquisition. Now consider the more sig- 
nificant retardation observed in the color 
+ HMW condition. Under these sed- 
ulous learning circumstances stimulus 
competition between two desirable func- 
tional stimuli is more obvious. Some Ss 
may attend only to the HMWs through- 
out acquisition; some may vacillate be- 
tween selection of a color stimulus on 
one trial and an HMW stimulus on 
another trial; some may consistently se- 
lect only color stimuli. The net effect of 
this hypothesized active stimulus compe- 
tition is a reduced acquisition rate. 

The concept of stimulus competition with compound stimuli
is compatible with the R-S model mentioned earlier.
Consider the following conceptualization: Let $St_1$
represent one stimulus of a compound stimulus, $St_2$
represent the other stimulus, R represent a response, and
$\frac{St_1}{St_2}$ represent the compound stimulus. It may
be that S selects one stimulus as a functional stimulus
during the initial exposure to the compound stimulus. And
then, when the complete pair is presented, he may attempt
to associate the R with the selected stimulus. For example,
S may tentatively select $St_1$ from $\frac{St_1}{St_2}$
and, upon presentation of the pair $\frac{St_1}{St_2}$-R,
begin to accrue some habit strength between $St_1$-R. This
increment in habit strength between $St_1$ and R may
persist for several trials. But, if one assumes some degree
of R-S or backward learning is simultaneously occurring,
then S is confronted with two stimuli, both vying for use
as a functional stimulus. Diagrammatically expressed, this
may appear as
$R\left\{\begin{array}{l} St_1 \\ St_2 \end{array}\right.$.
Furthermore, S may discover that the original choice of a
functional stimulus was inappropriate and that $St_2$ is
easier to associate with the response. If, in fact, S is
confronted with this dilemma of stimulus competition, then
the reduced acquisition rate of compound-stimuli
experiments is easily understandable in terms of two
well-documented verbal learning principles. First, the
habit strength built up between $St_1$ and R is in effect
“wasted” on subsequent trials if at a later point S selects
$St_2$ as a functional stimulus. Second, if backward
association is occurring, that is,
$R\left\{\begin{array}{l} St_1 \\ St_2 \end{array}\right.$,
then the formation of R-$St_1$ on early trials may be a
source of interference when S later begins to associate
R-$St_2$. An analogous paradigm in forward learning would
be A-B, A-C learning in which S learns a B response to an A
stimulus and then learns a C response to the same A
stimulus. Response competition between the B and C terms is
well documented in the contemporary verbal learning
research. The concept of stimulus competition may find wide
application in compound-stimulus experiments.


DETECTION OF MOTION DURING BINOCULAR
RIVALRY SUPPRESSION *


ROBERT FOX AND RONALD CHECK
Vanderbilt University 


During binocular rivalry suppression of a static target, an element 
of the target was put in motion and detectability of the movement was 
assessed by reaction time (RT). Rate and pattern of movement 
and the method of rivalry induction were varied in 4 experiments. 
In all cases RT to movement during suppression was significantly in- 
creased relative to nonsuppression control conditions. The magnitude 
of the increase was inversely related to the strength of the movement 
stimulus. The results support the hypothesis that suppression is an 
inhibitory state that acts nonselectively on all stimulus attributes
falling within the suppressed retinal region.


During binocular rivalry, dissimilar 
monocular stimuli undergo alternating 
phenomenal suppression. The dura- 
tion of the suppression phase of each 
eye is on the order of seconds, and sup- 
pression occurs even when the rivaling 
stimuli are physically intense and con- 
figurally complex. The existence of 
suppression suggests that the visual 
system contains an inhibitory mecha- 
nism of considerable power. Yet, apart 
from the limited number of inferences 
that follow directly from phenomenal 
observation of rivalry, little is known 
about the operation of the suppression 
mechanism. 

More comprehensive information 
about the characteristics of suppression 
can be obtained by the test-probe 
method whereby differences in visual 
sensitivity between suppression and 
dominance phases are assessed by re- 
sponse to an appropriate test stimulus 
presented in the vicinity of the rivalry 
target. 

Investigations of rivalry suppression by the test-probe
method suggest that the suppression phase attenuates test
stimuli while the dominance phase and conditions of
monocular viewing do not (e.g., Fox, 1963; Fox & Check,
1966). The present experiments are concerned with the
selectivity of the inhibitory suppression state. When a
target is suppressed it is clear that suppression has acted
upon the stimulus attributes comprising the suppressed
target, but it is not known if suppression is confined to
those attributes or if it operates more broadly to
attributes not represented in the suppressed target.

Blindness. Some of these data were presented at the meeting
of the Psychonomic Society, 1965.

Two general alternatives can be 
considered. First, there is the hypoth- 
esis based upon the assumption of a 
nonselective inhibitory process that re- 
quires all classes of test stimuli be 
attenuated during suppression inde- 
pendent of their similarity to the sup- 
pressed target and regardless of their 
intrinsic prepotency. Second, there is 
a group of specific hypotheses gener- 
ated from the assumption of selective 
suppression, holding in common the 
concept of a suppression process operating
analytically on test stimuli so that
certain types are blocked while others 
pass unattenuated. 

To determine which of these two general alternatives is
more nearly correct, the present study utilized test
stimuli designed to maximize the opportunity for
manifestation of a selective rivalry suppression process.
This strategy requires that two assumptions be made about
the hypothesized selective suppression mechanism. First,
the likelihood of inhibition of a test stimulus decreases
as a function of increasing differences between test
stimulus and suppressed stimulus. Second, a selective
suppression mechanism is biased in favor of more salient or
important stimulus events.

A test input that satisfies both assumptions
is a moving stimulus. By
any definition of similarity a moving 
stimulus differs considerably from a 
static one. With respect to stimulus 
salience, it is generally agreed that mo- 
tion is highly prepotent in situations 
where stimulus attributes compete for 
attention. If a moving stimulus is 
detected equally well under both sup- 
pression and nonsuppression condi- 
tions, then the suppression mechanism 
has some capacity for selection. Con- 
versely, if detection is impaired during 
suppression, the hypothesis of nonse- 
lective suppression receives support. 

Experiment I tested the detectability 
of slow rotary motion during rivalry 
induced by moving contours. Experi- 
ment II tested the detectability of both 
rotary and linear motion during rivalry 
induced by foveally located static tar- 
gets. Experiment III further tested 
the detectability of linear motion. Ex- 
periment IV tested the hypothesis that 
response inhibition is correlated with 
suppression. 


EXPERIMENT I 
Method 


Subjects—Two Ss, RF and RC, the 
authors of this report, were employed. Both 
had extensive experience in observing rivalry. 
RF’s corrected vision was: left eye acuity 
20/25, right eye acuity 20/25, binocular acuity 


20/25, vertical phoria .17 prism  diopter, 
lateral phoria .66 prism diopter. RC’s cor- 
rected vision was: left eye acuity 20/20, right 
eye acuity 20/20, binocular acuity 20/18, ver- 
tical phoria .17 prism diopter, lateral phoria 
.33 prism diopter. The Modified Orthorater 
was used for visual testing. 
Apparatus—The basic apparatus (for all 
experiments) was a large Wheatstone-type 
stereoscope constructed from optic bench 
components and enclosed in a light-tight 
housing. The left and right visual fields 
were 7° squares located 688 mm. from the 
viewing port and transilluminated by in- 
candescent lamps. These fields contained 
the primary rivalry targets; by optical means 
the movement-detection stimulus appeared 
in the plane of the right field. The Ss 
viewed both fields in binocular coincidence 
through 2-mm. artificial pupils. A head- 
and-chin rest minimized head movements. 
White noise masked the onset of the move- 
ment-display motor and other auditory cues. 
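
The angular sizes quoted for the apparatus can be converted to linear extents on the field surface with the usual visual-angle relation, size = 2 d tan(θ/2). The sketch below is illustrative only: it assumes the 688-mm distance to the viewing port approximates the optical distance from the eye to the field, and the function name `extent_mm` is hypothetical.

```python
import math

def extent_mm(angle_deg: float, distance_mm: float) -> float:
    """Linear extent subtending `angle_deg` at `distance_mm`,
    using size = 2 * d * tan(angle / 2)."""
    return 2.0 * distance_mm * math.tan(math.radians(angle_deg) / 2.0)

# 7-degree square field viewed at 688 mm (distance taken from the text)
field_side = extent_mm(7.0, 688.0)

# 11-min. fixation circle described for Exp. I (11/60 of a degree)
fixation_diameter = extent_mm(11.0 / 60.0, 688.0)

print(round(field_side, 1), round(fixation_diameter, 2))
```

Under these assumptions a 7° field is roughly 84 mm on a side, and an 11-min. fixation circle is a little over 2 mm in diameter.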
The movement apparatus consisted of a 
lamphouse mounted perpendicularly to the 
axis of a large wheel driven by a variable- 
speed motor. The lamphouse carried and 
projected the movement stimulus. By vary- 
ing the position of the lamphouse along the 
radius of the wheel, either rotary or linear 
motion of the stimulus could be obtained. 
To signal the rivalry fluctuations, S de- 
pressed and released a telegraph-type lever 
switch with his right hand. One output of 
the switch was connected to an event re- 
corder while a second output was connected, 
at the option of E, to a set of timing and 
logic units (Scientific Prototype Co.) which 
initiated a test trial. The units simultane- 
ously started the movement display and a 
timer (Hunter Klockounter) 2 sec. after 
S signaled the start of the phase predetermined
by E's schedule. The S indicated
detection of movement by releasing, with his
left hand, a switch that stopped the clock.
In Fig. 1 the rivalry and motion stimuli are 
shown schematically for all experiments. 
Note that the illustrations are not to scale. 
Some features have been exaggerated and 
others eliminated in order to indicate more 
clearly the path of the motion stimulus. 
For Exp. I an 11-min. black outline circle served as a
fixation point in the left monocular field. Over the
surface of that field dark contours swept up and down in an
apparently random order at the rate of 6°/sec. The contours
were shadows cast by 1° strips mounted on an endless
plastic belt running in the optic path between the field
surface and its light source. The luminance of the field
was varied (220 ftl. for RF and 440 ftl. for RC) so that
the duration of the suppression phases for each S was
approximately equal.

[Figure 1 panels: Rotary motion, Exp. I; Rotary motion, Exp. II; Linear motion, Exp. II, III, and IV; each shown as a left eye view and a right eye view.]

Fig. 1. Schematic representation of the stimuli used in
Exp. I-IV. (Solid lines indicate the initial position of
the moving stimuli; dashed lines indicate movement.
Drawings are not to scale. Several of the stimuli, e.g.,
color and small contours, are omitted for clarity.)

In the right monocular field the fixation 
point was a white-filled 11-min. circle seen 
against a black ground. On the horizontal 
plane of the fixation point and 1°11 min. to 
the left, a small (28 min. x 28 min.) geo- 
metric figure, called the suppression target, 
was located; this target consisted of three 
white lines joined in a U configuration. 
When the left and right fields were united 
binocularly, the suppression target appeared 
as a figure against the ground of the left 
field. As S fixated the fused fixation points, 
the suppression target disappeared and reap- 
peared following the temporal pattern char- 
acteristic of rivalry. The phenomenal tran- 


sitions were abrupt and unitary, resembling 
an intermittent physical presentation; fur- 
ther, the left field ground surrounding the 
target never became suppressed. The mov- 
ing contours in the left field are responsible 
for suppression of the right field target. 
The use of movement to facilitate suppres- 
sion in the contralateral eye is not new, 
although the technique is not widely known. 
A more complete description of the method 
can be found in Fox (1963). Other reports 
describing the efficacy of movement in pro- 
ducing suppression include those of Breese 
(1899) and Grindley and Townsend (1965). 
The movement stimulus was a replica 
of the right field suppression target; it was 
superimposed on the suppression target so 
that only a single target was seen before 
movement began. The motion to be detected 
was rotation of this stimulus about its axis; 
the suppression target remained stationary. 
The angular velocity of the motion stimu- 
lus was 9 min/sec, measured during constant 
rotation of the display. The luminance of 
the combined targets for RF was .6 ftl.; for 
RC, 2.2 ftl. In the second phase of Exp. I, 
a green filter (Wratten 61) covered the 
stationary suppression target and a red filter 
(Wratten 23a) covered the movement tar- 
get. With the targets in coincidence, only 
a single yellow-white target was visible; con- 
comitant with rotation, the red and green 
elements of the targets were exposed. 
Procedure—The stimulus fields were carefully aligned for
each S so that fusion could be maintained without strain.
Extensive practice in observing and signaling rivalry
changes was given. The Ss assumed a high criterion for a
rivalry change, reporting only complete dominance and
suppression states. Preliminary tests demonstrated that the
rivalry observation response did not interact with the RT
response. Following the plan of a randomized block design,
10 trials were run in a single block; 5 suppression and 5
nonsuppression trials were intermixed in accord with a
predetermined quasi-random schedule. Each trial consisted
of two through five cycles of rivalry. During the
appropriate phase of one of the cycles, selected by a
predetermined random schedule, rotation of the movement
stimulus began. The Ss were instructed to release the RT
switch whenever movement was detected. Differences in
latency of reaction between suppression and dominance
periods were the basic measure of the effect of
suppression. However, two differing phenomenal reports were
also obtainable during suppression: (a) a nondetection case
where motion was seen only after the suppression state
apparently terminated spontaneously, and (b) a detection
case where at least a fragment of the motion stimulus
appeared to break through the suppression state. These
reports were paralleled by differences in RT (i.e., when
motion was not seen until spontaneous termination of the
suppression period, RT duration approximated duration of
the suppression period). While there were several ways to
divide RTs into detection and nondetection categories, the
pattern of results made this distinction unnecessary.
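
The block structure just described (10 trials, 5 per rivalry state, two through five rivalry cycles per trial, with the probe placed in one randomly chosen cycle) can be sketched as a schedule generator. This is a hypothetical illustration: the constraints that made the original schedule "quasi-random" are not stated in the text, so a plain shuffle is assumed, and the names `make_block`, `n_cycles`, and `probe_cycle` are inventions of this sketch.

```python
import random

def make_block(rng: random.Random, n_per_state: int = 5,
               min_cycles: int = 2, max_cycles: int = 5) -> list:
    """Build one block of trials with equal numbers of suppression and
    nonsuppression probes in shuffled order (assumed ordering rule)."""
    states = (["suppression"] * n_per_state
              + ["nonsuppression"] * n_per_state)
    rng.shuffle(states)
    block = []
    for state in states:
        n_cycles = rng.randint(min_cycles, max_cycles)  # cycles in this trial
        probe_cycle = rng.randint(1, n_cycles)          # cycle in which motion starts
        block.append({"state": state,
                      "n_cycles": n_cycles,
                      "probe_cycle": probe_cycle})
    return block

block = make_block(random.Random(0))  # fixed seed so the schedule is reproducible
for trial in block:
    print(trial)
```

Fixing the seed stands in for the "predetermined" aspect of the schedule: E can prepare the sequence in advance and S cannot anticipate on which cycle the motion will begin.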

To control for potential errors in signal- 
ing rivalry changes, Ss gave a confirmatory 
response after each trial indicating the 
rivalry state present when motion began. 
Only confirmed trials were accepted. Since 
Ss were highly practiced, and the motion 
started at the beginning of the rivalry phases, 
the number of disconfirmed trials proved to 
be negligible. 

The addition of the chromatic filters re- 
duced the luminance of the suppression and 
movement targets and increased the average 
duration of the suppression phases for both 
Ss. This increase was so substantial for RF 
that a set of trials was run without artificial 
pupils. The removal of the pupils effectively
increased target luminance and reduced
suppression duration.


Results and Discussion 


In the first two rows of Table 1 mean detection times under
suppression and nonsuppression are given for the movement
condition alone. In the third and fourth rows RTs are given
for the combined movement and color-change condition. The
last row contains data from RF obtained without artificial
pupils.


TABLE 1

REACTION TIME TO MOVEMENT AND TO COLOR CHANGE

     |    | Suppression  | Nonsuppression | Significance (d)
S    | n  | M     | SD   | M     | SD     | p
RF a | 50 | 2.660 | 1.02 |  .959 | .19    | <.001
RC a | 30 | 2.108 | 1.27 |  .962 | .13    | <.01
RF b | 20 | 7.985 | 4.91 | 1.356 | .36    | <.01
RC b | 20 | 3.240 | 1.33 | 1.208 | .29    | <.01
RF c | 30 | 2.058 |  .63 | 1.015 | .18    | <.01

a Movement only; no color change.
b Movement and color change.
c Movement and color change without artificial pupils.
d Calculated by Mann-Whitney U test.

The differences between suppression 
and nonsuppression are large and sig- 
nificant (Mann-Whitney U test). In 
fact, there is virtually no overlap be- 
tween the suppression and nonsuppres- 
sion RT distributions. The most di- 
rect interpretation of these data is that 
the suppression state attenuates the 
neural signal generated by the moving 
target and by the combination of mo- 
tion and color change. The RTs under 
nonsuppression probably represent the 
minimum obtainable, given the inertia 
of the movement display and the 
velocity of the target. The generality 
of these results was investigated in 
Exp. II and III. 
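
The Mann-Whitney U statistic used for these comparisons counts, over all pairs of observations drawn from the two conditions, how often one condition exceeds the other. The sketch below computes it directly; the RT values are invented for illustration (they are not the experimental data), and the function name `mann_whitney_u` is hypothetical.

```python
def mann_whitney_u(x, y):
    """Return (U_x, U_y). U_x counts the (x_i, y_j) pairs with
    x_i > y_j, with ties counted as one half; U_x + U_y = len(x) * len(y)."""
    u_x = 0.0
    for xi in x:
        for yj in y:
            if xi > yj:
                u_x += 1.0
            elif xi == yj:
                u_x += 0.5
    u_y = len(x) * len(y) - u_x
    return u_x, u_y

# Hypothetical RTs in seconds, chosen so that the two samples do not overlap
suppression_rt = [2.1, 3.4, 1.9, 2.8, 2.5]
nonsuppression_rt = [0.90, 1.00, 0.80, 1.10, 0.95]

u_sup, u_non = mann_whitney_u(suppression_rt, nonsuppression_rt)
u = min(u_sup, u_non)  # the smaller U is the one referred to tables
print(u_sup, u_non, u)
```

When the two RT distributions do not overlap at all, as in this made-up example, the smaller U is zero, the most extreme value the statistic can take. That is the situation the text describes as "virtually no overlap."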


EXPERIMENT II 
Method 


In this experiment rivalry was induced by 
foveally placed static targets. Two motion- 
detection conditions were employed: the 9 
min/sec rotary motion of Exp. I and an 
8°/sec linear motion condition. 

Subjects—The Ss were RF and RC from 
Exp. I and two new Ss, CY and LS, both 
of whom were experienced in observing 
rivalry but naive regarding purpose. These 
new Ss were paid volunteers selected for 
their excellent uncorrected vision, i.e., acuity
20/20 and essentially orthophoric.

Apparatus and procedure—Apparatus and 
procedure were the same as those used in 
Exp. I except for the rivalry targets and the 
motion-detection stimulus. The left and 
right rivalry targets consisted of 1°20 min. 
squares located in the center of the 7° fields 
of the stereoscope. The borders of the 
squares were formed by black stripes; the 
50-min. interior of each square was covered 
by colored gelatin theatrical filters, green 
for the left square and red for the right 
square. Thin diagonal stripes (17 min.) 
covered the colored areas, with the stripes 
of each field oriented in 90° opposition to 
each other. 

The black borders of the rivalry targets served to maintain
fusion of the squares. To facilitate fusion of the 7°
fields, two vertical stripes were placed at each of the
vertical field borders.

The motion-detection stimulus consisted of 
a 2 min. X 1 min. red rectangle projected by 
the lamphouse onto the center of the right 
field rivalry target. The rectangle, oriented 
45° from the vertical, was clearly visible 
against the ground of the rivalry target. 
The presence of this rectangle substantially 
increased the dominance of the right field; to 
compensate for this increase, a similar green 
rectangle was placed in the center of the 
left field rivalry target and oriented ver- 
tically. Each rectangle and its background 
target rivaled as a single unit. The lumi- 
nance of the fields was adjusted slightly 
for each S in order to secure equal alterna- 
tion of the fields and to obtain clear visibility 
of the movement rectangle under nonsuppres- 
sion conditions. Overall, the luminance 
levels were approximately 10 ftl. 

In the rotary motion condition the right field rectangle
rotated about its axis at 9 min/sec. For the linear motion
condition the rectangle moved to the right of its center
starting position at 8°/sec. The rotary motion condition
was completed before the linear condition because of
apparatus requirements. As a result, LS was not available
for the linear condition. Through an error, CY received an
additional 20 trials under the linear condition.


Results and Discussion 


Table 2 contains the mean RTs for each S under each
condition. Also included are the mean suppression-phase
durations for each S, based on a sample of durations
obtained throughout the experiment. Mean RT differences
between suppression and nonsuppression were evaluated by
the Mann-Whitney U test, which indicated all differences to
be significant (p < .001). Under the 9 min/sec velocity
condition the overlap between suppression and
nonsuppression RT distributions was negligible. As in
Exp. I, on a number of trials motion was not detected until
the suppression state terminated spontaneously.

The relatively fast RTs found under the 8°/sec linear
motion condition are due to the increase in signal strength
produced by the higher velocity. Although motion was
invariably detected during the suppression state, RTs under
suppression were significantly longer than comparable
nonsuppression RTs. The difference in detection time
between conditions suggests that some attenuation of the
motion signal is present even at the higher velocity.


TABLE 2

REACTION TIMES TO ROTARY AND LINEAR MOVEMENT AND
SUPPRESSION DURATIONS FOR EACH S

     |    | Suppression | Suppression  | Nonsuppression | Signifi-
     |    | duration    | (sec.)       | (sec.)         | cance (c)
S    | n  | M           | M     | SD   | M     | SD     | p
LS a | 40 | 2.65        | 1.890 | 1.27 |  .955 | .15    | <.001
RC a | 40 | 5.12        | 3.268 | 1.91 | 1.378 | .29    | <.001
CY a | 40 | 3.83        | 2.158 |  .62 | 1.149 | .19    | <.001
RF a | 40 | 9.43        | 5.172 | 3.48 | 1.091 | .16    | <.001
RC b | 60 | 5.12        |  .624 |  .31 |  .461 | .06    | <.001
CY b | 80 | 3.83        |  .577 |  .09 |  .514 | .07    | <.001
RF b | 60 | 9.43        |  .598 |  .15 |  .465 | .08    | <.001

a Rotary movement: 9 min/sec.
b Linear movement: 8°/sec.
c Calculated by Mann-Whitney U test.

These results are consistent with 
Exp. I; moreover, they demonstrate 
that the inhibitory effects of suppres- 
sion are not specific to the method of 
rivalry induction nor to the pattern of 
movement employed. The relation- 
ship between the magnitude of the in- 
hibitory suppression effect and the 
strength of the test input appears to be 
inverse. This relationship is examined 
more completely in the next experi- 
ment. 


EXPERIMENT III 
Method 


In this experiment, detection of linear mo- 
tion under two velocity conditions (2°/sec 
and 5°/sec) was investigated with procedures 
highly similar to those used in Exp. II. 

Subjects—Three Ss were employed: LS 
from Exp. II, and FR and DC, both paid 
experienced observers with excellent
uncorrected vision.

Apparatus and procedure—The equipment and stimuli were
identical to those used in Exp. II. Trials were blocked in
groups of 10; the two velocity conditions were
counterbalanced.


Results and Discussion 


In Table 3 the mean RTs for each S
under each condition are given, along 
with mean suppression durations based 
on a representative sample of dura- 
tions taken throughout the experiment. 

All suppression and nonsuppression 
RT differences and the Rivalry State 
X Velocity interaction are significant 
(p < .001) as tested by the Mann- 
Whitney U test. As in the 8°/sec 
velocity condition of Exp. II, motion 
tended to be detected during suppres- 
sion; however, for both the 2°/sec and 
5°/sec velocity conditions RTs under 
suppression were longer than compar- 
able nonsuppression RTs. Further, 
the magnitude of the difference be- 
tween suppression and nonsuppression 
RTs decreased with an increase in 
target velocity. These results are con- 
sistent with the suggestion from Exp. 
II that the relationship between stim- 
ulus strength and suppression strength 
is inverse. Such a relation is evident 
in Fig. 2, where the results of both 
Exp. II and III are combined. 
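
The decrease in the suppression effect with velocity can be checked by simple arithmetic on the Table 3 means. The following is a minimal sketch only: the dictionary layout and the function name are illustrative assumptions, and the values are the per-S mean RTs as read from Table 3 (with decimal points restored where the scan dropped them).

```python
# Mean RTs (sec.) per S as read from Table 3: (suppression, nonsuppression).
# The data layout and names are illustrative assumptions, not the authors' analysis.
table3 = {
    "2 deg/sec": {"DC": (1.237, 0.807), "FR": (1.075, 0.847), "LS": (0.904, 0.776)},
    "5 deg/sec": {"DC": (0.673, 0.544), "FR": (0.575, 0.482), "LS": (0.526, 0.454)},
}


def mean_difference(condition):
    """Average suppression-minus-nonsuppression RT difference across Ss."""
    diffs = [supp - nonsupp for supp, nonsupp in condition.values()]
    return sum(diffs) / len(diffs)


differences = {velocity: mean_difference(rows) for velocity, rows in table3.items()}
for velocity, diff in differences.items():
    print(f"{velocity}: mean RT difference = {diff:.3f} sec.")
```

On these values the mean difference is roughly .26 sec. at 2°/sec and roughly .10 sec. at 5°/sec, which is the direction described in the text.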


TABLE 3

REACTION TIME TO LINEAR MOVEMENT OF VARIED RATE AND SUPPRESSION DURATIONS FOR EACH S

     |     | Suppression | Suppression RT | Nonsuppression RT | Signifi-
     |     | duration    | (sec.)         | (sec.)            | cance c
S    | n   | M           | M     | SD     | M     | SD        | p
DC a | 100 | 3.56        | 1.237 | .19    | .807  | .13       | <.001
FR a | 100 | 3.34        | 1.075 | .23    | .847  | .19       | <.001
LS a | 100 | 3.25        | .904  | .21    | .776  | .22       | <.001
DC b | 100 | 3.56        | .673  | .07    | .544  | .07       | <.001
FR b | 100 | 3.34        | .575  | .06    | .482  | .04       | <.001
LS b | 100 | 3.25        | .526  | .05    | .454  | .04       | <.001

a Linear movement: 2°/sec.
b Linear movement: 5°/sec.
c Calculated by Mann-Whitney U test.


[Figure 2: mean detection time (seconds) under suppression and
nonsuppression, shown for rotary movement (9 min/sec) and linear
movement (2°/sec, 5°/sec, and 8°/sec).]

Fig. 2. Differences in RT to movement
stimulus between suppression and nonsuppression
conditions for Exp. II (9 min/sec
and 8°/sec) and for Exp. III (2°/sec and
5°/sec).


EXPERIMENT IV 


This experiment was concerned with 
the validity of RT as a measure of vis- 
ual sensitivity differences between sup- 
pression and nonsuppression states. It 
is possible that RT differences could, in 
part, reflect changes in the response 
criterion. In fact, there is some reason 
to suspect that the criterion for re- 
sponding may have changed in a direc- 
tion favoring faster RT during sup- 
pression since all Ss reported being 
more attentive during suppression 
phases, detection of motion of a stimu- 
lus not seen being more interesting 
than detection of a clearly visible 
stimulus. But this claim is difficult 
to evaluate in view of the negligible 
number of false-positive reactions for 
both conditions. Some pilot observa- 
tions suggested that knowledge of mo- 
tion occurring during suppression did 
not facilitate detection. During pre- 
liminary work with the 9 min/sec 
rotary motion condition, the start and 
rotation of the movement apparatus 
was quite audible. Yet on many trials 
Ss reported no visual indication of 
motion during suppression even though 
they knew motion was present. 
Although it is unlikely that shifts in
the response criterion could account for 
the increased RT under suppression, a 
reasonable alternative hypothesis is 
that a generalized motor inhibition ac- 
companies the suppression state. To 
test that hypothesis, auditory and vis- 
ual signals were presented randomly 
during suppression and nonsuppression 
phases and RTs were obtained. If re- 
sponse inhibition is present, auditory 
RTs as well as visual RTs should in- 
crease during suppression. 


Method 


Subjects—The Ss were DC, LS, and RF, 
who had served in previous experiments. 

Apparatus and procedure—In essence, the
experiment was a replication of the 2°/sec 
linear motion condition of Exp. III, except 
that either the movement stimulus or an audi- 
tory signal (1,000-cycle tone, 50-db., .1-sec. 
duration) could be presented on any trial. 
The order of presentation of the tone and 
the movement was randomized and unknown 
to S. On 70% of the trials the tone was 
given; on 30% of the trials movement oc- 
curred. The Ss were instructed to react 
as rapidly as possible to either event. From 
each S 110 auditory RTs (55 under each 
rivalry phase) and 38 movement RTs (19 
under each rivalry phase) were obtained. 


Results and Discussion 

The mean RTs are given in Table 4.
For auditory RT the differences between
suppression and nonsuppression for each S
were not significant, but for movement RT
the differences were significant for each S
(p < .025 by the Mann-Whitney U test).


TABLE 4

REACTION TIME TO AUDITORY AND VISUAL SIGNALS DURING RIVALRY PHASES

   | M Auditory RT (sec.)                           | M Visual RT (sec.)
S  | Suppression | Nonsuppression | Significance a | Suppression | Nonsuppression | Significance a
DC | .320        | .333           | ns             | 1.064       | [illegible]    | [illegible]
LS | [illegible] | [illegible]    | [illegible]    | [illegible] | [illegible]    | [illegible]
RF | .319        | .315           | ns             | 1.426       | [illegible]    | <.05

a Calculated by the Mann-Whitney U test.

Although the movement RTs are 
similar in magnitude to those found 
in Exp. III, the absence of any RT 
difference for the auditory condition 
argues against the hypothesis of corre- 
lation between the response system and 
the rivalry states. Accordingly, sup- 
port is given to the assumption that 
differences in movement RT between 
suppression and nonsuppression reflect 
variations in the sensitivity of the per- 
ceptual system. 


Discussion 


An increase in RT during suppression 
is the consistent outcome of all four ex- 
periments. A reasonable interpretation 
of the increase is that the suppression 
state weakened the neural signal gener- 
ated by the moving stimulus. Even 
though the suppressed static stimulus and 
the motion stimulus differed consider- 
ably, the suppression process ignored the 
difference and inhibited both. Ac- 
cordingly, these data favor the hypothesis 
of a nonselective suppression mechanism. 
Additional experiments are required to 
establish that hypothesis firmly, but it 
should be clear that failure to differenti- 
ate movement sharply limits the stimulus 
dimensions from which a selective sup- 
pression system could operate. 

The nonselective hypothesis is con- 
sistent with the phenomenal spread of 
suppression beyond the borders of the 
rivalry target; i.e., during suppression
phases a portion of the background sur- 
rounding the target is concomitantly 
suppressed, the magnitude of the por- 
tion varying over phases (Hochberg, 
1964; Kaufman, 1963). Apparently, the 
suppression process is not precisely tuned 
to the contours of the rivalry target but, 
rather, operates more coarsely, gating the 
spatial regions within which the target 
falls. 

But there is at least one class of phe- 
nomenal data which seems to suggest that 
the suppression state is not wholly
nonselective. In certain displays containing
both rivalry and stereopsis stimuli,
depth is seen even though rivalry is
clearly present (e.g., Ogle & Wakefield,
1967). One obvious interpretation is
that the depth signal escapes attenuation 
by the suppression state, but further in- 
vestigation of this situation is required 
before that interpretation can be ac- 
cepted without qualification. 

The main feature of the data from 
the present experiments is the implica- 
tion for the issue of selectivity during 
suppression. A second notable feature is 
the inverse relationship between the 
strength of the test stimulus and the in- 
hibitory suppression state found in Exp. 
II and III. One way to interpret this re- 
sult is to assume that during suppression 
the energic threshold of the input chan- 
nel is elevated. When a test input occurs 
during suppression the elevated threshold 
attenuates the energy of the input. The 
extent to which the input is weakened 
depends upon its initial strength. For 
very strong signals only a negligible pro- 
portion of the energy would be subtracted 
by the raised threshold, while very weak 
signals would be completely absorbed. 
In general, an inverse relation between 
the effect of suppression and stimulus 
strength should obtain.
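
The subtractive reading of this interpretation can be illustrated numerically. The sketch below is an assumption-laden toy model, not a model fitted to the data: the function names, the arbitrary units, and the fixed threshold elevation of 1.0 are all hypothetical.

```python
def attenuated_strength(input_strength, threshold_rise):
    """Strength remaining after the raised threshold subtracts a fixed amount."""
    return max(0.0, input_strength - threshold_rise)


def proportion_lost(input_strength, threshold_rise):
    """Fraction of the input removed by suppression (1.0 means fully absorbed)."""
    if input_strength <= 0:
        raise ValueError("input_strength must be positive")
    return min(1.0, threshold_rise / input_strength)


THRESHOLD_RISE = 1.0  # hypothetical elevation, arbitrary units

for strength in (0.5, 1.0, 2.0, 10.0, 100.0):
    remaining = attenuated_strength(strength, THRESHOLD_RISE)
    lost = proportion_lost(strength, THRESHOLD_RISE)
    print(f"input {strength:6.1f}: remaining {remaining:6.1f}, proportion lost {lost:.2f}")
```

Under these assumptions a weak input (0.5) is absorbed completely, whereas a strong input (100.0) loses only about 1% of its strength, so the proportional effect of suppression falls as stimulus strength rises.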


Independent of the adequacy of this 
interpretation, the inverse relation be- 
tween suppression RT and motion veloc- 
ity found in these experiments indicates 
that the motion stimulus does not simply 
act as a cue for switching rivalry states. 
Rather, detection of the stimulus depends 
upon its intensity. 

Rats were run in a 2-link chain in which the 1st (R1), the 2nd (R2),
or both received negatively correlated reinforcement. In every case,
R2 speeds were faster following nonreward for R1. This frustration
effect was essentially the same whether nonreward was correlated or
uncorrelated with speed. Test trials with an extraneous stimulus led
to an increase in the average speed of a response receiving negatively
correlated reinforcement.


Performance in the second link of a 
two-response chain is importantly af- 
fected by the reward conditions in the 
first link. Specifically, if rats are run 
in a double-alley maze, with reward 
always given in the second alley but 
given on only half the trials in the first 
alley, speed in the second alley is faster 
following nonreward than following 
reward in the first alley (e.g., Amsel, 
1958). This has been called the “frus- 
tration effect” (FE) because the dif- 
ference compared with controls receiv- 
ing continuous reward in the first alley 
occurs as faster running following non- 
reward. It thus appears that non- 
reward has motivational properties 
which can contribute to the total drive 
level and potentiate responding (Am- 
sel, 1962; Spence, 1960). 

1 Supported in part by a grant from the
National Science Foundation. The author is
indebted to L. M. Gonzalez for running rats
in Exp. I.

The previous studies germane to
FE used conditions in which reward
was not explicitly correlated with
response speed, ignoring the inherent
correlation that faster speeds usually get
the reward sooner. In order to extend
and elaborate upon the FE, the
present studies included conditions of
negatively correlated reinforcement for
either the first (R1), the second (R2),
or both of the links of the response
chain. To receive greater reward under
negatively correlated reinforcement, S
must consume time before completing
the response. Thus in the discontinuous
case involved in these studies, if a
10-sec. cutoff is employed, only speeds
below .10 are rewarded. It is known
that Ss can adjust to such conditions
remarkably well (Logan, 1960). Because
of the present focus on FE, the equally
appropriate expression of “correlated
nonreinforcement” will sometimes be employed
to emphasize that nonreward resulted
from failure to meet the designated
criterion.
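
The relation between the time cutoff and the speed criterion is a reciprocal one, which a short sketch may make concrete. The function names are hypothetical, speed is taken as the reciprocal of response time in seconds, and treating a response that lasts exactly the cutoff as rewarded is an assumption (the text speaks of consuming "at least" the required time).

```python
def response_speed(response_time_sec):
    """Speed expressed as the reciprocal of response time (1/sec.)."""
    if response_time_sec <= 0:
        raise ValueError("response_time_sec must be positive")
    return 1.0 / response_time_sec


def is_rewarded(response_time_sec, cutoff_sec=10.0):
    """Discontinuous negative correlation: reward only if S takes at least cutoff_sec.

    Rewarding a response that lasts exactly cutoff_sec is an assumption.
    """
    return response_time_sec >= cutoff_sec


# Hypothetical response times (sec.), not data from these experiments.
for t in (5.0, 8.0, 12.5, 20.0):
    print(f"time {t:5.1f} sec., speed {response_speed(t):.3f}, rewarded: {is_rewarded(t)}")
```

With a 10-sec. cutoff, a 12.5-sec. response has a speed of .08 and is rewarded, while an 8-sec. response has a speed of .125 and is not.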

Consider a situation in which R1 re- 
ceives correlated nonreinforcement, 
while reward for R2 is uncorrelated 
with response speed. In this case, 
the question is whether nonreward re- 
sulting from a timing error on the part 
of S is more “frustrating” than uncor- 
related nonreward. When this ques- 
tion was first posed in the context of 
conditional outcome choice behavior, it 
was found that correlated conditions 
are equally preferred to alternatives 
containing matched frequencies of 
uncorrelated reinforcement (Logan, 
1962). Specifically, rats were indiffer- 
ent as to whether reward did or did not 
depend on differential responding. 
However, it is still possible that cor- 
related nonreward is more frustrating 
but that correlated reward is also more
positive so that the net effect is neutral. 
This can only be determined by eval- 
uating each component separately. 

Consider next the condition in which 
R2 receives correlated reinforcement. 
Even if nonreward for R1 affects the 
motivational level of S, the effect upon 
R2 performance cannot confidently be
predicted. It is known that an in- 
crease in drive following training with 
negatively correlated reinforcement 
produces an increase in response speed 
(Logan, 1960), but it is not yet known 
how drive conditions to which S has 
regularly been exposed affect perform- 
ance in such situations. Accordingly, 
nonreward for R1 could increase drive 
but still induce S to respond more 
slowly in order to insure reward for 
R2. The present context may thus be 
an unusually good one in which to de- 
termine the effects of drive on perform- 
ance under correlated reinforcement. 

Preliminary data were also collected 
that address a quite different question. 
It is well known that organisms ad- 
just to negatively correlated reinforce- 
ment predominantly by performing rit- 
uals during the time required for 
reward (Wilson & Keller, 1953). This 
observation permits the interpretation 
that S is not actually timing the mea- 
sured response per se, but instead has 
learned a response chain that coinci- 
dentally consumes the requisite time. 
Another interpretation is that the total 
response speed is selected by a single 
choice at the beginning of each trial. 
A third interpretation is that S is 
continuously monitoring his behavior 
and deciding when to complete the re- 
sponse on the basis of some internal 
temporal cues. Any of these interpre- 
tations could equally well fit the simple 
fact of adjustment to correlated condi- 
tions. 

If the last of these interpretations is 
correct, one could presume that S is 


actively inhibiting the response until 
appropriate temporal cues arrive. The 
situation would thus be seen as some- 
what analogous to delayed classical 
conditioning. Pavlov (1927) intro- 
duced the concept of inhibition of delay 
because extraneous stimuli could pro- 
duce “disinhibition” as revealed by pre- 
mature responding. Pursuing the 
analogy, one could infer that an ex- 
traneous stimulus would disinhibit the 
response in negatively correlated rein- 
forcement and thereby cause an in- 
crease in speed. 


METHOD 


Subjects—The Ss were 28 experimentally 
naive hooded rats bred in the colony of 
the psychology department of the Univer- 
sity of New Mexico. They were housed in 
individual cages with water freely available 
and were normally fed 14 gm/day immediately
after being run. The Ss in Exp. II
were given 50 gm. on Fridays to carry 
them through the weekend. (This latter 
procedure, systematic evaluation of which 
will be reported separately, appears to have 
maintained reasonably constant motivation on 
running days.) 

Apparatus—The apparatus was a double- 
alley maze with two adjacent, 4-ft. parallel 
alleys, one of which was white and the other 
black. Start chambers faced each alley to 
force entry but, by turning a crank, these 
could be sealed off exposing a chamber 
(normally used as a choice start box) 
through which commerce between the alleys 
could be achieved. A motor-driven, ver- 
tically sliding door separated the end 12 in. of 
each alley into a goal chamber. When appropriate,
the selected number of 45-mg.
pellets was delivered by Davis feeders at a 
rate of about 6/sec timed from S breaking 
a photobeam within the food cup. Conven- 
tional programming and recording equipment 
were available through a plugboard as re- 
quired by the procedure. 

Procedure—The Ss were run in rotation
in squads of four producing an intertrial 
interval (ITI) of 3-4 min. Reward was 
two pellets without delay. Free exploration 
of the apparatus and magazine training pre- 
ceded the experimental conditions. Train- 
ing lasted 50 days at 6 trials/day. 

Experiment I—The 10 experimental Ss
were placed in the start box facing the black
alley and were rewarded in the black goal 
box if the time between the opening of the 
start door and nosing into the food cup ex- 
ceeded 3 sec. The goal doors closed for a 
6-sec. interresponse interval whether this
first response was rewarded or not (during 
which time E turned the crank opening the 
choice chamber back at the “start” end). 
Following the reopening of the goal doors, 
S was required to retrace the black alley, 
turn 180°, and traverse the white alley at the 
end of which reward was given on every 
trial. The procedure for the 10 control Ss 
was identical except that reward for R1 was 
given independently of speed on trials cor- 
responding to those on which matched ex- 
perimental Ss earned reward. 

Experiment II—The four experimental Ss 
were placed in the black goal box such that 
their rump broke a photobeam 1 in. from the 
closed goal doors and 12 in. from the food 
cup. (The foreshortening of R1 was 
adopted to minimize possible interference 
when two different timing requirements were 
imposed in the same apparatus.) Reward 
was given if they waited at least 14 sec. be- 
fore nosing into the food cup. The goal 
doors opened 12 sec. after placement whether 
R1 was rewarded or not. The R2 was the 
same as in Exp. I except that reward was 
given only if S consumed at least 10 sec. 
completing this response. The procedure for 
the four control Ss was identical except that 
reward for R1 was given independently of
speed on trials corresponding to those on
which matched experimental Ss earned
reward.

Following training, all Ss in Exp. II were 
tested with a moderately intense 2,000-cycle 
tone. This tone was sounded below the 
black goal box for the last 4 sec. preceding 
opening the doors to enable R2. Tests were 
given on Trials 4 and 6 over 3 days with re- 
ward or nonreward for R1 on Trial 6 un- 
correlated and opposite to that received dur- 
ing the regular procedure on Trial 4. Reward
for R2 remained correlated during test
trials.


RESULTS 


The results of both experiments in 
terms of asymptotic running speeds, in- 
cluding the testing phase of Exp. II, 
are shown in Table 1. Consider first 
the control group of Exp. I which is in 
essence a replication of familiar uncor- 
related FE procedures. It is clear that 
Ss ran faster in R2 following nonreward
for R1; since all 10 Ss showed
such an FE within their individual 
records, no further statistical analyses 
were performed. This result confirms 
that the present procedure is effective 
for studying the FE. 

All 10 experimental Ss of Exp. I 
for which R1 involved correlated non- 
reinforcement also showed an FE 
within their individual records. That 
the correlation was effective is evident 
in their substantially slower R1 speeds 
than that of their matched controls, t 
(19) = 9.4, p < .001. Although the 
average FE as shown in Table 1 is 
somewhat larger for the experimental 
than the control Ss, the difference be- 
tween the differences did not approach 
acceptable levels of statistical signifi- 
cance, U (10, 10) = 26, p = ns. 
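
For readers unfamiliar with the statistic, the following is a minimal sketch of how a Mann-Whitney U of the kind reported here is computed from two independent samples. The function name is hypothetical, and the sample values are invented for illustration; they are not the FE scores from these experiments.

```python
def mann_whitney_u(sample_a, sample_b):
    """Return (U_a, U_b) for two independent samples, using midranks for ties."""
    n_a, n_b = len(sample_a), len(sample_b)
    if n_a == 0 or n_b == 0:
        raise ValueError("both samples must be non-empty")

    # Pool the observations, remembering which sample each came from.
    pooled = sorted(
        [(value, "a") for value in sample_a] + [(value, "b") for value in sample_b],
        key=lambda pair: pair[0],
    )

    rank_sum_a = 0.0
    i = 0
    while i < len(pooled):
        # Find the run of tied values starting at position i.
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        midrank = (i + 1 + j) / 2.0  # average of ranks i+1 through j
        rank_sum_a += midrank * sum(1 for k in range(i, j) if pooled[k][1] == "a")
        i = j

    u_a = rank_sum_a - n_a * (n_a + 1) / 2.0
    u_b = n_a * n_b - u_a
    return u_a, u_b


# Hypothetical difference scores for two groups of Ss.
experimental = [0.16, 0.21, 0.09, 0.18, 0.12]
control = [0.08, 0.11, 0.05, 0.10, 0.07]
u_exp, u_ctl = mann_whitney_u(experimental, control)
print("U =", min(u_exp, u_ctl))
```

The smaller of the two U values is the one conventionally referred to tables of critical values for the given group sizes.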

The control Ss in Exp. II encountered
uncorrelated nonreinforcement
for R1 and correlated reinforcement for
R2. Although the FE shown in Table
1 appears small, it must be recalled that
the variation is around a cutoff speed
of .100. All four Ss showed an FE
within their individual records, the
importance of which is best revealed by
the substantially lower percentage of
reward in R2 following nonreward in
R1.


TABLE 1

RESPONSE SPEEDS AND PERCENTAGE OF REINFORCEMENT IN EACH LINK OF A TWO-RESPONSE CHAIN, WITH R2 SEPARATED ACCORDING TO WHETHER R1 WAS REINFORCED OR NONREINFORCED

      |             |             | After R1 reinforcement  | After R1 nonreinforcement
      | R1          | R1 % Rein-  | R2     | R2 % Rein-     | R2     | R2 % Rein-
Group | speed       | forcement   | speed  | forcement      | speed  | forcement
C1    | .76         | (37.7)a     | .417   | (100)a         | .502   | (100)a
E1    | [illegible] | [illegible] | .394   | (100)a         | .554   | (100)a
M     |             |             | .405   |                | .528   |
C2    | 1.69        | (32.6)a     | .084   | 71.6           | .105   | 32.4
E2    | [illegible] | 32.6        | .100   | 63.1           | .115   | 45.[illegible]
M     |             |             | .092   | 67.3           | .110   | 48.6
Test  |             |             | [illegible] | 45.8      | .126   | 40.[illegible]

Note.—For Groups C1, E1, C2, and E2, respectively,
the reward conditions in the two links were:
uncorrelated-uncorrelated, correlated-uncorrelated,
uncorrelated-correlated, and correlated-correlated. The test
row shows the effect of a brief extraneous tone just
preceding R2 for the correlated groups C2 and E2
combined.

a The percentage figures shown in parentheses were
uncorrelated.

All four experimental Ss in Exp. II, 
for which the correlated nonreinforce- 
ment for R1 was followed by correlated 
reinforcement for R2, also showed an 
FE within their individual records. 
The effect of the correlation is again 
reflected in their substantially slower 
R1 speeds, t (7) = 10.7, p < .001. 
They did somewhat more poorly on the 
average in R2 than their matched con- 
trols, but the magnitude of the FE 
was essentially the same, U (4, 4) = 
7, p = ns. 

Presumably because performance 
with correlated reinforcement is under 
control of a number of variables, in- 
cluding recent history of success and 
failure, the effects of the few tone test 
trials were not ubiquitous for all com- 
parisons for all Ss. Nevertheless, 
seven of the eight Ss averaged faster 
speeds over the six test trials, and the 
average effect shown in Table 1 was 
striking and statistically significant; 
R2 speeds were increased by a brief ex- 
traneous tone immediately preceding 
that response, t (7) = 4.1, p < .01.


Discussion 


The present results systematically ex- 
tend earlier work in showing that an 
FE results from correlated as well as 
uncorrelated nonreinforcement. This is 
true whether or not the second response 
itself receives negatively correlated rein- 
forcement. Furthermore, the magnitude 
of the FE is substantially the same fol- 
lowing either basis for nonreinforcement. 
This last result, combined with earlier 
work on conditional outcome choices, 
suggests that neither is reward more 


positive nor nonreward more negative 
when they depend upon a quantitative di- 
mension of a response than when they 
occur irregularly without such a correla- 
tion. 

A basis for increased speed when the 
R2 involves correlated reinforcement may 
be found by assuming first that per- 
formance in such situations is based pri- 
marily upon some biological timing 
mechanism, and second that the speed 
of this mechanism is to some extent de- 
pendent upon the motivational level. 
That is to say, the organism learns to 
respond when his “internal clock times 
out.” This analysis does not imply that 
adjustment would be worse with con- 
sistently high as opposed to low motiva- 
tion; the organism could learn to respond 
to the distinctive temporal cues in either 
case. It does imply, however, that an 
increase in motivation will produce an 
increase in speed because the learned 
temporal cues would then occur sooner. 
In the present case, where motivation 
presumably varied from trial to trial de- 
pendent on occasional frustration, the re- 
inforced temporal cues would not be con- 
sistent. Since rats respond to the earliest 
cues that are ever reinforced (Logan, 
1967), they would tend to respond early 
when their timing mechanism was accel- 
erated by frustration drive. 

The fact that speed was increased by 
an extraneous stimulus appears analogous 
to disinhibition of delay and is, on that 
basis, consistent with the interpretation 
that negatively correlated reinforcement 
involves an inhibition of responding. 
Although further research with adequate 
controls is required, this finding suggests 
the importance of viewing behavior as a 
dynamic process taking place continu- 
ously over time. 

3 experiments on the role of verbal labels in the transfer of stimulus
predifferentiation to shape recognition and identification learning are
described. Ss were given preliminary paired-associate practice in
labeling random shapes with 1 of 3 kinds of verbal labels and then
were given a shape-recognition task (Exp. I and II), or an
identification-learning task (Exp. III) to assess effects of verbal
pretraining. Recognition performance was found to depend primarily
on the relevance of the verbal label, as distinct from meaningfulness,
while transfer to the identification-learning task depended on both
response relevance and response meaningfulness. The results suggested
that the verbal label may serve both confirming and directing
functions, varying with the relationship between pretraining and
transfer-task activities and with intratask properties of the pretraining
stimuli.


Studies of the transfer of stimulus 
predifferentiation are designed to de- 
termine “what is learned” about the 
pretraining stimuli, using an A-B, A-C 
transfer paradigm or more direct tests 
of perceptual performance such as stim- 
ulus discrimination or recognition. In- 
terest has focused on the transfer task 
as an indicator of stimulus learning 
during A-B pretraining and on the 
variables that govern such learning. In 
addition, modifications of the prediffer- 
entiation paradigm have been used in 
tests of theoretical accounts of transfer 
in perceptual learning. Although con- 
vincing evidence for the positive trans- 
fer effects of predifferentiation train- 
ing has been obtained, the area has been 
plagued, nevertheless, by seemingly in- 
consistent evidence as well as by rela- 
tively small effects. Moreover, where 
consistent findings have been obtained, 
it has not been easy to identify the 
specific processes or mechanisms
responsible for the transfer effects (Ellis,
1968). In view of these divergencies,
it was considered desirable to
conduct a series of experiments de- 
signed to examine three classes of 
variables associated with seeming in- 
consistencies in the predifferentiation 
literature: properties of the verbal label, 
associative properties of the stimuli, 
and transfer-task variables. 

The precise role of properties of 
the verbal label in predifferentiation 
studies remains unclear. For ex- 
ample, Hake and Eriksen (1955), in 
examining the role of response specific- 
ity, suggested that the perceptual gain 
resulting from labeling practice is pri- 
marily a result of the general labeling 
process which provides a search task 
for S. Subsequently, Hake and Erik- 
sen (1956) demonstrated that what Ss 
learn about visual stimuli during pre- 
training is more reliably tied to labeling 
performance when the labels are 
familiar to S. Similarly, studies inves- 
tigating properties of the labeling re- 
sponse, such as “meaningfulness” of 
the label, “relevance” of the label to the
shape, and shape-label “congruence,” 
have shown divergent findings. For 
example, Johnson (1964) found that 
recognition of random shapes following 
predifferentiation training varied in- 
versely with the class-similarity and 
class-label congruence of the shapes. 
In turn, Arnoult (1956) found that 
properties of the verbal label did not 
differentially affect recognition mem- 
ory. He gave independent groups of 
Ss practice in observing shapes, or 
labeling them with nonsense syllables, 
girls’ names, modal labels, or labels 
supplied uniquely by each S, and found 
that recognition performance was in- 
variant with the pretraining conditions. 

Arnoult’s (1956) negative findings 
remain puzzling, principally because it 
appears reasonable to assume that S- 
supplied labels and modal labels would 
serve as more effective mediators of 
transfer performance than would non- 
sense syllables or irrelevant labels,
presumably by enabling S to select more
readily distinctive features of the stim- 
uli. It is conceivable, however, that 
Arnoult’s findings were the result of 
using only a single level of stimulus 
complexity (intermediate) and a single 
level of stimulus meaning (undeter- 
mined), and that the effects of proper- 
ties of the verbal label depend upon as- 
sociative and/or other properties of the 
stimuli employed. Indeed, it is known 
that the effect of verbal labeling de- 
pends upon the codeability of the visual 
forms (Ellis & Homan, 1968), and 
upon the meaningfulness of the forms 
(Ellis, Muller, & Tosti, 1966), findings 
suggesting that properties of the verbal 
label become important determinants of 
transfer performance for certain critical 
values of the stimulus. Additional 
evidence for this kind of complex in- 
teraction is seen in a study by Pfafflin 
(1960) who found positive transfer 
from verbal pretraining when the stimuli
were low in associative consistency,
and negative transfer with stimuli of 
high associative consistency. Since 
these findings suggest that the effects of 
properties of verbal labels depend upon 
associative and other properties of the 
stimuli, the first experiment in the 
series was designed to investigate the 
effects of properties of the labeling re- 
sponse, association value of the stimuli, 
and amount of practice on recognition 
of random shapes following preliminary 
paired-associate practice in labeling the 
shapes. 


EXPERIMENT I 
Method 


Experimental design—Ten Ss, volunteer 
university students, were assigned at random 
to each of 18 conditions of the experi- 
ment. Two levels of stimulus meaning 
(high- or low-association value), three types 
of labeling responses (meaningful-relevant 
words, meaningful-irrelevant words, or non- 
meaningful-irrelevant CVCs), and three 
levels of practice (2, 8, or 16 trials) were 
employed in a 2 X 3 X 3 factorial design. 
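
The structure of this design can be enumerated directly. The sketch below is illustrative: the variable names, the S identification numbers, the seeded shuffle, and the helper `assign_at_random` are assumptions and do not describe the procedure actually used to assign Ss. The labels MR, MIR, and CVC abbreviate the three types of labeling response.

```python
import random
from itertools import product

stimulus_meaning = ("low AV", "high AV")
label_type = ("MR", "MIR", "CVC")  # meaningful-relevant, meaningful-irrelevant, CVC
practice_trials = (2, 8, 16)

# Every combination of the three factors is one cell of the 2 x 3 x 3 design.
conditions = list(product(stimulus_meaning, label_type, practice_trials))
ss_per_condition = 10
total_ss = len(conditions) * ss_per_condition


def assign_at_random(n_ss, cells, per_cell, seed=0):
    """Shuffle S numbers 1..n_ss and deal them out per_cell at a time (hypothetical helper)."""
    if n_ss != len(cells) * per_cell:
        raise ValueError("n_ss must equal len(cells) * per_cell")
    ids = list(range(1, n_ss + 1))
    random.Random(seed).shuffle(ids)
    return {cell: ids[i * per_cell:(i + 1) * per_cell] for i, cell in enumerate(cells)}


assignment = assign_at_random(total_ss, conditions, ss_per_condition)
print(len(conditions), "conditions;", total_ss, "Ss in all")
```

The enumeration gives 18 conditions and, at 10 Ss per condition, 180 Ss in all.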

The experiment was conducted in two 
parts: In the first part (predifferentiation 
training) Ss learned to associate one of three 
types of verbal labels to each of a set of 
eight random shapes. Following pretrain- 
ing, Ss were given a multiple-shape recogni- 
tion test in which they attempted to select 
from a group of shapes those shapes learned 
during predifferentiation training.

Stimulus shapes and verbal labels—A list 
of eight low-association-value (No. 22, 23, 
24, 25, 26, 28, 29, 30) and high-association- 
value random shapes (No. 3, 5, 6, 7, 9, 10, 
11, 13) was selected from the set of six-point 
shapes scaled by Vanderplas and Garvin 
(1959a). The two lists had mean association 
values (AV) of 31% and 47%, respectively, 
with corresponding values of associative 
consistency (AC) and associative frequency 
(AF), as scaled by Ellis, Muller, and Tosti 
(1966), equated. Average values of AC and 
AF for the low-AV shapes were .50 and 
1.99, respectively; for the high-AV shapes, 
the values of AC and AF were .54 and 2.01, 
respectively. 

One group of Ss learned to attach mean- 
ingful-relevant (MR) labels to the shapes, 
labels known to be modal associates of the 
visual forms (Ellis & Muller, 1964). A
second group of Ss attached meaningful-irrelevant
(MIR) labels to the shapes, labels
equal in frequency of usage (Thorndike &
Lorge, 1944) to the MR labels but judged
independently by four Es to be unrelated to the
visual forms. A third group learned non- 
meaningful-irrelevant labels, CVCs of 0% 
AV taken from Glaze’s list. In this fashion, 
it was possible to separate the effects of 
relevance (or “belongingness”) from that 
of meaningfulness defined in terms of asso- 
ciative availability. 

Predifferentiation training—The Ss were 
given either 2, 8, or 16 trials in learning to 
associate one of three types of verbal labels 
to the shapes. The shapes and labels were 
presented with projectors (Kodak Carousel) 
with duration controlled by Hunter timers. 
Each shape was exposed for a 4-sec. period 
consisting of a 2-sec. anticipation interval 
and a 2-sec. simultaneous presentation with 
the response label. Labels were pronounced 
by S; correct anticipations and errors were 
recorded. All Ss were tested individually.

Recognition test—Following predifferen- 
tiation training, all Ss were given a recogni- 
tion test that consisted of presenting S with 
16 cards, each card containing a set of five 
shapes mounted in a row. Eight of the cards 
contained a prototype, a shape experienced 
during predifferentiation training, and the re- 
maining four shapes were variations of the 
prototype. On the remaining 8 cards all 
five shapes were variations of the prototype, 
thus allowing a maximum score of eight correct
recognitions. The Ss were instructed
to point to a shape if they thought it was 
one they learned during pretraining; if they 
thought that none of the shapes were ones 
they experienced initially, they were instructed
to say “none.” Five types of response
measures were recorded; however, the
data of primary interest were the correct 
recognitions. The procedures used in the 
recognition test, including that of construct- 
ing distractors, were identical to those em- 
ployed by Ellis and Muller (1964) and Van- 
derplas and Garvin (1959b). 


Results 


Original learning—Table 1 shows 
the results of original learning on the 
predifferentiation task and indicates 
that response meaning-relevance was 
positively correlated with the number 
of label anticipations. No test of the 
significance of this trend was made, 


TABLE 1


MEAN CORRECT ANTICIPATIONS DURING
PREDIFFERENTIATION TRAINING


Association   Type of      Pretraining Trials
Value         Label         2      8      16

Low           MR           3.6    6.8    8.0
              MIR          1.0    3.0    14
              CVC          0.6    1.5    4.0
High          MR           2.5    1.7    8.0
              MIR          1.1    5.4    6.0
              CVC          0.1    1.3    3.4


however, since the data of primary in- 
terest were the scores on the recogni- 
tion test. 

Recognition performance.—The basic 
data obtained in the recognition test are 
shown in Fig. 1, which shows a plot 
of the number of correct recognitions as 
a function of number of pretraining 
trials for the conditions of the experi- 
ment. Recognition performance varied 
systematically with the meaningfulness- 
relevance of the verbal label, with 
recognition performance greatest fol- 
lowing paired-associate practice with 
MR labels, intermediate with MIR 
labels, and poorest with CVC labels. 
An analysis of variance of the correct 
recognitions revealed a significant ef- 
fect associated with type of verbal label, 
F (2, 162) = 23.72, p < .001. A sig- 
nificant improvement in recognition 
with increased practice was obtained, 
F (2, 162) = 27.60, p < .001, and association
value of the shapes had no reliable
effect upon recognition, F (1, 162) = .06,
p > .05.


[Figure 1. Panels: low association-value shapes and high
association-value shapes; correct recognitions plotted
against pretraining trials.]

Fig. 1. Transfer of verbal pretraining to
a recognition task. (Mean correct recognitions
as a function of type of verbal labeling
practice, amount of practice, and association
value of six-point random shapes.)

Further inspection of Fig. 1 suggests
that the principal contribution to
gain in recognition performance 
stemmed from the relevance of the re- 
sponse label to the shape, as distinct 
from the meaningfulness of the re- 
sponse label. The difference between 
the upper two curves for the high-AV 
shapes, which compares the effect of 
MR labels with MIR labels, was sta- 
tistically significant, F (1, 54) = 17.36, 
p < .01, whereas the difference between 
the lower two curves, which compares 
the effects of MIR and CVC labels, 
was not, F (1, 54) = .36, p >.05. A 
similar comparison of curves for the 
low-AV shapes revealed that the differ- 
ence between the upper two curves 
was reliable, F (1, 54) = 8.01, p < .01, 
and that of the lower two curves was 
also reliable, F (1, 54) = 4.09, p <
.025. These comparisons indicate that
response relevance contributed to gain 
in recognition for both the high- and 
low-AV shapes, whereas meaningful- 
ness of the label was an effective vari- 
able for recognition of only low-AV 
shapes.


EXPERIMENT II 


The second experiment examined the 
generality of the findings of Exp. I 
using complex (24-point) shapes. 
Previous experiments (e.g., Ellis &
Muller, 1964; Vanderplas & Garvin, 
1959b) had indicated that the effect 
of labeling practice on recognition in- 
teracts with the physical complexity of 
the visual forms, suggesting the possi- 
bility that response meaningfulness- 
relevance might also interact with com- 
plexity. 


Method 


All features of Exp. II were comparable 
to those of Exp. I except that complex
(24-point) shapes were employed. A total of
180 volunteer university Ss were assigned at 
random to the 18 conditions of the experi- 
ment. Stimulus meaning, amount of prac- 
tice, and meaningfulness-relevance of the 
label were again varied factorially, and pre- 
training and recognition testing were iden- 
tical to procedures of Exp. I. The stimuli 
consisted of a list of eight low-AV (No. 21, 
23, 24, 25, 26, 27, 29, 30) and eight high-AV 
random shapes (No. 1, 4, 5, 6, 7, 9, 10, 11) 
selected from the set of 24-point shapes 
scaled by Vanderplas and Garvin. The two 
lists had mean AVs of 28% and 45%, re- 
spectively. Average values of AC and AF 
for the low-AV shapes were .27 and 1.35, 
respectively; average values of AC and AF 
for the high-AV shapes were .32 and 1.61. 
These values were systematically lower than 
those for the 6-point shapes. 


Results 


Original learning—Table 2 shows 
the results of original learning on the 
predifferentiation task and indicates 
that stimulus meaning and response 
meaning-relevance were positively cor- 
related with the number of label antici- 
pations. No test of the significance of 
either trend was made, however, since 
the data of primary interest were the 
scores on the recognition test. 

Recognition performance.—The prin- 
cipal data obtained in the recognition 
test are shown in Fig. 2, which is a 
plot of the number of correct recog- 
nitions for the conditions of the experi- 
ment. The figure indicates that recog- 


TABLE 2


MEAN CORRECT ANTICIPATIONS DURING
PREDIFFERENTIATION TRAINING


Association   Type of      Pretraining Trials
Value         Label         2      8      16

Low           MR           1.4    44     1:3
              MIR          0.6    3.0    5.1
              CVC          0.1    1.4    2.6
High          MR           35     7.3    8.0
              MIR          1.5    5.5    n
              CVC          0.1    1.7    3.


Fig. 2. Transfer of verbal pretraining to
a recognition task. (Mean correct recogni- 
tions as a function of type of verbal labeling 
practice, amount of practice, and association 
value of 24-point random shapes.) 


nition performance again varied as a 
function of the properties of the verbal 
label, with recognition performance 
greatest following paired-associate
practice with MR labels, intermediate 
with MIR labels, and poorest with 
CVC labels. An analysis of variance 
of the recognition data revealed reliable 
effects associated with type of verbal 
label, F (2, 162) = 9.52, p < .01;
practice, F (2, 162) = 46.68, p < .001; 
and association value of the shapes, F 
(1, 162) = 21.53, p < .01. 
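
The degrees of freedom reported above can be checked against the design described in the Method. The following is only a bookkeeping sketch: it assumes a completely between-Ss 2 x 3 x 3 factorial with equal cell sizes (10 Ss per cell, inferred from 180 Ss in 18 conditions) and that the pretraining levels are the 2, 8, and 16 trials of Exp. I. The variable names are illustrative and are not taken from the original report.

```python
from itertools import product

# Factor levels as described in the text; names are illustrative.
association_values = ["low", "high"]
label_types = ["MR", "MIR", "CVC"]
pretraining_trials = [2, 8, 16]

# One cell per combination of the three between-Ss factors.
cells = list(product(association_values, label_types, pretraining_trials))

n_subjects = 180                              # total Ss reported for Exp. II
subjects_per_cell = n_subjects // len(cells)  # assumes equal cell sizes

# Main-effect degrees of freedom are (number of levels - 1).
df_label = len(label_types) - 1
df_practice = len(pretraining_trials) - 1
df_av = len(association_values) - 1

# Between-Ss error term: total Ss minus the number of cells.
df_error = n_subjects - len(cells)

print(len(cells), subjects_per_cell, df_label, df_practice, df_av, df_error)
```

Under these assumptions the error term has 162 df, which agrees with the denominators of the F ratios reported for type of label, practice, and association value.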

Inspection of Fig. 2 again suggests 
that the principal contribution to gain 
in recognition performance stemmed 
from the relevance rather than the 
meaningfulness of the response label. 
The difference between the upper two 
curves (MR vs. MIR) for the high- 
AV shapes was reliable, F (1, 54) = 
12.74, p < .01, whereas the difference 
between the lower two curves (MIR 
vs. CVC) was not, F (1, 54) =.69, 
p> .05. In turn, none of the curve 
comparisons for the low-AV shapes 
were reliable (p >.05). This latter 
finding indicates that the effects of associative
properties of the verbal label
are negligible when labels are attached
to stimuli of high physical complexity 
and low-association value. 


EXPERIMENT III


The previous two experiments dem- 
onstrated that properties of the verbal 


label employed during pretraining exert 
powerful effects on subsequent recogni- 
tion performance. Moreover, it was 
shown that the effects of label proper- 
ties vary with physical and associative 
properties of the stimuli, and that re- 
sponse relevance was a more potent 
variable in influencing recognition than 
was response meaningfulness. The 
purpose of Exp. III was to examine 
the effects of label characteristics us- 
ing a different type of criterion measure 
in which S was required to learn new 
identifying responses to the stimuli. 
There was some evidence (e.g., Ellis & 
Muller, 1964; Vanderplas, Sanderson, 
& Vanderplas, 1964) indicating that 
the effect of verbal pretraining varies 
with the nature of the criterion task. 
For example, Ellis and Muller (1964) 
found that practice in giving distinc- 
tive verbal labels to stimuli facilitated 
the subsequent acquisition of new 
identifying (switch-pressing) responses 
to the stimuli, whereas identical pre- 
training conditions failed to produce 
comparable effects on recognition performance.
Since little is known about
comparisons between identification- 
learning tasks and perceptual tasks such 
as recognition, Exp. III was conducted 
so as to permit comparison of the trans- 
fer effects obtained in Exp. II with 
those of a transfer task requiring Ss to 
learn new identifying responses to the 
stimuli (A-B, A-C paradigm). 


Method 


Experimental design—The Ss were 90 
volunteer university students assigned to each
of nine conditions of the experiment, 10 Ss
to a group. The Ss were given one of three
types of pretraining, MR, MIR, or CVC 
labeling practice as in Exp. I and II, and 
an additional group received irrelevant- 
stimulus (IS) pretraining as a control 
for nonspecific transfer effects; Ss were 
given either 8 or 16 pretraining trials. In 
addition, a final group of Ss learned the 
transfer task without any pretraining
(NPT). These controls were unnecessary
for purposes of task comparisons; however,
they do provide a base line to determine di- 
rection of transfer effects. Following pre- 
training, Ss were given a transfer task 
requiring them to learn an identifying re- 
sponse (pressing a switch) for each of the 
eight shapes they had learned during pre- 
training. 

Conditions of the experiment—The Ss 
labeled only the 24-point shapes of high AV 
employed in Exp. II. These shapes were 
used because response relevance contributed 
to large differences in recognition performance,
whereas the effects of response meaning
on recognition were negligible. The same
apparatus, procedures, labels, etc., employed 
in the predifferentiation stage of Exp. II 
were used in the predifferentiation stage of 
Exp. III. 

Transfer task—Following predifferentia- 
tion training, all Ss were immediately given a 
transfer task requiring them to learn to 
press one of eight switches on a response 
panel for each of the eight shapes they had 
previously learned. Each shape-response as- 
sociation was arbitrary. Each shape was 
exposed for 2 sec. and S was instructed to 
press the switch as rapidly as he could. The 
Ss were given confirmation of their correct 
responses by the appearance of a green light; 
an incorrect response was followed by no 
signal. A noncorrection procedure was em- 
ployed and correct responses were recorded. 


Results 


Original learning—Table 3 shows 
the results of original learning on the 
predifferentiation task and indicates 
that terminal acquisition performance 
was quite comparable to the analogous 
conditions employed in Exp. II. 

Transfer performance—Figure 3 
shows the performance curves for the 
transfer task. For purposes of analy- 


TABLE 3


MEAN CORRECT ANTICIPATIONS DURING
PREDIFFERENTIATION TRAINING


Type of          Trials
Label           8      16

MR             1.4    8.0
MIR            5.4    7.7
CVC            2.0    3.2
IS             6.0    6.7


[Figure 3. Panels: 8 pretraining trials and 16 pretraining
trials; correct responses plotted against trial blocks.]

Fig. 3. Transfer of verbal pretraining to
an identification-learning task. (Mean cor- 
rect switch-pressing responses as a function 
of type of verbal labeling practice and 
amount of practice, 24-point shapes.) 


sis, appropriate pairs of curves were 
compared over Trial Blocks 6-15 using 
Duncan’s range test; overall compari- 
son of the curves by analysis of vari- 
ance indicated that the effects of pre- 
training were highly reliable, p < .01. 
The groups receiving eight paired-asso- 
ciate pretraining trials were first com- 
pared. The MR group showed per- 
formance superior to the MIR group, 
p <.05, indicating the importance of 
response relevance. Although the 
MIR group was superior to the CVC 
group, this difference was not reliable, 
p> .05. The control conditions were 
comparable, IS vs. NPT, p > .05, and 
neither control condition differed reliably
from the CVC condition, p > .05.
In turn, the MR condition was superior 
to either control condition, p < .01. A 
plot of the NPT condition in Fig. 3
is shown only for the condition of 16 
pretraining trials, since the same curve 
could be plotted on the figure for the 
eight pretraining trials condition. 

For the groups receiving 16 paired- 
associate pretraining trials, a shift in 
the role of response relevance on trans- 
fer performance is evident. The per- 
formance curves of Groups MR and 
MIR are now comparable, p > .05, in- 
dicating that response relevance no 
longer exerts an effect over and above 
that of response meaningfulness on 
transfer performance. In contrast, the
performance curves of Groups MIR and CVC now
separate reliably, p < .01, indicating
that response meaningfulness emerged 
as an effective variable after extended 
practice. Again, the control conditions 
were comparable, p > .05, and neither 
control condition differed reliably from 
the CVC condition, p > .05. 


Discussion 


The outstanding feature of these ex- 
periments is the marked effect of asso- 
ciative properties of verbal labels on 
transfer of predifferentiation training to 
both shape recognition and identification 
learning. The data of Exp. I clearly dem- 
onstrate that recognition of 6-point ran- 
dom shapes following paired-associate 
labeling practice is contingent upon as- 
sociative properties of the verbal label. 
Moreover, this relationship is dependent 
upon the association value of the shapes. 
The recognition of high-AV shapes was 
determined principally by the relevance 
as distinct from the meaningfulness of 
the response label. In contrast, recogni- 
tion of low-AV shapes was a function of 
both response relevance and response 
meaningfulness. The data of Exp. II 
similarly indicate that recognition of 24- 
point random shapes is dependent upon 
associative properties of the verbal label, 
although this relationship differed in 
part from that portrayed in Exp. I. The 
recognition of high-AV shapes was again 
principally determined by the relevance of 
the response label, as was the case in 
Exp. I, whereas the recognition of low- 
AV shapes was independent of either 
response relevance or response meaningfulness.

The problem remains as to how to ac- 
count for the complex effects of the 
verbal label and AV on shape recognition.
First, the effect of response relevance,
as reflected in the superiority of
the MR over the MIR condition, can be 
interpreted as a result of the “directing” 
properties of the labels. The meaningful-relevant
labels are the dominant associates
given to the shapes, as defined by
scaling procedures (Ellis, Muller, &


Tosti, 1966), and are highly “representa- 
tive” of the shapes. Such labels, when 
used as response items in paired-associ- 
ate learning, are assumed to direct S’s 
orientation to those distinctive aspects 
of the stimuli of which they are descrip- 
tive. In contrast, the fact that low-AV 
24-point shapes were not differentially 
recognized as a function of label rele- 
vance or meaningfulness seems to be a 
matter of the relative “impoverishment” 
of these stimuli. These stimuli are
known to elicit relatively few associa- 
tions. Moreover, any directing property 
of the meaningful-relevant label tends to 
be minimized because the most dominant 
associate elicited by these stimuli, used as 
the representative label for the MR 
condition, is less representative of these 
stimuli than the MR labels associated 
with the other stimulus conditions. 

The second finding of interest con- 
cerns the role of properties of the verbal 
label in transfer to a paired-associate 
task requiring S to learn new identifying 
responses to the shapes. An examina- 
tion of the performance curves indicated 
that response relevance contributed prin- 
cipally to the transfer effects after 8 
pretraining trials, whereas response 
meaningfulness emerged as the effective 
variable influencing transfer perform- 
ance after 16 pretraining trials. Of spe- 
cial interest is the fact that the role of 
response variables in the transfer task 
does not mirror in any direct fashion 
the role of these variables in the recogni- 
tion task. This finding implies that per- 
formance in transfer tasks requiring the 
learning of new responses is mediated by 
processes that are in part different from 
those that mediate performance in recognition
tasks. In the former task, attaching
MIR labels to the set of shapes provides
Ss with sufficient information about
the distinctive features of the stimuli to 
enable them to learn more rapidly the 
transfer task than Ss given CVC labeling 
practice. More generally, MIR label- 
ing practice provides for a sufficient 
amount of cue learning critical for per- 
forming a new task in which the stimuli 
are the same as those learned during pretraining,
but does not provide for sufficient
cue learning when the task requires
recognition of a previously learned shape 
immersed among highly similar distractor 
items. In this connection it must be 
noted that the degree of label learning is 
not highly correlated with performance 
in the recognition task and cannot, therefore,
account for the magnitude of differences
in recognition performance. Moreover,
these results appear consistent with
the view (Vanderplas, 1963) that trans- 
fer of training in perceptual learning de- 
pends on the relationship between pre- 
training and transfer-task activities. 

The net effect of these experiments is 
to suggest that a verbal label may serve at 
least two functions in predifferentiation 
studies, functions which differ somewhat 
depending upon the nature of the criterion
task employed. First, the label
can provide S with confirmation that he 
has reliably distinguished among the set 
of pretraining stimuli, as reflected in the 
number of correct anticipations. A correct
anticipation as such provides S with
confirmation only that he can distinguish 
minimally among the pretraining stimuli; 
increased proficiency in responding can 
be, but need not be, accompanied by any 
additional learning about distinctive fea- 
tures of the stimuli beyond that minimally 
sufficient for correct anticipation. This 
function is present with almost any verbal 
label, provided that it is well integrated, 
and may be characterized as the general 
function of the label. This function ap- 
pears to be operative primarily when the 
criterion task requires that S learn 
to attach new responses to the same stim- 
uli and is supported by the fact that 
performance curves for the switch-press- 
ing task are correlated positively with 
the level of performance reached during 
pretraining. Second, the recognition 
data suggest that verbal labels may 
serve an additional function in which cri- 
terion performance is somewhat more 
dependent upon specific properties of the 
label. This second and more specific 
function emphasizes the “directing” prop- 
erties of a verbal label in which addi- 
tional distinctive features of the stimuli 
are learned over and beyond those minimal
aspects associated with the label’s
confirming function. Although the di- 
recting property of the label is an im- 
portant determinant of recognition per- 
formance, the role of meaningfulness of 
the verbal label is minimal in influencing 
recognition. The relative roles of the 
confirming and directing functions might 
vary not only with the requirements of 
the transfer task but also with the nature 
of the intratask relations among the pretraining
stimuli. It is possible, e.g., for
the pretraining stimuli to be sufficiently 
similar so that the confirming function 
itself will reflect near-asymptotic learn- 
ing about the distinctive features of the 
stimuli, allowing no effective operation 
of the directing function of the verbal 
label. In this circumstance, it might be 
expected that differences in recognition 
performance between the MR and MIR 
conditions would be minimized, in contrast
to the results obtained in Exp. I
and II.


INFORMATION FEEDBACK, INSTRUCTIONS, AND
INCENTIVES IN THE GUIDANCE OF
HUMAN CHOICE BEHAVIOR 1


WILLIAM C. HOWELL AND JOSEPH T. EMANUEL
Ohio State University 


2 experiments explored the influence of information feedback (IF) on
choice behavior in a task requiring discrimination between 2 populations
of meter readings. Sample information was provided through
stimuli and IF, with the major objective being to determine the
relative efficacy of these sources in guiding behavior under a variety
of pertinent task conditions (% IF, specificity of instructional set,
monetary payoff). Results supported the conclusion that IF is the
only source of information made use of consistently in this situation;
stimulus information seemed only to inhibit discrimination.


There seems to be widespread agree- 
ment that one—if not the only—im- 
portant function served by information 
feedback (IF) is guidance of future 
behavior (Annett, 1961; Bilodeau, 
1966; Bourne, 1966; Goldstein, Emanuel,
& Howell, 1968).2 If this is so,
one would expect acquisition and main- 
tenance of a particular mode of re- 
sponse, especially that which is desig- 
nated correct, to reflect the frequency 
with which IF guidance is provided. 
In support of this reasoning, perform- 
ance has been shown to vary directly 
with IF frequency or proportion for a 
variety of different tasks: e.g., tracking 
(Poulton, 1957), positioning (Bilo- 
deau, Bilodeau, & Schumsky, 1959), 


1 The research reported in this paper was
carried out at the Human Performance Cen- 
ter and was sponsored by the Aeromedical 
Research Laboratories, Aerospace Medical 
Division, Air Force Systems Command, 
Wright-Patterson Air Force Base, Ohio, un- 
der Contract No. AF 33(615)-2248 with the 
Ohio State University Research Foundation. 
Further reproduction is authorized to satisfy 
the needs of the United States Government. 

2 As Bilodeau (1966) has pointed out, it 
is immaterial for predictive purposes whether 
one chooses to emphasize future responses 
and speak of cueing or to emphasize past re- 
sponses and speak of reinforcement, so long 
as the terms remain at an empirical level. 


line drawing (McGuigan, 1959), con- 
cept formation (Bourne & Pendleton, 
1958), and card sorting (Bourne, 
1966). 

Recently, however, several reports 
have appeared suggesting that this re- 
lationship is not invariably so simple. 
First, McCormack, Bendig, and McEl- 
heran (1963) and McCormack and 
McElheran (1963) showed that re- 
action time is insensitive to IF propor- 
tion in excess of 20-30%. Subse- 
quently, others have reached much the 
same conclusion with respect to vigi- 
lance detection (Johnson, Payne, & 
Carr, 1966) and probabilistic decision 
making in a complex system (Gold- 
stein, Southard, & Schum, 1967). 
Finally Goldstein et al. (1968) have 
reported that decision making based 
upon conditional probability judgments
is totally insensitive to IF frequency: 
Ss learned as quickly and maintained 
performance as well when given no IF 
as they did under any of three other 
IF levels (including 100%).

One obvious (and certainly not original)
explanation for differences in
IF potency lies in the fact that feed- 
back is but one of several sources of 
information upon which S may base 
his future behavior. How important 
it is on any trial, and what is likely
to happen to performance if it is with- 
held on some proportion of trials, may 
thus be simply a function of the relative 
use which S is able to make of alterna- 
tive sources of guidance—instructions, 
primary stimuli, etc.—in that particular 
task. At the extremes, of course, this 
proposition is self-evident: If one tells 
S exactly what to do and he is able to 
do it, there is little room for an IF 
effect; conversely, if one tells him no- 
thing, he must rely exclusively upon IF 
for guidance. 

While most tasks used to study IF 
fall somewhere between these two ex- 
tremes, it would appear that those 
which consistently yield a proportional 
IF frequency effect (e.g., positioning, 
card sorting) afford less pertinent 
guidance through instructions than do 
those which have shown little or no 
sensitivity to IF (e.g., RT, choice
behavior). If this interpretation is 
correct, it should be possible to produce 
both IF effects in a common task by 
appropriate manipulation of instruc- 
tions and IF; the only requirement 
would be that the task permit both 
sources to offer roughly the same sort 
of guidance in varying amounts. 

The present research, then, was de- 
signed to test this reasoning. Two 
studies were carried out: the first, to 
determine at a gross level the relative 
efficacy of several guidance media (in- 
structions, IF, primary stimuli); and 
the second, to establish whether, under 
conditions dictated by the first experi- 
ment, a proportional IF frequency ef- 
fect can be generated in a task pre- 
viously shown not to produce one. 


EXPERIMENT I 


The task used by Goldstein et al. 
(1968) was ideally suited for present 
purposes for two reasons: It permitted 
virtually the same critical information 
to be furnished via instructions, cum- 


ulative IF, or cumulative stimulus oc- 
currence; and its insensitivity to IF 
frequency under highly specific instruc- 
tions had already been demonstrated. 
Experiment I was designed as a fac- 
torial combination of IF and instruction 
specificity, the intent being to deter- 
mine, first, under what conditions it is 
possible to obtain changes in behavior, 
and second, whether IF and instruc- 
tions are equivalent means of guiding 
such changes. Since one condition 
provided neither IF nor specific in- 
structions, it was possible to establish 
the relative efficacy of the primary stim- 
uli as well. In addition, monetary pay- 
off was included as an orthogonal vari- 
able in case a motivational, rather than 
an informational, facet of IF were re- 
sponsible for its potency or lack there- 
of. The eight resulting conditions 
were administered over six blocks of 
100 trials each in order to permit ade- 
quate assessment of behavior changes. 


Method 


Subjects and design.—Sixty-four paid un- 
dergraduate students were assigned to one of 
eight treatment groups at random with the 
restriction that Ss of the same sex be dis- 
tributed as evenly as possible across groups. 
Groups were defined by the eight combina- 
tions of instructions (specific and nonspe- 
cific), IF (0% and 100%), and payoff 
(money and no money). The six 100-trial 
blocks constituted a within-Ss variable in the
mixed analysis of variance model. 

Apparatus and procedure.—The general
task and apparatus used are described fully 
elsewhere (Goldstein et al., 1968; Howell
& Funaro, 1965). Briefly, S was required 
to choose, on each of a series of trials, which 
of a pair of ammeters (designated A and B) 
comprising the display was set at the higher 
value. Since only one of the two numbers 
selected as meter readings on any trial was 


3 Although magnitude of monetary incen- 
tive has been shown to be a rather weak 
variable in between-Ss designs (Schnorr, 
Lipkin, & Myers, 1966), it does exert a 
marked influence on choice behavior when 
one of the levels is zero (Myers, Fort, Katz, 
& Suydam, 1963). 
actually displayed, it was necessary for him
to make his choice based upon the value given 
and his accrued knowledge of the other 
meter’s behavior. Crucial information, 
therefore, was available in the statistical 
parameters of the A and B meter readings. 

Two populations of values between 0-200 
were sampled randomly to produce each pair 
of A and B readings; both were distributed 
normally with a common sigma of 25 units 
and means of 80 and 95. All readings 
assigned to the A (left-hand) meter for a 
given S were drawn from one of these dis- 
tributions and all B readings came from the 
other; in order to control position bias, however,
the higher (i.e., X = 95) distribution
was assigned to each meter for half of the Ss. 

A total of 600 pairs of readings, corre- 
sponding to 600 decision trials, was presented 
to each S in two 1-hr. sessions. Readings
used as stimuli appeared alternately on A 
and B meters over the entire 600 trials so 
that S might gain equal familiarity with 
both distributions. 
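
The sampling scheme just described can be sketched as follows. This is only an illustration under stated assumptions, not the procedure actually used to prepare the stimuli: readings outside 0-200 are simply redrawn, values are left unrounded, the first displayed reading is on Meter A, and the function names and the seed are hypothetical.

```python
import random

def draw_reading(rng, mean, sigma=25.0, low=0.0, high=200.0):
    """Draw one reading from a normal population, redrawing out-of-range values."""
    while True:
        value = rng.gauss(mean, sigma)
        if low <= value <= high:
            return value

def make_trials(n_trials=600, mean_a=80.0, mean_b=95.0, seed=0):
    """Build a trial list in which the displayed meter alternates A, B, A, B, ..."""
    rng = random.Random(seed)
    trials = []
    for i in range(n_trials):
        a = draw_reading(rng, mean_a)
        b = draw_reading(rng, mean_b)
        trials.append({
            "A": a,
            "B": b,
            "shown": "A" if i % 2 == 0 else "B",   # only this reading is displayed
            "correct": "A" if a > b else "B",      # meter with the higher reading
        })
    return trials

trials = make_trials()
print(len(trials), trials[0]["shown"], trials[1]["shown"])
```

For the half of the Ss given the higher distribution on Meter A, the same sketch applies with `mean_a` and `mean_b` exchanged.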

As noted above, S’s evaluation of each 
stimulus reading should be influenced by his 
expectation concerning the other meter: i.e., 
if the other meter is usually higher than 
the particular value given, he should choose 
the other meter; otherwise, he should 
choose the one presented. In general, then, 
his behavior should be indicative of how well 
he is able to discriminate the differences in 
mean values produced by the two meters. 
Evidence of discrimination would appear in 
his choice behavior as differential criteria or 
empirical “cutoff points” for A and B meters. 
That is, the value of A at which he switches 
from a choice of A to one of B (his B 
cutoff) and the value of B at which he 
switches from B to A (his A cutoff) should 
not be the same if he discriminates between 
the two distributions and applies this knowl- 
edge to his choices. In fact, his chances of 
being correct would be greatest if his cutoff 
points corresponded exactly to the distribution
means.
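
The claim that accuracy is greatest when a cutoff coincides with a distribution mean can be illustrated numerically. The sketch below assumes untruncated normal populations with the stated sigma of 25 and approximates the probability of a correct choice by a simple Riemann sum; the function names and the step size are illustrative choices, not part of the original report.

```python
import math

SIGMA = 25.0  # common standard deviation given in the text

def normal_pdf(x, mean, sigma=SIGMA):
    z = (x - mean) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

def normal_cdf(x, mean, sigma=SIGMA):
    return 0.5 * (1.0 + math.erf((x - mean) / (sigma * math.sqrt(2.0))))

def accuracy(cutoff, shown_mean, other_mean, sigma=SIGMA, step=0.1):
    """P(correct) when S keeps the displayed meter only if its reading exceeds cutoff."""
    start = shown_mean - 8.0 * sigma
    n_steps = int(round(16.0 * sigma / step))
    total = 0.0
    for k in range(n_steps + 1):
        x = start + k * step
        p_shown_higher = normal_cdf(x, other_mean, sigma)  # P(hidden reading < x)
        p_correct = p_shown_higher if x > cutoff else 1.0 - p_shown_higher
        total += normal_pdf(x, shown_mean, sigma) * p_correct * step
    return total

# Meter A (mean 80) is displayed; Meter B (mean 95) is hidden.
at_other_mean = accuracy(cutoff=95.0, shown_mean=80.0, other_mean=95.0)
at_own_mean = accuracy(cutoff=80.0, shown_mean=80.0, other_mean=95.0)
at_midpoint = accuracy(cutoff=87.5, shown_mean=80.0, other_mean=95.0)
print(at_other_mean, at_midpoint, at_own_mean)
```

The reasoning behind the result is that, for any displayed value, the hidden meter is more likely to be higher exactly when the displayed value falls below the hidden meter's mean, so a cutoff placed at that mean makes the better choice at every value.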

Clearly, information regarding the nature 
of the two distributions which generate stim- 
ulus readings is of value in performing the 
task. Therefore, it was this information
which was manipulated differentially through 
instructions and IF in the first study. Un- 
der nonspecific instructions (NI) Ss were 
told simply to attempt to choose the higher 
reading on each trial; under specific instruc- 
tions (SI) they were also made aware of 
all facets of the generating process, including 
a description of the individual and compara- 


tive features of the A and B distributions 
(except for the actual numbers involved). 

For IF conditions, S’s choice (activating 
an A or B switch) was followed im- 
mediately by a presentation of both A and 
B readings, enabling him to observe directly 
which one had actually been higher. No 
such feedback was administered in the no-IF 
conditions. It should be noted that the 
basic information provided by IF was of 
the same kind as that furnished by stimulus 
readings per se (by virtue of the alterna- 
tion procedure); without IF, however, 2 
trials were required to obtain one sample 
from each distribution, while with IF this 
information appeared in a single trial. Fol- 
lowing each session of 300 trials, summary 
feedback was presented to all Ss in terms of 
total number of correct responses. This 
was intended merely as a means of maintain- 
ing interest, and was considered to be vir- 
tually devoid of information value. 

Under payoff conditions, an incentive of 1¢ 
was awarded for each correct choice and a 
penalty of 1¢ was exacted for each error 
at the end of the experiment. No monetary 
rewards or penalties were used in the no- 
payoff conditions. 


Results and Discussion 


The primary dependent variable was 
the difference between empirical cutoff 
values established by each S for the 
A and B meters over a 100-trial block. 
This index appears most indicative of 
S’s behavior in discriminating between 
A and B distributions, and has proved 
quite sensitive in all earlier studies 
(Goldstein et al., 1968; Howell & Fun- 
aro, 1965). Each cutoff is calculated 
from responses made to the alternative 
meter: e.g., an A cutoff is defined as 
the point midway between the highest 
B value at which A is chosen and the 
lowest B value at which B is chosen 
over a block of trials (i.e., it repre- 
sents the average point at which S 
switches his choice from the meter on 
which a reading is presented to that 
on which it is not). At the outset of 
learning, before establishment of a 
discrimination, the A-B cutoff differ- 
ence is typically near zero; S behaves 
as though all values were drawn from 
a single distribution. As samples are 
drawn and the discrimination is ac- 
quired, however, a systematic differ- 
ence gradually develops (see Howell & 
Funaro, 1965). 
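
The cutoff definition given above can be sketched in code. This is a minimal illustration, not the scoring procedure actually used: the function names, the trial representation, the sign convention (A cutoff minus B cutoff), and the example readings are all assumptions.

```python
def empirical_cutoff(trials):
    """Midpoint between the highest presented value at which S switched to
    the unseen meter and the lowest presented value at which S stayed.

    trials: iterable of (presented_value, switched) pairs for one meter
    within one block; switched is True when S chose the unseen meter.
    Returns None when either kind of response is missing.
    """
    switched = [value for value, did_switch in trials if did_switch]
    stayed = [value for value, did_switch in trials if not did_switch]
    if not switched or not stayed:
        return None
    return (max(switched) + min(stayed)) / 2


def cutoff_difference(b_presented_trials, a_presented_trials):
    """A cutoff minus B cutoff (assumed sign convention)."""
    a_cutoff = empirical_cutoff(b_presented_trials)  # defined on B readings
    b_cutoff = empirical_cutoff(a_presented_trials)  # defined on A readings
    if a_cutoff is None or b_cutoff is None:
        return None
    return a_cutoff - b_cutoff


# Hypothetical block: S abandons B readings up to 92 but A readings only up to 78.
b_trials = [(85, True), (92, True), (96, False), (101, False)]
a_trials = [(70, True), (78, True), (84, False), (90, False)]
print(cutoff_difference(b_trials, a_trials))  # 13.0
```

A difference near zero corresponds to the undifferentiated behavior described for the outset of learning.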

Figure 1 summarizes the significant 
results of this study. Apparently, the 
typical discrimination occurs reliably 
only under IF conditions. Without 
IF, cutoff differences stabilize near zero 
(indicating no discrimination) after an 
initial bias favoring the wrong meter 
is overcome. An analysis of variance 
indicated that both blocks and IF were 
highly significant effects, F (5, 280) 
= 7.24, p < .001, and F (1, 56) = 
8.015, p < .01, respectively. Post hoc 
comparisons of individual means iso- 
lated the effects as follows: Block 1 
differed significantly from all other 
blocks and the IF conditions differed 
significantly only after Block 1. Thus, 
the bulk of the discrimination seems to 
occur in 200 trials when IF is admin- 
istered. On the other hand, even 600 
trials, with 300 sample readings drawn 
from each meter population, is insuffi- 
cient to produce any evidence of a dis- 
crimination when the samples appear 
in the form of stimuli rather than IF. 

Fig. 1. Comparison of mean cutoff differences between the X = 95 and X = 80 meters 
obtained with and without IF over 600 trials. 

While specific instructions, coupled 
with the information available via stim- 
ulus samples, were incapable of sub- 
stituting reliably for IF, there was some 
evidence of discrimination under one 
no-IF condition: that for zero payoff 
and specific instructions. This is most 
clearly represented in Session 2 per- 
formance (i.e., Blocks 4-6, which ex- 
clude most of the acquisition trials). 
Here, the mean cutoff difference of 
5.40 obtained without IF is actually 
higher than the comparable IF mean 
(4.40); in all of the other IF vs. no-IF 
comparisons, the differences are on the 
order of those shown in Fig. 1 (i.e., 
very marked IF superiority with no- 
IF scores approximating zero). 

It is interesting to note that the 
one condition under which a discrimi- 
nation seems to have developed spo- 
radically in the absence of IF (SI-no 
payoff) is also characterized by un- 
usually large individual differences 
(SD = 5.50, which is nearly twice as 
high as for any other condition). In- 
spection of the individual functions sug- 
gested that this variability is attribut- 
able to two distinct types of behavior: 
Some Ss performed in much the same 
fashion as those receiving IF, while the 
rest produced functions very similar to 
those for the other no-IF conditions. 
This bimodality implies that some peo- 
ple, given the appropriate instructional 
set, are able to make the discrimination 
between meter populations on the basis 
of stimulus sampling while others are 
not; in contrast, few—if any—are cap- 
able of doing so in the absence of either 
IF or instructional guidance. 

The above reasoning is admittedly 
post hoc and speculative. It is of- 
fered only in the interest of reconciling 
the present results with those of Gold- 
stein et al. (1968); the latter, it will be 
recalled, obtained discrimination in the 
identical task situation without IF but 
with specific instructions. Since the 
mean cutoff functions produced by 
comparable conditions in both studies 
were very similar, it seems reasonable 
to conclude that instructions may, for 
some individuals under special condi- 
tions, substitute for IF guidance; how- 
ever, they are clearly not as dependable 
a means of establishing a statistical dis- 
crimination. 

When specific instructions were 
coupled with monetary payoffs in the 
absence of IF, there was no hint of a 
discrimination between meter popula- 
tions. In fact, the asymptotic cutoff 
difference achieved after 400 trials was 
precisely zero for the group, and indi- 
vidual functions did not fall into the 
bimodal pattern observed in the com- 
parable no-payoff conditions. Mone- 
tary payoff, therefore, seems to have 
inhibited whatever propensity instruc- 
tions may have had for guidance of dis- 
crimination. Perhaps, as Miller and 
Estes (1961) observed in studying 
visual discrimination, preoccupation 
with the monetary aspects of the task 
diverts attention away from the pri- 
mary discrimination. 


EXPERIMENT II 


The results of Exp. I established that 
when instructions provide no clue as to 
the stimulus-generating process, IF is 
an unambiguous and effective source of 
guidance in probabilistic choice be- 
havior. Experiment II, therefore, 
sought to determine the precise IF 
frequency (or proportion) function un- 
der these optimal conditions using five 
evenly spaced proportions from 0-100%. 


Method 


Subjects and design—Five experimental 
groups of eight Ss each were formed by 
random assignment of 40 paid undergradu- 
ates. Each group served under one of the 
following conditions: IF administered on 
0%, 25%, 50%, 75%, or 100% of the trials. 
Again, six blocks of 100 trials each were 
presented to each S in two sessions. 
Apparatus and procedure—The task was 
in all respects identical to that described in 
Exp. I for the condition in which neither 
specific instructions nor monetary incentive 
was provided. In all, 600 trials were pre- 
sented to each S with 300 readings appearing 
on each meter in an alternating fashion. Un- 
der partial IF conditions, the “reinforced” 
trials were selected randomly with the re- 
striction that an equal proportion of the 
total IF instances occur within each block. 
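
The scheduling restriction just described (random selection of IF trials, with an equal share falling in each block) can be sketched as follows. This is an assumed reconstruction for illustration only; the function name, the seed argument, and the use of Python's pseudorandom generator are not taken from the original procedure.

```python
import random


def partial_if_schedule(proportion, n_blocks=6, block_size=100, seed=None):
    """Return one list of booleans per block; True marks a trial with IF.

    IF trials are drawn at random within each block, so every block
    holds the same number of IF instances.
    """
    rng = random.Random(seed)
    per_block = round(proportion * block_size)
    schedule = []
    for _ in range(n_blocks):
        if_trials = set(rng.sample(range(block_size), per_block))
        schedule.append([trial in if_trials for trial in range(block_size)])
    return schedule


schedule = partial_if_schedule(0.25, seed=1)
print([sum(block) for block in schedule])  # [25, 25, 25, 25, 25, 25]
```

Sampling within blocks, rather than over all 600 trials at once, is what enforces the equal-proportion restriction.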


Results and Discussion 


A mixed model analysis of variance 
applied to the cutoff difference scores 
failed to reveal any evidence of a blocks 
effect, F (5, 175) < 1.00, suggesting 
that most of the acquisition occurred 
within the first 100 trials. The in- 
fluence of IF proportion, on the other 
hand, was significant, F (4, 35) = 
3.187, p = .025, and, as indicated in 
Fig. 2, quite systematic. Level of dis- 
crimination, as reflected in the average 
magnitude of the cutoff differences, is 
directly related to the proportion of 
trials on which IF is administered. 
Except for the discrepancy at 0% IF, 
which reflects a slight bias favoring the 
wrong meter, the trend approximates 
linearity. A minus difference is cer- 
tainly not indicative of discrimination; 
it is attributable, in all likelihood, to 
nothing more esoteric than a slight 
preference for the meter on which each 
reading is presented. In the absence 
of any better guidance, this preference 
—which would bias empirical cutoffs in 
the wrong direction—tends to domi- 
nate behavior. If one regards dis- 
crimination as truly zero at the 0% 
point, the resulting function is, in- 
deed, best described as linear. 
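
The degrees of freedom reported above follow from the design if one assumes a standard layout with one between-Ss factor (IF proportion) and one within-Ss factor (blocks). The sketch below only illustrates that arithmetic; the function name and the dictionary keys are hypothetical.

```python
def mixed_anova_df(n_groups, n_per_group, n_blocks):
    """Degrees of freedom for a one-between, one-within mixed design."""
    between = n_groups - 1
    subjects_within = n_groups * (n_per_group - 1)
    blocks = n_blocks - 1
    within_error = blocks * subjects_within
    return {
        "groups": (between, subjects_within),
        "blocks": (blocks, within_error),
        "groups_x_blocks": (between * blocks, within_error),
    }


# Five groups of eight Ss, six blocks of 100 trials each.
df = mixed_anova_df(n_groups=5, n_per_group=8, n_blocks=6)
print(df["groups"])  # (4, 35)
print(df["blocks"])  # (5, 175)
```

These values agree with the F (4, 35) and F (5, 175) terms cited in the text.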

The above finding is clearly at vari- 
ance with the notion that IF above 
some critical level (e.g., 25-30%) is 
ineffective in guiding behavior. On 
the contrary, it supports the directly 
opposite interpretation that each IF 
trial contributes proportionately to the 
development of the discrimination. 
Even more apparent is the fact that 
samples drawn from the two popula- 
tions and presented as stimuli contrib- 
ute nothing to the discrimination; only 
when these samples are involved in 
feedback do they affect observed cut- 
offs. 

Viewed in a somewhat different way, 
the present data also suggest that the 
efficacy of IF trials is highly dependent 
upon interpolated events. This is best 
illustrated by comparing the level of 
performance achieved under each per- 
centage of IF condition after compar- 
able absolute frequencies of IF (since 
the 25% condition contained only 150 
IF instances, this was used as the basis 
of comparison). The mean cutoff dif- 
ferences obtained at these points were 
approximately 2, 3, 6, and 8 for 25- 
100%, respectively. In other words, 
the efficacy of 150 instances of IF is 
far greater when massed than when 
no-IF instances intervene. Whether 
no-IF trials serve as a source of inter- 
ference or whether spacing per se is 
responsible cannot, of course, be 


answered on the basis of the present 
design. If, in fact, interference is in- 
volved, one is led to the curious con- 
clusion that potentially valuable infor- 
mation available through stimulus 
sampling not only fails to enhance the 
statistical discrimination, but actually 
inhibits it. 
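
The equal-frequency comparison above can be made concrete with a little arithmetic. The sketch below assumes IF instances accrue at exactly the nominal rate; the function name is hypothetical, and the mean cutoff differences are the approximate values quoted in the text.

```python
import math


def trials_to_reach(if_instances, percent_if):
    """Total trials needed to accumulate a given number of IF instances."""
    return math.ceil(if_instances * 100 / percent_if)


# Approximate mean cutoff differences after 150 IF instances, as quoted above.
reported_means = {25: 2, 50: 3, 75: 6, 100: 8}

for percent, mean_diff in reported_means.items():
    total = trials_to_reach(150, percent)
    no_if = total - 150
    print(percent, total, no_if, mean_diff)
# 25 600 450 2
# 50 300 150 3
# 75 200 50 6
# 100 150 0 8
```

Laid out this way, the cutoff difference falls as the number of intervening no-IF trials rises, which is the relation the paragraph describes.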

One final result which deserves men- 
tion is the discrepancy between com- 
parable Exp. I and Exp. II conditions. 
While the terminal cutoff levels 
achieved under 100% and 0% IF 
in the two studies were not widely dis- 
crepant (9.70 vs. 11.87 for 100% and 
2.50 vs. 2.25 for 0%), the rate of 
acquisition apparently was: There was 
a significant improvement from Block 
1 to Block 2 in the first study and 
virtually none in the second. The most 
plausible explanation seems to lie in 
the fact that means reported for Exp. I 
(see Fig. 1) include the effects of sev- 
eral treatment conditions other than 
percentage of IF. While these effects 
were not potent enough to produce 
statistical significance, they may have 
had some influence, especially in the 
early trials. Considering only the 
strictly comparable Exp. I data (i.e., 
100% IF, no payoff, nonspecific in- 
structions), the discrepancy on Block 
1 between Exp. I and II means reduces 
from about 9 to 3 units. Thus, the 
safest generalization that can be made 
with respect to acquisition is that under 
high-IF proportions the greatest part of 
it occurs rapidly somewhere within 200 
trials. In view of the statistical nature 
of the task, it would be difficult to chart 
the function much more precisely than 
this (i.e., the unit of behavior is neces- 
sarily rather broad). 


CONCLUSIONS 


The present results serve to clarify 
the influence of IF on guidance of be- 
havior in a task requiring statistical 
judgments. In essence, they show: (a) 
IF is necessary for reliable development 
of discrimination in this task in spite of 
alternative sources of guidance; (b) per- 
formance is directly related to the pro- 
portion of trials on which IF is admin- 
istered; and (c) no-IF trials reduce the 
efficacy of IF for establishing the critical 
discrimination. While they do not com- 
pletely explain the impotency of IF in 
earlier work on a similar task, these 
studies strongly suggest that instruc- 
tional set may be the determining factor. 
RETENTION OF VISUAL AND VERBAL CODES 
OF THE SAME STIMULI 


HARRY P. BAHRICK AND BARBARA BOUCHER 
Ohio Wesleyan University 


80 Ss were shown drawings of common objects and later given verbal 
recall tests for the object names, followed by visual recognition tests 
scaled to reflect the accuracy of the visual retention. A 2 × 2 × 2 
design permitted assessment of the effects of degree of training, 
of retention interval, and of instructions to verbalize during training. 
The most significant finding was that the probability of recall of 
object names is essentially uncorrelated with the accuracy of visual 
recognition of these same objects by the same Ss. As an explanation 
the possibility of independent storage systems for the visual and 
verbal information related to the same objects was considered but 
rejected as improbable. Verbal recall 2 wk. after training seemed to 
be based largely upon the retrieval of visual storage. The recoding 
potential of visual storage into a verbal concept appears to be deter- 
mined by those aspects of the visual storage which distinguish the 
object class, and not by aspects which distinguish among objects of the 
same class. Visual storage losses of both aspects occurred during the 
2-wk. period examined, but no significant correlation existed between 
the 2 types of losses. 


Nonverbal information may be en- 
coded verbally to facilitate storage and 
retrieval. Some verbal recoding 
seems to occur almost instantaneously 
at the time of the input; in other in- 
stances the information may be stored 
nonverbally, and the recoding is de- 
layed until retrieval becomes neces- 
sary. Recoding would appear to be 
immediate in the case of reading mate- 
rial. One rarely learns or remembers 
much about the visual characteristics of 
print one has read. Rather, the visual 
stimuli are habitually recoded, and 
storage and retrieval apply mostly to 
this verbally transformed information. 
If one is asked to recall the color of a 
couch in the living room of a friend’s 
home, however, it is likely that the ver- 
bal transformation occurs at the time of 
recall, and is based upon stored visual 
information. In addition to such gen- 
eral cultural habits, individual differ- 
ences, of course, exist in regard to the 
amount of verbal recoding of informa- 
tion. Such differences were noted as 
early as 1924 in connection with maze 
learning (Warden, 1924) and have 
been observed in other contexts since 
then (Bartlett, 1932; Posner & Konick, 
1966). 

In many instances a visual stimulus 
pattern might lead to nonverbal as 
well as recoded verbal storage, but 
little is known about the extent to which 
such multiple codes initiated by the 
same stimulus pattern interact, or the 
extent to which they remain inde- 
pendent of each other once they are in 
storage. A considerable number of 
investigations have compared visual 
and verbal memory, but this has 
usually been done by comparing differ- 
ent groups of Ss (Kurtz & Hovland, 
1953; Lieberman & Culpepper, 1965) 
or different types of stimulus input 
(Belbin, 1950; Cohen, 1967; Kurtz & 
Hovland, 1953; Paivio & Yarmey, 
1966). The purpose of the present 
investigation was to obtain two mea- 
sures of retention from the same Ss for 
the probability of verbal recall and for 
the accuracy of visual recognition of the 
same stimuli, and to determine the re- 
lations between these indicants over a 
period of time. 


METHOD 


The Ss in this study were shown draw- 
ings of common objects on an oversize mem- 
ory drum. They were tested for verbal re- 
call of the object names and for accuracy 
of visual recognition of the drawings. These 
tests were administered following two de- 
grees of training and for two retention in- 
tervals for each degree of training. Half 
of the Ss were instructed to name the ob- 
jects as they were presented, i.e., they were 
forced to recode verbally at the time of 
stimulus presentation. The other half of the 
Ss were given no such instructions and were 
thus free to delay the verbal coding to the 
time at which verbal recall was demanded. 

Stimulus material—Sixteen drawings of 
common objects each occupying an area of 
approximately 1.5 sq. in. were used. These 
drawings have previously been described in 
more detail (Bahrick, Clark, & Bahrick, 
1967). 

Procedure—Eighty men and women un- 
dergraduate volunteer Ss were assigned by 
systematic alternation to eight groups of 
10 Ss each. Four control groups of 10 Ss 
each, chosen from the same population and 
by the same method 9 mo. earlier, were 
available from a previous study (Bahrick 
et al., 1967). All 80 Ss were shown the se- 
quence of 16 drawings by exposing each 
drawing at a 2-sec. rate in the window of the 
memory drum. The independent variables 
of degree of training, retention interval, and 
forced vs. free verbal recoding were ar- 
ranged in a 2 × 2 × 2 design outlined in 
Table 1. 

TABLE 1 
SUMMARY OF THE EXPERIMENTAL DESIGN 

Group name | Recoding condition | Training trials | Retention interval 
FR-1-IT    | free               | 1               | 1 min. 
FR-1-DT    | free               | 1               | 2 wk. 
FR-9-IT    | free               | 9               | 1 min. 
FR-9-DT    | free               | 9               | 2 wk. 
IR-1-IT    | instructed         | 1               | 1 min. 
IR-1-DT    | instructed         | 1               | 2 wk. 
IR-9-IT    | instructed         | 9               | 1 min. 
IR-9-DT    | instructed         | 9               | 2 wk. 

The drawings were arranged in three ran- 
dom sequences, and these sequences were 
systematically alternated for Ss receiving 
nine exposure trials. A 30-sec. intertrial in- 
terval was used. The Ss receiving only one 
exposure trial were assigned to one of the 
three sequences by systematic alternation. 
The Ss were instructed to observe the series 
of drawings carefully as they would be re- 
quired to recognize them later. They were 
also told to disregard the sequence in which 
the drawings were shown. No instructions 
regarding verbalization were given to the 
free-recoding (FR) groups. The Ss in the 
instructed-recoding (IR) groups were given 
identical instructions, but were told in addi- 
tion to call out the name of each object as 
the drawing appeared in the window. All 
Ss received verbal recall tests followed im- 
mediately by visual recognition tests. These 
tests were administered either immediately 
after training for the immediate-testing 
(IT) groups or 2 wk. later for the delayed- 
testing (DT) groups. The verbal recall 
test required Ss to list the names of the ob- 
jects without regard to order. An answer 
sheet with 16 spaces was provided, and a 
time limit of 1 min. was imposed. For 
the visual recognition tests each of the 
original drawings was exposed on the mem- 
ory drum, positioned randomly among 10 
distractor drawings of the same type of 
object. The distractors were chosen on the 
basis of an interval scale of similarity so 
as to represent five degrees of dissimilarity 
to the correct drawing. Figure 1 shows a 
sample row from the recognition tests. The 
correct choice is labeled “0.” 
This scale of distractors and the details of 
the recognition-testing procedure are identi- 
cal to those described in Bahrick et al. 
(1967). The Ss were allowed 8 sec. to mark 
their recognition choices on an answer sheet, 
and then the next drawing was exposed to- 
gether with its alternatives. Three dif- 
ferent random orders of rows for the recog- 
nition tests were assigned by systematic al- 
ternation to Ss. 

Fig. 1. One row from the recognition test. 

The four control groups from the pre- 
vious investigation had been treated in 
like manner as Ss in the four FR groups, 
but they had been given no verbal recall tests 
preceding the visual recognition test. This 
also diminished the interval between train- 
ing and recognition testing by 1 min. 
These control groups permit a determination 
of the effect of the administration of the 
recall test on the recognition test. 


RESULTS AND DISCUSSION 


Verbal recall tests were scored to 
determine the number of correct object 
names listed; visual recognition tests 
were scored by categorizing responses 
as correct, or as first-, second-, third-, 
fourth-, or fifth-degree errors in ac- 
cordance with the similarity scales from 
which the alternatives were selected. 
Table 2 shows the total correct recall 
score for each group, the number of 
recognition responses in each error 
category, and the number of correct re- 
call responses associated with each of 
the recognition categories. Thus, the 
recall score of 27 listed with the recog- 
nition score of 41 in Column 2, Row 5 
of Table 2, means that 41 of the 160 
recognition responses of this group 
were first-degree errors, and that Ss 
were able to recall the names of 27 of 
the 41 objects for which they com- 
mitted visual recognition errors of the 
first degree. 

Recall performance—Analysis of 
variance applied to recall data indi- 
cates the expected significant effects 
due to the degree of training, F (1, 
72) = 18.86, p < .01, and due to the 
retention interval, F (1, 72) = 74.40, 
p < .01. Verbalization during train- 
ing also leads to improved verbal recall, 
F (1, 72) = 4.58, p < .05, and this is 
consistent with previous findings (Bel- 
bin, 1950; Murray, 1965, 1966). 
However, the effect is observed only 
if retention is tested immediately. Two 
weeks after training, verbal recall 
scores for the instructed-recoding 
groups are actually somewhat lower 
than for the free-recoding groups, and 
the interaction Recoding Instructions 
× Retention Interval is significant, F 
(1, 72) = 8.27, p < .025. None of 
the other interaction terms approach 
significance (p > .10). These results 
suggest that verbal recall immediately 
after training is based, at least in part, 
upon immediate verbal coding; it is 
facilitated by verbalization. It would 
appear, however, that verbal recall 2 
wk. later may be based primarily on 
visual storage. This seems to be the 
most plausible explanation for the sig- 
nificant interaction Recoding Instruc- 
tions × Retention Interval. 

TABLE 2 

NUMBER OF RESPONSES IN EACH CATEGORY OF THE RECOGNITION TEST 
WITH ASSOCIATED CORRECT RECALL SCORE 

Group | Correct recognition | Correct recall | Recognition Error 1 | Correct recall | Recognition Error 2 | Correct recall | Recognition Error 3 | Correct recall | Recognition Error 4 | Correct recall | Recognition Error 5 | Correct recall | Total correct recall 
FR-1-IT | 53 | 28 | 43 | 18 | 32 | 16 | 12 | 3 | 10 | 7 | 10 | 5 | 77 
FR-1-DT | 36 | 12 | 40 | 12 | 27 | 8 | 19 | 4 | 26 | 9 | 12 | 0 | 45 
FR-9-IT | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] 
FR-9-DT | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] 
IR-1-IT | [illegible] | [illegible] | 41 | 27 | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] | [illegible] 
IR-1-DT | 34 | 7 | 37 | 16 | 39 | 6 | 28 | 6 | 13 | 6 | 9 | 4 | 45 
IR-9-IT | 67 | 45 | 47 | 31 | 23 | 16 | 11 | 9 | 12 | 8 | 0 | 0 | 109 
IR-9-DT | 51 | 24 | 45 | 18 | 28 | 8 | 18 | 7 | [illegible] | 5 | [illegible] | 4 | 66 
Control-1-IT | 77 | | 39 | | 28 | | 6 | | 5 | | 3 | | 
Control-1-DT | 41 | | 47 | | 35 | | 21 | | 8 | | 3 | | 
Control-9-IT | 125 | | 23 | | 5 | | 2 | | 2 | | [illegible] | | 
Control-9-DT | 60 | | 59 | | 18 | | 12 | | 6 | | [illegible] | | 

Visual recognition scores—The ef- 
fect of recall upon recognition perform- 
ance is demonstrated by the data in 
Table 2. The previous finding (Bah- 
rick et al., 1967) that the technique of 
employing distractors of scaled similar- 
ity on the recognition tests yields a 
sensitive and reliable measure of the 
acquisition and retention of the recog- 
nition skill is confirmed. With few ex- 
ceptions monotonic gradients of gener- 
alization can be plotted from the data of 
each group, and the slope of the gradi- 
ents reflects the degree of training and 
the retention interval. For a graph 
and discussion of comparable gradients 
see Bahrick et al. (1967). Comparison 
of the control-group data in Table 2 
with the comparable FR groups in this 
investigation shows that the perform- 
ance of the control groups is somewhat 
superior. Of the four comparisons be- 
tween paired groups only the one in- 
volving the FR-1-IT groups shows a 
large difference. The extension of the 
retention interval by 1 min. appears to 
have lowered the performance of this 
group in comparison with its control. 
This additional amount of time was 
apparently less significant after nine 
training trials and obviously trivial for 
the DT groups. It would appear that 
there exist only minor interference ef- 
fects of the verbal recall test upon sub- 
sequent visual recognition perform- 
ance. It is possible, of course, that 
other interference effects occurred 
which are not reflected by the indicants 
used here. Interference effects of re- 
call upon subsequent recognition per- 
formance have been reported by Belbin 
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(1950) and were discussed by Kurtz 
and Hovland (1953) and more re- 
cently by Hanawalt and Tarr (1961). 

Effect of verbalization on recognition 
performance—Verbalization during 
training reduces visual encoding as evi- 
denced by comparison of recognition 
performance for the FR and IR groups. 
Individual comparisons between groups 
show that the difference occurs only 
with nine training trials where it per- 
sists over the 2-wk. retention period. 
It would appear that the verbalizing in- 
structions constitute, in effect, a dual 
task for Ss and that at higher levels of 
training the verbal responses occur 
partly at the expense of visual learning. 

Interrelations of visual and verbal 
performance with regard to the same 
stimuli—This analysis is directed at 
the principal question asked in the 
present investigation. To determine 
whether the likelihood of verbal recall 
was a function of the accuracy of the 
visual memory of the stimulus, the re- 
call frequencies in Table 2 were divided 
by the associated recognition frequen- 
cies. The quotients yield verbal recall 
probabilities for stimuli recognized 
with various degrees of accuracy and 
are shown in Table 3. In this table 
data have been pooled for recognition 
errors of third-, fourth-, and fifth-order 
magnitude in order to obtain probabili- 
ties based upon larger frequencies, i.e., 
in order to increase reliability. The 
table shows no trend for higher recall 
probabilities to be associated with 
greater accuracy of recognition. This 
remains true not only for the IR 
groups, but also for the FR groups, for 
both degrees of training, and for both 
intervals of retention. It thus appears 
clear that verbal recall probability is 
unrelated to the accuracy of the visual 
code as measured in this investigation. 
This finding permits two alternative 
explanations: either the verbal recall 
depended upon verbal storage estab- 
lished through verbal coding at the time 
of stimulation, and this verbal storage 
remained independent of the visual 
storage throughout the retention 
period, or the verbal recall was de- 
pendent upon aspects of the visual 
storage not reflected in the recognition 
tests used here. 

TABLE 3 

RECALL PROBABILITY ASSOCIATED WITH 
VARIOUS MAGNITUDES OF 
RECOGNITION ERROR 

Group   | 0   | 1   | 2   | 3-5 
FR-1-IT | .53 | .42 | .50 | [illegible] 
FR-1-DT | .33 | .30 | .30 | [illegible] 
FR-9-IT | .55 | .50 | .50 | [illegible] 
FR-9-DT | .55 | .36 | .48 | .56 
IR-1-IT | .62 | .66 | .54 | [illegible] 
IR-1-DT | .21 | .43 | .15 | [illegible] 
IR-9-IT | .67 | .66 | .70 | [illegible] 
IR-9-DT | .47 | .40 | .29 | [illegible] 
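
The quotients described above (recall frequency divided by the associated recognition frequency, with third- to fifth-degree errors pooled) can be sketched as follows. The function name and the `pool_from` parameter are hypothetical, and the frequencies are the FR-1-IT entries of Table 2 as they are read here; the sketch illustrates the computation and is not a reanalysis.

```python
def recall_probabilities(recognition, recall, pool_from=3):
    """Recall probability per recognition category, pooling the tail.

    recognition, recall: frequencies ordered as correct, first-, second-,
    third-, fourth-, and fifth-degree errors.
    pool_from: index of the first category to pool (3 = third-degree errors).
    """
    probs = []
    for n_recognized, n_recalled in zip(recognition[:pool_from], recall[:pool_from]):
        probs.append(round(n_recalled / n_recognized, 2) if n_recognized else None)
    pooled_recognition = sum(recognition[pool_from:])
    pooled_recall = sum(recall[pool_from:])
    probs.append(round(pooled_recall / pooled_recognition, 2) if pooled_recognition else None)
    return probs


fr_1_it_recognition = [53, 43, 32, 12, 10, 10]
fr_1_it_recall = [28, 18, 16, 3, 7, 5]
print(recall_probabilities(fr_1_it_recognition, fr_1_it_recall))
# [0.53, 0.42, 0.5, 0.47]
```

Pooling the three largest error categories gives a denominator of 32 responses for this group, rather than 10 to 12 in each category taken separately.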

The latter interpretation appears the 
more probable. To be sure, the free- 
recoding groups may have encoded 
stimuli verbally at the time of presenta- 
tion without any instructions to do so. 
However, the results of the recall data 
strongly suggest that this type of stor- 
age became a comparatively unim- 
portant determinant of recall perform- 
ance 2 wk. later. At the end of 2 wk. 
the free-recoding groups no longer dif- 
fered significantly from the instructed- 
recoding groups on the verbal recall 
test, despite the fact that the verbaliz- 
ing groups originally stored signifi- 
cantly more verbal information. This 
finding, supported by unsolicited re- 
ports of Ss, suggests that verbal recall 
2 wk. after training does depend 
heavily upon the recoding of visual 
storage. Yet, recall probability is un- 
related to the accuracy of the visual 
storage as measured by the recognition 
tests. It is concluded that the retri- 
eval and verbal recoding potential of 
visual storage depends on character- 
istics of the storage essentially uncor- 
related with the encoded degree of 
visual detail. Clearly there must be 
some limits to this principle. No S is 
likely to be able to verbalize the pres- 
ence of a handle on the stimulus cup if 
he cannot indicate memory of a handle 
by his choice of the appropriate alterna- 
tives in Fig. 1. However, it is ap- 
parent that the ability of S to report 
that he saw a cup is independent of 
whether or not he remembers that the 
cup had a handle. 

The recoding potential of visual stor- 
age to a verbal concept must therefore 
be determined by those aspects of the 
visual storage which distinguish it as 
an object class, and not by aspects 
which distinguish it from other ob- 
jects of the same class. Only the latter 
type of storage losses are reflected by 
the recognition tests used in this in- 
vestigation. Although visual storage 
suffers retention loss of both types dur- 
ing the 2-wk. period examined here, 
there appears to be no significant cor- 
relation between the two types of loss. 
EFFECT OF A CS FOR REINFORCEMENT 
UPON INSTRUMENTAL RESPONDING AS A FUNCTION 
OF REINFORCEMENT MAGNITUDE: 
A TEST OF INCENTIVE-MOTIVATION THEORY 


Incentive-motivation theory assumes that a stimulus which is paired 
with a reinforcing event acquires motivational properties with respect 
to instrumental responses, the strength of which are directly related to 
the magnitude of reinforcement. To test these assumptions, 1 group 
of rats (Group P) received pairings of 1 stimulus (tone or clicker) 
with a single food pellet and a 2nd with 10 pellets. For Group C, 
these 4 events were presented unsystematically with respect to one 
another. Then all Ss were trained to press a bar for food at the onset 
of a light, and then were given bar-press generalization test trials with 
the tone and clicker. Consistent with expectation, Ss showed enhanced 
generalization when the stimuli had previously been established as 
CSs for food. However, contrary to incentive-motivation theory, this 
facilitation was no greater to the 10-pellet paired stimulus than to 
the 1-pellet paired stimulus. 


One of the most interesting sets of 
predictions that follow from Spence’s 
(1956) rg-sg incentive-motivation 
theory of positively reinforced instru- 
mental learning is that one should be 
able to manipulate the strength of an 
instrumental response (R) to its dis- 
criminative stimulus (S) through 
manipulations of the pairing relation- 
ship between S and the reinforcer (Sᴿ) 
that are carried out independently of 
R. The general expectation is that any 
manipulation of the S-Sᴿ pairing rela- 
tionship which, on the basis of what is 
known about observable classically con- 
ditionable responses, should modify the 
strength of the rg conditioned to S, 
should thereby also modify the amount 
of incentive motivation elicited by S, 
and hence modify the strength of R 
when it is allowed to occur. In other 
words, one expects transfer of training 
from manipulations carried out in the 
classical conditioning paradigm to be- 
havior in the instrumental learning 
paradigm, when the two employ the 
same stimuli and reinforcer. 

There is by now a fairly sizable body 
of literature, employing a number of 
different instrumental learning tasks, 
showing that such classical → instru-
mental transfer of training does obtain
(e.g., Bower & Grusec, 1964; Estes,
1948; Trapold & Fairlie, 1965; Tra- 
pold & Winokur, 1967). And in gen- 
eral the results of these experiments 
are consistent with the predictions of 
incentive-motivation theory. 

Despite the general agreement between these data and Spence’s (1956)
theory, however, the assumption that pairing S with S^R imbues S with
motivational properties, and that it is the manipulation of this
motivation that is the basis for the transfer of training from the
classical to the instrumental paradigm, remains in question. The one
direct experimental test of this motivational assumption (Trapold,
1962) failed to find general response energizing properties accruing
to an S^R-paired stimulus in that such a stimulus did not serve to
potentiate the magnitude of a reflex startle response elicited in its
presence.

The purpose of the present experi- 
ment was to provide a different type 
of test of this motivational assumption, 
and particularly of the assumption that 
the level of this motivation is directly 
related to the magnitude of reinforce- 
ment for which the stimulus is a CS. 
The basis for this experiment comes 
from a finding reported recently by 
Trapold and Winokur (1967). They found that if S is first trained
with food reinforcement to make a specific response to the appearance
of a specific training stimulus, and then is tested for the amount of
stimulus generalization of that response to a different stimulus,
generalization is much greater if the generalization test stimulus has
previously been established as a CS for food than if it has not. This
is one of the classical → instrumental transfer effects alluded to
earlier as being consistent with Spence’s (1956) theory.
According to that theory, pairing the 
test stimulus with reinforcement should 
have imbued it with incentive-motiva- 
tional properties which, at the time 
of generalization testing, would have 
served to multiply whatever generalized 
habit strength the test stimulus evoked, 
and hence resulted in a stronger
response to the test stimulus.

If this analysis is correct, and if the 
major subsidiary assumption of incen- 
tive-motivation theory, namely, that the 
amount of incentive motivation is 
greater the larger the magnitude of re- 
inforcement (Spence, 1956), is also 
true, then it follows that in this gener-
alization-testing paradigm, generaliza- 
tion should be more facilitated to a 
stimulus that has been made a CS for 
a large magnitude of reinforcement 
than to a stimulus that has been made 
a CS for a small magnitude. This 
experiment tested that prediction using 
procedures essentially identical to those 
employed in the Trapold and Winokur 
(1967) study. 


METHOD 


Subjects and apparatus.—The Ss were 20
naive male Sprague-Dawley albino rats, ap- 
proximately 120 days old at the beginning 
of the experiment. 

The apparatus consisted of two identical 
Grason-Stadler E3125A-300 operant-condi- 
tioning chambers (with the left bar perma- 
nently removed) and the necessary program- 
ming and recording equipment. 

Procedure.—The procedure may be broken
down into six successive phases: adaptation, 
magazine training, preliminary bar-press 
training, classical pretraining, discrete trial 
bar-press training, and generalization testing. 

Prior to adaptation, 10 Ss were randomly assigned to each of the
chambers, and thereafter each S received all its treatments in its
assigned chamber. Adaptation consisted of two consecutive daily
sessions during which Ss were placed in the chamber for 30 min. with
the bar removed and the jewel light over the bar slot on.

Magazine training consisted of four daily sessions. With the bar
absent from the chamber, each S received 24 noncontingent
reinforcements with a variable interreinforcement interval averaging
1 min. In each session, a random half of these reinforcements
consisted of a single .045-gm. Noyes pellet, while the other half
consisted of 10 such pellets delivered to the food cup at the rate of
2/sec.

Following magazine training, the right response bars were inserted
into the chambers, and each S received two sessions in which each
press of the bar was reinforced with a single pellet to a total of 50
reinforcements per session.

Following preliminary bar-press training, the bars were again removed
from the chambers and all Ss received classical pretraining for the
next 38 consecutive daily sessions. For Group P (prepaired,
n = 6/chamber) each session consisted of 12 3-sec. presentations each
of a clicker (approximately 20 pps) and tone (approximately 2,000
cps), irregularly intermixed, with a variable interstimulus interval
averaging 1 min. For half of the Ss of Group P in each chamber, each
presentation of the tone was followed immediately by a single
reinforcement pellet, and each presentation of the clicker was
followed immediately by 10 pellets. For the other half of Group P the
clicker was paired with a single pellet, the tone with 10 pellets.
Group NP (not paired, n = 4/chamber) received the same pattern of tone
and clicker presentations, and the same number of 1- and 10-pellet
reinforcements, but the reinforcements were delivered unsystematically
with respect to the tone and clicker presentations and with respect to
time, the only restriction being that no two events could follow one
another by less than 7.5 sec.

Next, the bars were reinserted into the 
chambers, and all Ss were trained on a dis- 
crete trial bar-pressing task for nine daily 
sessions, 24 trials per session. Each trial
began with the onset of the houselight. The 
first bar press after trial onset terminated 
the houselight and delivered a single rein- 
forcement pellet. The intertrial interval was 
variable with a nominal average of 1 min. 
However, to this nominal interval was added 
the condition that the onset of any trial 
had to be preceded by at least 20 sec. with- 
out an intertrial response. Since this pro- 
cedure very rapidly results in a near-zero 
level of intertrial responding, such responses 
will hereafter not be considered. The re- 
sponse measure with this instrumental task is 
the latency of responding from trial onset. 

Next, Ss were tested for generalization of 
bar pressing to the tone and clicker for 
seven sessions. Each session consisted of
24 trials with the houselight training stimu- 
lus, exactly as described for discrete trial bar- 
press training. In addition, after Trials 6, 
12, 18, and 24, Ss received a single test trial 
with one of the two auditory stimuli in place 
of the houselight, the two stimuli being pre- 
sented alternately over test trials. Which 
test stimulus was presented first was coun- 
terbalanced across Ss. The procedure on 
test trials was otherwise identical to train- 
ing trials.


RESULTS AND DISCUSSION 


The results of generalization testing 
are shown in Fig. 1. To reduce the 
range of obtained latencies to plottable
proportions and to approximate more closely
the distributional requirements of
analysis of variance, mean latencies for 
blocks of test trials for each S were 
converted to logarithms. The data 
plotted for Group NP is the mean 
across the 8 Ss of this group of the log 
of the mean latency to both test stimuli 
for each session. The data for Group 
P is presented in two plots, the mean 
across the 12 Ss of that group of the 
log of mean latency on trials to the 1- 
pellet stimulus and similarly for trials 
to the 10-pellet stimulus. The data for 
the training stimulus is the average 
across all 20 Ss of the log of mean 
latency on Trials 6, 12, 18, and 24 of 
each session; i.e., the trials immediately 
preceding the generalization test trials. 
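
The transformation described above can be illustrated with a minimal sketch. It assumes base-10 logarithms (the text does not state the base), and the latencies and function names below are hypothetical, chosen only to show the order of operations: the mean latency is taken first for each S, then its logarithm, and the logs are then averaged across Ss.

```python
import math


def log_mean_latency(latencies):
    """Log10 of the mean latency for one S's block of test trials.

    Base 10 is an assumption; the source only says "logarithms".
    """
    return math.log10(sum(latencies) / len(latencies))


def group_mean_log_latency(per_subject_latencies):
    """Average the per-S log mean latencies across the Ss of a group."""
    logs = [log_mean_latency(block) for block in per_subject_latencies]
    return sum(logs) / len(logs)


# Hypothetical latencies (sec.) for three Ss in one test session.
example = [[1.0, 3.0], [10.0, 10.0], [50.0, 150.0]]
plotted_point = group_mean_log_latency(example)
```

Because the logarithm is taken before averaging across Ss, a single very slow S has much less influence on the plotted point than it would on a raw mean latency.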

It should be noted first that the laten- 
cies of Group P to both stimuli were 
much shorter than those of Group NP 
on all test sessions. A repeated-mea- 
sures analysis of variance, comparing
the test-trial performance of Groups P 
and NP across test sessions, yielded a 
significant group effect (p < .0001). 

Next, it should be noted that there 
were no substantial differences between 
the 1-pellet and 10-pellet stimuli of 
Group P in the strength of the re- 
sponses they evoked on any of the test 
days. Nowhere does the difference 
between performance to the 1-pellet 
and 10-pellet stimuli even approach 
conventional levels of statistical signifi- 
cance.

The enhancement of stimulus gen- 
eralization to stimuli which have been 
established as CSs for food provides 
a replication of that aspect of the Tra- 
pold and Winokur (1967) study. 
Moreover, in the present study, this facilitation was shown against a
control group that (a) was more completely equated with the
experimental group on factors other than the pairing relationship
between the stimuli and the reinforcer during the classical
conditioning phase, and (b) more nearly approximated the “truly
random” control group which, Rescorla (1967) has argued, provides the
theoretically most appropriate control condition for studies of
classical → instrumental transfer of training.

The failure to find more facilitation 
to the 10-pellet than to the 1-pellet stim- 
ulus is clearly inconsistent with the in- 
centive-motivational interpretation of
classical → instrumental transfer of
training.

Of course, this prediction was made 
on the general assumption that the rela- 
tively novel instrumental task used 
here is sensitive to magnitude of re- 
ward in a manner similar to the more 
conventional tasks, such as the straight 
runway, that have provided the bulk of 
the data leading to the development 
of the incentive-motivation concept. 
Clearly, if that were not the case, this
experiment could hardly be considered 
a critical test of the incentive-motiva- 
tion notion. 

In order to determine whether this 
task is sensitive to reward magnitude, 
a second experiment was conducted.
Adaptation, magazine training, and 
preliminary bar-press training were es- 
sentially identical to those phases of 
the main experiment reported here. 
Similarly, discrete trial bar-press train- 
ing was identical to that described 
above except that each S received 12 
tone and 12 clicker trials per session 
in an irregular order. For four Ss, re- 
sponses to the tone were reinforced 
with 10 pellets, and responses to the 
clicker by a single pellet. For another 
four Ss, these stimulus-magnitude com- 
binations were reversed. 

Figure 2 presents mean log mean la- 
tency to both the 1- and 10-pellet stim- 
ulus for the first 12 sessions of training. 
Clearly, response latency did depend 
upon reinforcement magnitude, both 
during acquisition and after responding 
had become asymptotic. Indeed, even 
after 25 additional training sessions 
beyond those depicted in Fig. 2, re- 
sponding to the 10-pellet stimulus con-
tinued to be faster than responding to 
the 1-pellet stimulus. 

Thus, the failure to find more facili- 
tation of generalization to the 10-pellet 
stimulus was not due to a general in- 
sensitivity of this type of task to rein- 
forcement magnitude. 

The results of this experiment, then, 
reinforce those of Trapold (1962) in 
showing that stimuli which have been 
established as CSs for reinforcement do 
not facilitate instrumental responding 
via a motivational mechanism. This 
conclusion leaves two major problems 
for the theory of instrumental re- 
ward learning. First, if a stimulus 
that has been paired with reinforce- 
ment does not acquire motivational 
properties, what does it acquire that 
can account for the effects of such 
pairings upon subsequent instrumental 
responding? Second, if different mag- 
nitudes of reinforcement do not estab- 
lish different levels of incentive motiva- 
tion, how does one account for the well- 
documented fact that instrumental per- 
formance generally is positively related 
to reward magnitude? 

In answer to the first question two 
major possibilities suggest themselves. 
The first, and the theoretically less in- 
teresting, is that the “classical” manip- 
ulations do not establish classically 
conditioned processes of any particular 
consequence for instrumental respond- 
ing. Rather, as has been pointed out 
before (cf. Rescorla & Solomon, 1967;
Trapold & Winokur, 1967), since 
some instrumental behavior is re- 
quired from Ss during the classical 
conditioning phases of this type of ex- 
periment, it may be that these instru- 
mental responses are what transfer to 
the instrumental task. There are a sizable number of specific ways
that such classically acquired instrumental responses might interact
with the measured response of the instrumental task and produce
facilitative and/or interference effects of the type that have been
found. And until experiments have been conducted showing classical →
instrumental transfer of reward learning in the absence of possible
mediation via classically acquired instrumental responses, this type
of explanation must be retained as a viable alternative.

Another alternative to incentive-motivation theory for explaining
classical → instrumental transfer effects would maintain the
assumption that these effects are mediated by classically conditioned
processes, but would view the mode of interaction of these processes
with instrumental behavior as other than motivational. One simple
example of such a theory places the burden of explanation upon the
assumption that on any given trial of an instrumental learning
experiment, the feedback stimuli, sg’s, of the classically conditioned
rg’s should occur before the instrumental response, and hence should
become a cue for it in the same way that an exteroceptive stimulus
that regularly precedes a reinforced instrumental response becomes a
cue for that response. If this were the case, and if S had already
received training sufficient to condition an instrumental response to
sg, then any exteroceptive stimulus (such as the generalization test
stimuli of this experiment) to which rg has been conditioned should,
by virtue of the cue value of sg, tend also to elicit the instrumental
response. Such a theory accounts nicely for the enhanced stimulus
generalization to a stimulus that has been made a CS for the
reinforcer, and, on the assumption that the sg resulting from a
10-pellet rg is not appreciably different from that resulting from a
1-pellet rg, this theory can account for the finding that the amount
of facilitation did not vary with the magnitude of reinforcement with
which the generalization test stimulus had been paired.

As for the question of how, in the 
face of these conclusions about incen- 
tive motivation, to account for ordinary 
magnitude of reward differences, the 
authors have no concrete proposals to 
offer. It should be kept in mind, how- 
ever, that even though incentive-moti- 
vation theory has come to be used to 
deal with both magnitude of reward 
phenomena and classical → instrumental transfer phenomena, there is at
present no compelling empirical basis for believing that the two are
in fact mediated by the same underlying learning processes. Further
research is clearly needed to clarify the relationship between these
two sets of phenomena.

It is often possible to construct unidimensional scales of
multidimensional stimuli. Thurstonian scaling theory predicts that the
standard deviations of differences in unidimensional
paired-comparisons scaling of such stimuli should covary with the
qualitative similarity between the stimuli being compared. In a test
of this prediction, 78 Ss judged 11 facial expressions from the
Lightfoot series with respect to emotional intensity. Stimuli were
selected so as to fall in 3 different clusters with large similarity
within clusters and small similarity between clusters. Within-cluster
comparisons gave smaller standard deviations than between-clusters
comparisons for all 11 stimuli.


The possibility of consistent judgments of complex stimuli has been
recognized for a long time (cf. Hollingworth, 1913). Unidimensional
Thurstonian scales have been constructed from such continua as
seriousness of criminal offenses and conservatism of attitude
statements, to name just a few of many possible examples (Ekman &
Sjöberg, 1965; Stevens, 1966). In addition, the newer so-called direct
methods of scaling devised primarily by Stevens (Stevens, 1957; cf.
Ekman & Sjöberg, 1965) have now been used successfully many times on
complex, nonsensory continua such as those just mentioned (Stevens,
1966).

The fact that unidimensional scales may be constructed from these
continua constitutes something of a paradox (Sjöberg, 1968) since
stimuli are most often clearly multidimensional. Apparently, Ss are
able to collapse a multidimensional space into various unidimensional
aspects or attributes.

1 This study was supported by a grant from the Swedish Council for
Social Science Research.

2 The author is indebted to Catharina Lundberg and Lars Bergman for
assistance in experimental work, to Trygg Engen for making available
the Lightfoot series of facial expression, and to Janet Frederiksen
for valuable comments on the manuscript.


According to Thurstonian scaling 
theory (Torgerson, 1958) the disper- 
sion of comparative judgments is af- 
fected by the correlation between the 
latent distributions of “discriminal pro- 
cesses” (subjective values). When 
the correlation between the two stimuli 
in a pair increases, the standard devia- 
tion of differences will decrease. Con- 
sequently, if the correlation is higher in 
pairs consisting of similar stimuli than 
in pairs of dissimilar stimuli, the stand- 
ard deviations of differences should be 
smaller for similar stimuli. The creation of Thurstonian methods
that allow for variation in covariance terms (Sjöberg, 1962, 1967)
makes a direct test of this prediction possible.
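
The dependence of the standard deviation of differences on the correlation can be made concrete with a minimal sketch. It uses the standard expression for the standard deviation of a difference between two correlated variables, sqrt(s_i^2 + s_j^2 - 2 r_ij s_i s_j); the numerical values and the function name are hypothetical and serve only to illustrate the direction of the effect.

```python
import math


def sd_of_difference(sd_i, sd_j, r_ij):
    """Standard deviation of the difference between two discriminal
    processes with dispersions sd_i and sd_j and correlation r_ij."""
    return math.sqrt(sd_i ** 2 + sd_j ** 2 - 2.0 * r_ij * sd_i * sd_j)


# Hypothetical unit dispersions; only the correlation differs.
low_similarity = sd_of_difference(1.0, 1.0, 0.0)   # uncorrelated pair
high_similarity = sd_of_difference(1.0, 1.0, 0.5)  # positively correlated pair
```

With equal dispersions, raising the correlation from 0 to .5 lowers the standard deviation of differences from the square root of 2 to 1, which is the pattern predicted for similar as against dissimilar stimuli.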

The prediction was strongly sup- 
ported in a study of verbal stimuli by 
Sjöberg (1968). The present study
was performed in order to test it with 
a set of complex nonverbal stimuli. 

Facial expressions were chosen as 
stimuli. It is well known that facial 
expressions are multidimensional (e.g.,
Abelson & Sermat, 1962). The attri- 
bute to be scaled was expressed emo- 
tional intensity. 

The following predictions were 
made: (a) perceived emotional in- 
tensity would be a unidimensional vari- 
able, and (b) the standard deviations
of differences would covary with the 
similarity between the stimuli being 
compared. 

The plan of the experiment was to 
investigate the dimensionality of the 
variable by means of a count of circu- 
lar triads in paired comparisons (Ken- 
dall & Babington-Smith, 1940), by 
studying the goodness of fit of unidi- 
mensional Thurstonian models for 
paired comparisons and category rat- 
ings, by comparing magnitude-estima- 
tion scales using different standards, 
and by comparing the scales resulting 
from Thurstonian methods with those 
resulting from the use of magnitude 
estimation. 

The standard deviations of differ- 
ences were to be estimated from paired 
comparisons using the method of suc- 
cessive-intervals scaling of paired com- 
parisons (Sjöberg, 1967). It was also 
considered interesting to investigate 
whether the standard deviations of 
magnitude estimations would be a 
function of the qualitative similarity 
between standard and variable. 

The desirable variation in similarity 
was to be produced by selection of stim- 
uli forming homogenous clusters in 
terms of the kind of emotion ex- 
pressed. Clusters should be clearly 
different from each other. Within- 
cluster comparisons would, then, pro- 
duce a high-similarity condition and 
between-cluster comparisons a low- 
similarity condition. 


METHOD 
Selection of Stimuli 


The entire Lightfoot series of facial ex-
pressions (Engen, Levy, & Schlosberg, 1957)
was first rated by 10 Ss as to emotional 
intensity of expressions. The Ss also identi- 
fied verbally the emotional content of each 
expression. Each picture was shown once.
The Ss first rated, using the method of mag- 
nitude estimation, its emotional intensity in 
comparison with a standard picture shown 
at the same time. After this rating, S wrote
down a guess as to what emotion was ex- 
pressed, e.g., “happiness,” “depression,” or 


i, ” 


Eleven stimuli were then selected on the
grounds that they should be unambiguous in
content and that the same intensity range 
should be spanned by all the emotions repre- 
sented. The first requirement was given 
priority, and because of this, the second re- 
quirement could only be met to a very modest 
extent. Three standards that would be about 
equal in intensity were selected for the main 
experiment. 

The following stimuli were chosen from 
the Lightfoot series: No. 14, 1, 35, 55, 5, 33, 
52, 47, 11, 22, and 10. They are labeled A 
through K, respectively, in the present study. 
According to the results of the pilot study, 
Pictures A-D represent joy or pleasure; 
E-G, sorrow; and H-K, anger. Thus, stim- 
uli were selected so as to form three different 
clusters. 


Subjects 


Seventy-eight psychology and art history 
students served as Ss; 57 females and 21 
males. 


Stimulus Presentation 


The experiment was conducted in a lecture 
hall and a seminar room. Ten to 15 Ss 
participated in a given session. The pictures 
were projected on a screen in front of Ss by 
means of one or two projectors (Leitz). 
The size of the projected pictures was 1 ×
.8 m. When two pictures were projected
simultaneously, two identical projectors were 
used and the distance between the pictures 
was 15 cm. The Ss were seated 3-5 m. 
from the screen. 


Procedure 


Three kinds of rating methods were em- 
ployed by each S: paired comparisons, cate- 
gory ratings, and magnitude estimation. The 
different methods were always employed in 
the above order. For each method two dif- 
ferent random orders of pairs or single 
stimuli were used; equally often backwards 
and forwards. Each S gave only one rating
for each pair or single stimulus. Positions 
within a pair (left-right and right-left) were 
rotated over Ss. The responses were re- 
corded in booklets with 5-6 responses on each 
page. 

Presentation time for each pair was about 
30 sec. The Ss who did not give a judgment 
of some pairs or single stimuli were allowed
to see them again and rate them. 

Paired comparisons.—Seven rating cate-
gories were used for paired comparisons,
plus the category of equality. The scale
was presented as a string of numbers as
follows: 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7.
A rating of 0 represented no difference in 
intensity. Ratings to the left of 0 indi- 
cated the left picture to be most intense and 
ratings to the right the opposite. Larger 
category numbers indicated larger differences. 

The use of the rating scale was first ex- 
plained by means of four examples of 
ratings illustrating the use of categories. 
Then, three pairs of practice pictures were 
presented from the Lightfoot series. The first 
two pairs were homogenous in content and 
the third was heterogenous. It was stressed 
very strongly that Ss should rate emotional
intensity regardless of content and that the 
size of difference rating also referred to 
difference in intensity regardless of content. 
The distinction appeared to be clear to all Ss. 

Category ratings.—Seven categories were
used in category ratings defined by numerical 
labels. The meaning of the scale was ex- 
plained by means of a few examples. A rat- 
ing of 1 was to be given to a very small in- 
tensity, 7 to a very large intensity. Each 
stimulus was presented only once. No other 
picture was exposed while one stimulus was 
being rated.

Magnitude estimation.—In magnitude esti-
mation the standard was assigned the number 
10. This number was shown in the lower 
edge of the standard picture. The vari- 
able stimulus was to be assigned a num- 
ber reflecting the relation between the two 
emotional intensities: an intensity half of 
the standard would be given the number 5, 
one twice as large the number 20, etc. Any 
numbers could be used as responses. The 
standard appeared randomly in both posi- 
tions and was shown anew for each vari- 
able stimulus. Stimuli A, G, and H served 
as standards. 
The presentation order of the different
standards was varied. All six possible per-
mutations of the three standards were used
equally often; i.e., 13 Ss were exposed to
each particular order. 
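
The counterbalancing arithmetic can be checked with a short sketch. The variable names are illustrative only; the three standards (A, G, and H) and the sample size of 78 are taken from the text.

```python
from itertools import permutations

# The three standards used in magnitude estimation.
standards = ["A", "G", "H"]

# Every possible presentation order of the three standards.
orders = list(permutations(standards))

# 78 Ss spread evenly over the orders.
subjects_per_order = 78 // len(orders)
```

Three standards yield six orders, and 78 Ss divide evenly into 13 per order, in agreement with the figure given above.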

Instructions explaining the use of the 
three kinds of rating methods were included 
in each response booklet. They were also 
presented orally by E. Each instruction was 
immediately followed by the application of 
the rating method in question.

The entire experiment lasted approxi- 
mately 70 min. 


RESULTS 
Circular Triads 


The mean number of circular triads 
in paired comparisons was 2.56. This 
is 5% of the maximum possible number 
of intransitivities. Twenty-nine Ss 
had no circular triads at all (37% of 
the sample). 
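
The 5% figure can be reproduced with a minimal sketch, assuming the Kendall and Babington-Smith maximum of (n^3 - n)/24 circular triads for an odd number n of stimuli (and (n^3 - 4n)/24 for even n). The function names and the boolean preference matrix are illustrative conventions, not the scoring procedure actually used in the study.

```python
from itertools import combinations
from math import comb


def count_circular_triads(prefers):
    """Count circular triads in a paired-comparison record.

    prefers[i][j] is True when stimulus i was judged more intense than
    stimulus j. Pairs judged equal have both entries False and so never
    contribute to a circular triad.
    """
    n = len(prefers)
    count = 0
    for i, j, k in combinations(range(n), 3):
        forward = prefers[i][j] and prefers[j][k] and prefers[k][i]
        backward = prefers[j][i] and prefers[k][j] and prefers[i][k]
        if forward or backward:
            count += 1
    return count


def max_circular_triads(n):
    """Largest possible number of circular triads for n stimuli."""
    return (n ** 3 - n) // 24 if n % 2 else (n ** 3 - 4 * n) // 24


n_stimuli = 11
maximum = max_circular_triads(n_stimuli)
share_of_maximum = 2.56 / maximum
share_of_all_triads = maximum / comb(n_stimuli, 3)
```

For 11 stimuli the maximum is 55 circular triads out of 165 possible triads, so a mean of 2.56 is a little under 5% of the maximum.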

The maximum number of circular 
triads to be expected if the equality 
category had not been available was 
estimated as follows. The number of 
triads including at least one equality 
judgment was counted. A fraction of 
this number was added to the number 
of circular triads. The fraction was 1/3,
since that is the probability of a circular 
triad for a maximally inconsistent S 
rating 11 stimuli. The mean error
rate estimated from these corrected 
frequencies of intransitivities for Ss 
who had any circular triads initially 
was 11%. Of all responses 12.7% 
were equality statements. 


Scale Values and Fit of 
Scaling Methods 


A summary of results from the ap- 
plication of various scaling methods is 
given in Table 1. 


TABLE 1

SCALE VALUES FROM PAIRED COMPARISONS,
CATEGORY RATINGS, AND MAGNITUDE
ESTIMATION (GEOMETRIC MEANS)

                                        Magnitude Estimation
Stimulus   Paired        Category    Standard   Standard   Standard
           Comparisons   Ratings        A          G          H

A             0.12         0.43       10.00       6.54       6.36
B             1.24         2.44       21.15      16.34      16.81
C             1.07         1.91       19.00      14.08      13.79
D             1.31         2.40       20.91      16.07      15.89
E             0.39         0.86       12.67       8.18       7.89
F             0.00         0.00        8.32       5.50       5.18
G             0.65         1.40       14.97      10.00       9.89
H             0.69         1.36       14.67      10.67      10.00
I             1.32         2.41       21.32      16.81      16.77
J             1.49         2.58       22.58      17.76      18.51
K             1.90         3.19       27.41      21.47      22.53

Fig. 1. Plots of the three magnitude scales against each other (geometric means).
(The straight lines were fitted with the method of least squares.)

The paired-comparisons data were analyzed using the
method described by Sjöberg (1967).
The category ratings were analyzed using
the method described by Diederich,
Messick, and Tucker (1957).

Goodness of fit was measured as the 
square root of the mean squared differ- 
ence between empirical and reproduced 
proportions. The fit was .028 for 
paired comparisons and .019 for cate- 
gory ratings. These values are in the 
range usually found. The category 
boundaries were found to be spaced in 
the usual manner; i.e., the size of the 
intervals between boundaries increased 
with category number. This was true 
both for paired comparisons and cate- 
gory ratings. 
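
The fit index defined above is a root-mean-square discrepancy, which can be sketched as follows. The proportions in the example are hypothetical and the function name is an illustrative choice; only the formula (square root of the mean squared difference between empirical and reproduced proportions) comes from the text.

```python
import math


def rms_discrepancy(empirical, reproduced):
    """Square root of the mean squared difference between empirical and
    model-reproduced proportions."""
    if len(empirical) != len(reproduced):
        raise ValueError("both lists must have the same length")
    squared = [(e - r) ** 2 for e, r in zip(empirical, reproduced)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical proportions for three stimulus pairs.
empirical = [0.10, 0.50, 0.90]
reproduced = [0.13, 0.50, 0.86]
fit = rms_discrepancy(empirical, reproduced)
```

Smaller values indicate that the unidimensional model reproduces the observed proportions more closely; a value of 0 would mean perfect reproduction.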

Magnitude estimation.—The three scales resulting from application of
different standards have been plotted against each other in Fig. 1.
The results are quite consistent. The ratio properties may be doubtful
because of nonzero intercepts in two plots. This lack of
proportionality does not seem to have any direct bearing on the
unidimensionality of the attribute, however. For further analysis the
three scales were averaged (arithmetic mean).

Relations between scales.—Figure 2 shows the pairwise relations among
the three scales. The relation between the paired-comparisons scale
and magnitude estimation is linear and the other two plots show slight
degrees of curvilinearity. If paired-comparisons data are analyzed
according to Case V (dividing equality judgments equally between the
two responses in a pair), a similar curvilinearity occurs in a plot
against magnitude-estimation data. The curvilinear trend is in
agreement with the usual finding when Thurstonian methods are compared
with magnitude or ratio estimation (Stevens, 1966).

Fig. 2. Plots of the three sets of scale values against each other.
(A least squares line was fitted to the
paired-comparisons-magnitude-estimation plot and the function
Y = a log X + b to the successive-intervals-magnitude-estimation plot.
The slightly curved regression of successive intervals on paired
comparisons was fitted by eye.)

TABLE 2
STANDARD DEVIATIONS OF DIFFERENCES FROM PAIRED COMPARISONS

       A     B     C     D     E     F     G     H     I     J     K

A      -    43    44    78    66    45    68    58    53    69    55
B     43     -    36    49    78    71    74    76    18    75    63
C     44    36     -    64    69    66    79    94    58   105    76
D     78    49    64     -    82    76    17    83    68    64    71
E     66    78    69    82     -    52    73    75    70    86    85
F     45    71    66    76    52     -    54    62    68    62    92
G     68    74    79    17    73    54     -    60    66    79   103
H     58    76    94    83    75    62    60     -    56    60    62
I     53    18    58    68    70    68    66    56     -    66    59
J     69    75   105    64    86    62    79    60    66     -    59
K     55    63    76    71    85    92   103    62    59    59     -

Note.—The decimal point and zero have been left out.


Standard Deviations of Differences and
Magnitude Estimations


The set of standard deviations of differences from paired comparisons
is reported in Table 2. The mean standard deviations for the within-
and between-clusters conditions are reported in Table 3. The
comparisons of the two sets of means in Table 3 show that the
prediction of larger between-clusters standard deviations is confirmed
for all 11 stimuli.

The standard deviations of magnitude estimations did not show the same
clear trend. Figure 3 shows the plot of standard deviations of
magnitude estimations against geometric means. The regression line was
fitted to between-clusters comparisons (i.e., the standard and
variable stimuli from different clusters). The hypothesis was that the
values for within-cluster comparisons would fall consistently below
the regression line. That prediction was confirmed for six of eight
points.

The trend found in paired-comparisons data may have been caused by the
fact that the variation in scale values is smaller within clusters
than between. If there is a positive correlation between absolute size
of difference and standard deviation, one would expect
between-clusters comparisons to give larger standard deviations, since
between-clusters comparisons have, on the average, larger differences
in scale values.

The point-biserial correlation be- 
tween standard deviations and the 
low- vs. high-similarity condition was 
—.50. A positive correlation between 
standard deviations and absolute size 
of difference did occur; the product- 
moment correlation was .20. Size of 


TABLE 3


MEAN STANDARD DEVIATIONS BETWEEN
AND WITHIN CLUSTERS


Stimulus    Cluster    Mean within clusters    Mean between clusters

[Table entries are illegible in the source.]


Note.—The decimal point and zero have been left out.
Fig. 3. A plot of standard deviations of
magnitude estimations against their corre- 
sponding geometric means. (The regres- 
sion line was fitted only to the between- 
clusters points.) 


difference correlated —.17 with simi- 
larity (point-biserial correlation). 
Thus, the raw correlation between simi- 
larity and standard deviations had been 
increased by the variation in size of 
difference. This error should be 
partialled out. The partial correla- 
tion between similarity and standard 
deviations was —.48, a very slight de- 
crease. One may conclude that the 
contribution of size of difference varia- 
tion to the relationship between stand- 
ard deviations and similarity is neg- 
ligible in the present data. 
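
The partialling step can be checked with the standard first-order partial
correlation formula. The following is a minimal sketch, assuming that this
formula is the one applied; the function and variable names are illustrative
and do not come from the original report. The three inputs are the
correlations quoted in the text.

```python
import math


def partial_correlation(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    numerator = r_xy - r_xz * r_yz
    denominator = math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
    return numerator / denominator


# Correlations reported in the text (x = similarity, y = standard
# deviation, z = absolute size of difference).
r_similarity_sd = -0.50
r_similarity_size = -0.17
r_sd_size = 0.20

partial = partial_correlation(r_similarity_sd, r_similarity_size, r_sd_size)
print(round(partial, 2))  # agrees with the reported value of -.48
```

Because the correlations involving size of difference are small, the
correction moves the coefficient only in the second decimal place.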


DISCUSSION 


The two main predictions in this study 
were that the attribute would be unidi- 
mensional and that standard deviations 
of differences would covary with qualita- 
tive similarity. These two predictions 
were confirmed. Circular triads were 
few and scaling methods assuming uni- 
dimensionality fitted well. The scaling 
methods produced results related to each 
other in the usual way (cf. Stevens, 
1966). The prediction concerning the 
standard deviations of differences was 
confirmed for all 11 stimuli.
The same prediction was confirmed
for verbal stimuli in a previous study 
(Sjöberg, 1968). Thus, the hypothesis 
of multidimensional structures in stan- 
dard deviations derived from paired com- 
parisons has now some empirical backing. 

It seems, then, that the standard devia- 
tions of differences may be used to in- 
vestigate the multidimensional structure 
that gives rise to the unidimensional at- 
tribute scaled. Possibly, they may make 
separate similarity estimations unneces- 
sary in some situations, since they may 
be expected to covary with such esti- 
mations. 

The standard deviations in magnitude 
estimation showed the expected trend but 
it was less clear than in the case of 
paired comparisons. Further empirical 
work is called for concerning this issue. 
For purposes of multidimensional scal- 
ing it would be useful to have the pos- 
sibility of using some manifest dispersion 
term in situations where a complete 
Thurstonian analysis is too costly or not 
desirable for other reasons. 

A suggestive finding of peripheral in- 
terest in this study is the tendency to a 
linear relationship between the paired-
comparisons scale and magnitude estima-
tion. With a stimulus range such as the
present one, one would expect a clearly 
curvilinear relation (see Stevens, 1966). 
Such a relation was found for category 
ratings and for the Case-V analysis of 
paired-comparisons data. The method 
of difference ratings has, so far, been 
compared three times with magnitude 
estimation and the result has in all cases 
been a linear relation. A decisive ex-
periment would, however, use a much
larger range in scale values than the 
present one.

Finally, it may be of interest to dis- 
cuss the present results in the light of 
work concerned with the dimensionality 
of facial expressions (for reviews see 
Hake, 1966; Nummenmaa, 1964). 
Schlosberg (1941) first suggested a 
two-dimensional space from evidence on 
confusion data. He suggested that this 
space was spanned by two bipolar axes, 
which he labeled Pleasantness-Unpleas-
antness and Attention-Rejection. He
was able to show that explicit ratings of
these two dimensions could predict cate- 
gorical assignments of expressions 
(Schlosberg, 1952). Later, Schlosberg 
(1954) suggested a third dimension of 
Sleep-Tension. 

Later work has mainly used some kind 
of multidimensional scaling based on 
judgments of similarity or factor analy- 
sis based on Osgood’s semantic differen- 
tial (see Osgood, 1966). On the whole, 
Schlosberg’s early conclusions have been 
confirmed, with the possible exception of 
the third dimension of Sleep-Tension. 

Intensity has most often been consid- 
ered a function of the distance from an 
origin consisting of a neutral expression. 
Actual scaling of this attribute across 
several different kinds of emotions has, 
however, rarely been attempted. The 
present results show that such scales can 
be constructed and that the multidi- 
mensionality is preserved in the variabil- 
ity of the comparative judgments. Thus, 
the multidimensional space may be shown 
to affect data in at least three different 
ways: confusions of categorical assign- 
ments, similarity or dissimilarity ratings, 
and variability of comparative judgments. 
Several interesting problems arise in this 
connection. It would, e.g., be worthwhile
to make detailed comparisons of con- 
fusion, similarity, and variability spaces. 
Different attributes could also be used 
to generate variabilities and the result- 
ing spaces could be compared. In gen- 
eral terms, the importance of a certain 
multidimensional description of facial ex- 
pressions would increase given a finding 
of invariance across widely different re- 
sponse modes and models of data analysis. 
IMAGERY AND VERBAL MEDIATION INSTRUCTIONS


IN PAIRED-ASSOCIATE LEARNING 1


JOHN C. YUILLE 2 AND ALLAN PAIVIO


McGill University 


This study investigated the effects of mediation instructions and noun 
concreteness (C) on paired-associate (PA) learning. Ss were given 
1 of 3 sets of instructions to learn pairs: to employ imaginal mediators,
verbal mediators, or simple repetition. S then practiced his technique 
with a PA list composed of either concrete or abstract nouns. Sub- 
sequent to the practice list, S was given 4 alternating learning and 
recall trials with 1 of 4 lists. Each list contained 16 pairs of 1 of the 
combinations of stimulus and response C; i.e., concrete-concrete, con- 
crete-abstract, etc. Concrete nouns, particularly on the stimulus side 
of pairs, facilitated recall of the learned list. Noun C of the practice 
list did not affect performance on the learned list. Ss instructed to 
use repetition showed poorer performance on the 1st 2 trials than Ss
instructed to use mediators, and noun C affected the repetition group
less on the 1st trial. Contrary to expectations, the imagery and
verbal mediator sets did not result in recall differences. However,
latencies of mediator discovery obtained during the practice period did 
differ as a function of mediation instructions. These results were 
considered in terms of alternative interpretations, with the suggestion 
that imaginal mediators may be more effective than verbal mediators. 


University of Western Ontario 


A number of studies have indicated 
that paired-associate (PA) learning 
may be facilitated by imaginal media-
tors (e.g., Bugelski, Kidd, & Segmen,
1968; Paivio, Yuille, & Smythe, 1966;
Reese, 1965; Wallace, Turner, & Per-
kins, 1957) as well as verbal mediators
(e.g., Kiess & Montague, 1965; Mar-
tin, Cox, & Boersma, 1965; Underwood
& Schulz, 1960). The present paper
stems from a series of studies in which
the two types of mediators have been
theoretically linked to the abstractness-
concreteness of stimulus items, as well
as to experimentally aroused mediation 
sets. These studies have been guided 
by the hypothesis that concrete nouns 
derive their meaning through associa- 


1 This study was supported by a grant
from the National Research Council of 
Canada (Grant APA-87). The paper is 
based on a thesis submitted by the first 
author to the University of Western Ontario 
in partial fulfillment of the PhD degree. 

2 Now at the University of British Colum- 
bia, Vancouver 8, British Columbia, Canada. 


tion with concrete objects and events 
as well as through association with 
other words, and thereby acquire the 
“capacity” to evoke both nonverbal 
imagery and verbal processes as asso- 
ciative reactions. Abstract terms, on 
the other hand, derive their meaning 
more consistently through intraverbal 
experience and are more likely to 
arouse verbal associative processes 
rather than imagery. The theory has 
been supported by latency data. Yuille 
and Paivio (1967) asked Ss to press 
a key when they discovered a mediator 
for a pair of nouns: The latency of 
discovery of imaginal mediators was 
slower for pairs with abstract stimulus 
members than for those with concrete 
stimuli, but this difference did not oc- 
cur under a verbal mediation set. 
Paivio and Yuille (1967) extended 
the above reasoning to PA learning, in-
structing one group to use imaginal
mediators, another verbal mediation, 
and a third rote repetition. All Ss were 
given concrete-concrete (C-C) and ab-
stract-abstract (A-A) pairs to learn.
The mediation instructions clearly facil- 
itated learning, but the expected inter- 
action of Concreteness X Mediation 
Set was not obtained. It was sug- 
gested that mediation instructions may 
not adequately control Ss’ use of 
mediators. 

The present study was, in part, an 
attempt to induce more persistent medi-
ation sets. Each S was tested individ-
ually and practiced imaginal or verbal
mediation or repetition with one list of 
pairs before learning another list of 
paired nouns. Several predictions 
were made on the basis of the theo- 
retical approach and previous findings. 
Concreteness should facilitate learning 
more on the stimulus than response 
side of pairs. Both imaginal and ver- 
bal mediation sets should produce bet- 
ter learning than rote repetition. Set 
and concreteness were expected to 
interact in such a manner that both 
imagery and verbal sets would be effec-
tive with pairs having concrete stimu-
lus members (i.e., C-C and C-A pairs)
with imagery being more potent, 
whereas the verbal set would result in 
better learning with abstract stimulus 
(A-C and A-A) pairs. In addition, 
the first phase of the study constituted 
a replication of the Yuille and Paivio 
(1967) mediation latency experiment. 


METHOD 


Subjects

Two hundred and forty introductory psy- 
chology students (128 males, 112 females) 
served as Ss. 


Paired-Associate (PA) lists 


The 128 nouns used to construct the PA 
lists were selected from a pool of 925 nouns 
for which concreteness (C) and imagery (I) 
ratings and meaningfulness (m) data (Noble, 
1952) are available (Paivio, Yuille, & Madi- 
gan, 1968). Of the words selected, 64 are 
high I and concrete and 64 are low I and 
abstract. All nouns are comparable in m
and frequency of occurrence (Thorndike & 
Lorge, 1944). Two lists of 16 pairs each, 
designated CC1 and CC2, were constructed 
by randomly pairing the 64 concrete nouns. 
Two such lists were required because some 
Ss practiced their set (see below) with C-C 
pairs and subsequently learned a C-C list. 
For the same reason, two 16-pair A-A lists, 
AA1 and AA2, were constructed by ran- 
domly pairing the 64 abstract nouns. 
Finally, 32 of the abstract and 32 of the 
concrete nouns were paired to form two lists, 
CA1 and CA2, of 16 concrete-abstract pairs
each. Additional lists were formed by
“turning over” the pairs in each of the 
original six lists so that each stimulus noun 
became a response and vice versa. In the 
case of lists CA1 and CA2 this resulted in 
two lists of abstract-concrete (A-C) pairs. 
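
The list construction described above can be sketched as follows. This is
only an illustration under stated assumptions: the noun labels are
placeholders (the actual 128 nouns are not given here), the helper names
are hypothetical, and the seed is fixed merely to make the example
repeatable. The original randomization procedure is not documented beyond
"randomly pairing."

```python
import random


def chunk(pairs, pairs_per_list=16):
    """Split a sequence of pairs into consecutive lists of equal size."""
    return [pairs[i:i + pairs_per_list]
            for i in range(0, len(pairs), pairs_per_list)]


def pair_within(nouns, rng):
    """Randomly pair the nouns of one pool with each other."""
    shuffled = list(nouns)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return chunk(list(zip(shuffled[:half], shuffled[half:])))


def pair_across(stimuli, responses, rng):
    """Randomly pair nouns from one pool (stimuli) with another (responses)."""
    stimuli, responses = list(stimuli), list(responses)
    rng.shuffle(stimuli)
    rng.shuffle(responses)
    return chunk(list(zip(stimuli, responses)))


def turn_over(pair_list):
    """Swap stimulus and response in every pair of a list."""
    return [(response, stimulus) for stimulus, response in pair_list]


rng = random.Random(0)  # fixed seed: an assumption for reproducibility
concrete = [f"C{i:02d}" for i in range(64)]  # placeholder concrete nouns
abstract = [f"A{i:02d}" for i in range(64)]  # placeholder abstract nouns

cc1, cc2 = pair_within(concrete, rng)
aa1, aa2 = pair_within(abstract, rng)
ca1, ca2 = pair_across(rng.sample(concrete, 32), rng.sample(abstract, 32), rng)

original = {"CC1": cc1, "CC2": cc2, "AA1": aa1, "AA2": aa2,
            "CA1": ca1, "CA2": ca2}
turned = {name + "_turned": turn_over(pairs)
          for name, pairs in original.items()}
all_lists = {**original, **turned}

print(len(all_lists), {len(pairs) for pairs in all_lists.values()})
```

Turning over CA1 and CA2 yields the abstract-concrete lists, so the six
original lists and their six turned-over versions give the 12 lists
mentioned in the text.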

Each pair in the resulting 12 lists and 
each stimulus term was typed in pica capitals, 
photographed, and mounted as a 35-mm. 
slide. 


Procedure 


Mediation phase.—The experimental ses-
sion began with S being presented one of
three sets of tape-recorded instructions. The 
imagery set (I set) asked S to link each 
pair of nouns with a mental image: “for 
example, the pair ‘garden-money’ might be 
linked by a picture of a garden with money 
in it.” The verbal set (V set) instructed S
to link each pair with a word or phrase “for 
example, with the pair ‘garden-money,’ you 
might use ‘garden vegetables earn money.’ ” 
The third set of instructions (R set) told S 
to “repeat each pair of words [out loud] a 
number of times.” Eighty Ss received each 
set. 
After viewing 3 pairs to familiarize him 
with the procedure, S was presented, one at a 
time, the 16 pairs of one of the CC or AA 
lists (120 Ss were presented a CC list, 120 
an AA list). Presentation was by a slide 
projector on a rear projection screen. The 
Ss given the I or V sets were asked to 
press a switch when they found the appro- 
priate mediator. This stopped a clock and 
allowed E to record the latency of discovery 
of each mediator. After pressing the switch,
S gave a brief description of the mediator 
he used. Each pair was projected on the 
screen for 16 sec, and the interpair interval 
was 5 sec. The Ss given the R set repeated
each pair while it was on screen.

PA learning phase—Upon completion of 
the mediation phase, Ss were required to 
learn a different list (one of the CC, CA,
AC, or AA lists). This phase consisted of 
four alternating learning and recall trials. 
During a learning trial, each pair appeared 
on the screen for 4 sec. with an interpair in- 
terval of 1 sec. On recall trials, the stimu- 
lus member of each pair was presented for 
7 sec., with an interstimulus interval of 1 
sec., and Ss responded orally. Sixty Ss
learned each type of list. 

All combinations of mediation set, type of 
practice pair, and type of learned pair were 
included in a factorial design comprising 24 
cells with 10 Ss in each cell. The design in- 
volved 240 Ss since no repeated measures 
were involved with these factors. 
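
The cell counts of the between-subjects design can be verified by
enumerating the factor combinations. This is a minimal sketch; the factor
labels follow the text, while the variable names are illustrative.

```python
from itertools import product

mediation_sets = ["I", "V", "R"]            # imagery, verbal, repetition
practice_pairs = ["CC", "AA"]               # type of practice list
learned_pairs = ["CC", "CA", "AC", "AA"]    # type of learned list
ss_per_cell = 10

cells = list(product(mediation_sets, practice_pairs, learned_pairs))
total_ss = len(cells) * ss_per_cell

# Marginal counts implied by the design.
ss_per_set = total_ss // len(mediation_sets)
ss_per_practice_type = total_ss // len(practice_pairs)
ss_per_learned_type = total_ss // len(learned_pairs)

print(len(cells), total_ss, ss_per_set, ss_per_practice_type, ss_per_learned_type)
```

The marginal counts match the figures given in the Procedure: 80 Ss per
instructional set, 120 per type of practice list, and 60 per type of
learned list.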


Postexperimental Questionnaire 


After completion of the PA learning 
phase, S was asked to complete a two-page 
questionnaire. The first page contained 
two 7-point scales. On the first scale S 
indicated how easy or difficult he found the 
instructed mediation technique. The defining 
ends of the scale were 1, Very Difficult, to 
7, Very Easy. The S indicated how often 
he used the instructed technique on the sec- 
ond scale, defined by limits 1 (“none of the 
pairs”) to 7 (“all the pairs”). The 16 pairs 
S had learned were typed in a column on 
the left-hand margin of the second page of 
the questionnaire. Instructions asked S to 
write a brief description of how he linked 
each pair during learning. 


RESULTS


The data for the subgroups within 
each cell that received the “turned 
over” and alternate lists (e.g., Lists 
CC1 or CC2) were collapsed in all the 
analyses reported below. 


Mediation Phase 


The mean latencies of mediator dis- 
covery under the I set were 6.92 and 
10.11 sec. for C-C and A-A pairs, re- 
spectively; for the V set groups the 
means were 7.47 and 8.34 sec. An 
analysis of variance of these data re- 
vealed a significant main effect of type 
of pair, F (1, 156) = 26.76, p < .01,
with mediators being discovered more 
rapidly for C-C than for A-A pairs. 
The main effect was qualified, however, 


by the significant interaction of Pair 
Type X Set, F (1, 156) = 8.87, p< 
.01. Comparisons by t tests indicated
that Ss instructed to find imaginal 
mediators required significantly longer 
for A-A than C-C pairs, t (40) = 5.76, 
p < .01, whereas the pair attribute did 
not affect Ss given the V set, t (40)
= 1.57, p < .10. This interaction sup- 
ports the prediction that variation of 
the concreteness of pairs will affect the 
discovery of imaginal but not verbal 
mediators, and represents a replica- 
tion of the similar finding reported by 
Yuille and Paivio (1967). 
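
The form of the Pair Type X Set interaction can be seen by differencing the
four reported cell means. The sketch below only restates the reported
means; it is not a reanalysis, and the names are illustrative.

```python
# Mean latency of mediator discovery (sec.), as reported in the text.
mean_latency = {
    ("I", "C-C"): 6.92,
    ("I", "A-A"): 10.11,
    ("V", "C-C"): 7.47,
    ("V", "A-A"): 8.34,
}


def abstractness_cost(set_name):
    """Extra time needed for A-A pairs relative to C-C pairs under one set."""
    return mean_latency[(set_name, "A-A")] - mean_latency[(set_name, "C-C")]


cost_imagery = abstractness_cost("I")
cost_verbal = abstractness_cost("V")
interaction_contrast = cost_imagery - cost_verbal

print(round(cost_imagery, 2), round(cost_verbal, 2), round(interaction_contrast, 2))
```

The cost of abstract pairs is roughly 3.2 sec. under the imagery set and
under 1 sec. under the verbal set, which is the pattern the interaction
reflects.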


PA Learning Phase 


An analysis of variance of the learn- 
ing scores involved two levels each of 
stimulus and response C, two types 
of practice pairs, three instructional 
sets, and four trials. Table 1 presents 
the mean total recall scores for the ex- 
perimental variables, collapsing over 
trials. 

Significant main effects of stimulus 
C, F (1, 216) = 222.47, p < .01, and 
response C, F (1, 216) = 106.88, p <
.01, indicate that concreteness of either
member of a pair facilitated recall over 
all conditions. The larger F ratio ob- 
tained for the stimulus variable is con- 
sistent with past findings (e.g., Paivio 


TABLE 1


MEAN TOTAL RECALL OVER FOUR TRIALS, AS
A FUNCTION OF TYPE OF PRACTICE PAIR,
SET, AND TYPE OF LEARNED PAIR


Type of                 Type of Learned Pair
Practice    Set      CC      CA      AC      AA

CC          I       58.9    56.8    53.9    25.2
            V       58.5    51.8    48.4    27.0
            R       47.1    47.6    45.3    27.9
AA          I       59.3    54.7    50.9    26.7
            V       58.4    53.1    44.2    28.3
            R       48.6    51.1    40.9    21.9


Note.—The standard deviations range 2.8-10.?
(N = 240).
et al., 1966) and with the imagery
hypothesis. The interaction of the 
Stimulus Variables X Response Vari- 
ables was also significant, F (1, 216) = 
64.92, p < .01. This interaction re- 
flects the particularly poor recall for 
Ss learning A-A pairs, and can be in- 
terpreted to mean that variation in 
response C was more effective when the 
stimulus was abstract, which differs 
from interactions obtained with mixed 
lists (e.g., Paivio et al., 1966) where 
variation of response C has had a 
greater effect when the stimulus is 
concrete.

The other significant main effects in- 
volved the set variable, F (2, 216) = 
13.09, p < .01, and trials, F (3, 648) 
= 1150.92, p < .01. The Ss given the 
I and V sets had higher recall for all 
types of pairs than Ss using repetition, 
and recall generally improved as a func- 
tion of trials. 

The remainder of the significant ef- 
fects involve interactions with Trials. 
The Stimulus X Trials, F (3, 648) =
21.59, p < .01, Response X Trials, F 
(3, 648) = 7.14, p < .01, and Stimulus 
X Response X Trials, F (3, 648) = 
12.27, p <.01, interactions indicate 
that recall differences among the four 
types of pairs decreased as a function 
of Trials. Recall differed for all four 
types of pairs on the first trial while on 
the last trial only A-A pair recall dif- 
fered from the others. The set variable 
also interacted with trials, F (6, 648) 
= 23.54, p < .01. The Ss given the 
I and V sets had superior recall when 
compared to Ss given the R set on 
Trials 1 and 2, but this difference had 
disappeared during the last two trials. 
The triple interactions of set and trials 
with the stimulus and response vari-
ables, respectively, were also signifi-
cant, F (6, 648) = 5.94, p < .01, and F
(6, 648) = 3.16, p < .01. The su-
periority of concrete nouns as stim-
uli and as responses was greater on the
function of mediation set and stimulus con-
creteness.


first trial, but not thereafter, for Ss 
instructed to use imagery or verbal 
mediators, than for Ss given the R set. 
The effect on the first trial is illustrated 
in Fig. 1 for the stimulus variable.3
To summarize these results, the 
strongest effects obtained, in terms of 
persistence over trials, were those at- 
tributable to variation of stimulus and 
response C, with the former having a 
greater effect than the latter. Lower 
recall for Ss using the R set, as com- 
pared to the I and V set groups, 
occurred for the first two trials but 
disappeared on the third trial. The
expected interactions of stimulus C, 
response C, and set were found only on 
the first trial. The recall of Ss given 
the R set was less affected by varia- 


3 The four-way interaction of Type of
Practice Pair X Stimulus X Response X
Trials was also significant, F (3, 648) =
2.72, p < .05. Analyses at each trial
yielded no significant effects involving type
of practice pair, however, and the four-way 
interaction is generally uninterpretable. Ac- 
cordingly it will be ignored in the subse- 
quent discussion. 
tion of the stimulus and response at-
tributes than was the recall of those
asked to use images or verbal media- 
tors. 


Postexperimental Reports 


Two judges classified each S’s medi- 
ator descriptions as imaginal, verbal, 
some other type of mediator, repetition, 
or nothing. The total number of pairs 
for which S followed his instructed 
set was computed from this classifica- 
tion and an analysis of variance per- 
formed on these scores. This analysis 
revealed that Ss given the I and V sets 
reported following their respective sets 
for more pairs than did Ss given the R 
set, F (2, 216) = 43.52, p < .01. The
Ss also reported following their set 
more often in the case of pairs with 
concrete rather than abstract stimuli, F 
(1, 216) = 4.76, p < .05, or responses, 
F (1, 216) = 3.89, p < .05. The anal- 
ysis of the 7-point “none of the 
pairs”—“all the pairs” scales confirmed 
this pattern. 


DISCUSSION


The interactive effect of set and type 
of pair on the latency of mediator dis- 
covery during the mediation phase of the 
study would lead one to expect poorer 
recall of abstract stimulus pairs by Ss 
given the imagery learning set as com- 
pared to those given the verbal set. The 
predicted difference was not obtained. 
These results can be interpreted in terms 
of the distinction between the discovery 
and utilization of mediators (Schulz & 
Lovelace, 1964). The mediation latency 
and PA learning results of the present 
study suggest that both imaginal and 
verbal mediators can be easily discovered 
and utilized with concrete stimuli, hence 
the high recall for C-C and C-A pairs. 
With abstract stimulus pairs, however, 
imaginal mediators are apparently diffi- 
cult to discover (according to the latency 
data), while the more easily discovered 
verbal mediators may be difficult to utilize. 
Assuming this to be correct, one reason
for such difficulty might be that quickly 
discovered verbal mediators are superficial 
and ineffective. If verbal mediation re- 
quires representation of an imaginal na- 
ture to be effective, then Ss employing 
verbal mediators with pairs containing 
abstract stimuli would not be expected to 
have higher recall than Ss employing 
images. The ineffectiveness of the two 
types of mediators with abstract material 
is for two different reasons: Imaginal 
mediators are difficult to discover, while 
verbal mediators, although easily dis- 
covered, are relatively ineffective in the 
absence of imagery. Concrete stimuli, on 
the other hand, may evoke imagery so 
readily that images can be used as medi- 
ators by both I and V set groups. Alter- 
natively, of course, it can be argued that 
Ss in both the I and V set groups used 
only verbal mediators. However, no dif- 
ferences as a function of noun C would 
then be expected, nor would this explana- 
tion predict the interactive effects of 
concreteness and set on mediation latency 
(cf. Yuille & Paivio, 1967). 

Variation of C on both the stimulus
and the response side of pairs accounted 
for the largest proportion of the variance 
in recall scores and, consistent with pre- 
vious findings, stimulus C had the more 
potent effect. This finding, too, supports 
an interpretation in terms of stimulus- 
evoked imagery, particularly since any 
differential availability of verbal associ-
ates was controlled by equating concrete
and abstract items on m. It should be 
noted in particular how poorly A-A pairs 
were learned despite their m being equiv- 
alent to that of other pairs. The precise 
explanation of the mechanisms involved in 
all these effects obviously remains in 
doubt, but, considered in conjunction with 
previous findings, they indicate that 
imagery can no longer be ignored as 
a factor in certain verbal learning 
situations. 


REFERENCES 


Bugelski, B. R., Kidd, E., & Segmen, J.
Image as a mediator in one-trial paired-
associate learning. Journal of Experi-
mental Psychology, 1968, 76, 69-73.

Kiess, H. O., & Montague, W. E. Natural
language mediators in paired associate
learning. Psychonomic Science, 1965, 3,
549-550.

Martin, C. L., Cox, D. L., & Boersma, F. J.
The role of associative strategies in the
acquisition of P-A material: An alterna-
tive approach to meaningfulness. Psycho-
nomic Science, 1965, 3, 463-464.

Noble, C. E. An analysis of meaning. Psy-
chological Review, 1952, 59, 421-430.

Paivio, A., & Yuille, J. C. Mediation in-
structions and word attributes in paired-
associate learning. Psychonomic Science,
1967, 8, 65-66.

Paivio, A., Yuille, J. C., & Madigan, S.
Concreteness, imagery, and meaningful-
ness values for 925 nouns. Journal of
Experimental Psychology, 1968, 76(1,
Pt. 2).

Paivio, A., Yuille, J. C., & Smythe, P.
Stimulus and response abstractness, imag-
ery, and meaningfulness, and reported
mediators in paired-associate learning.
Canadian Journal of Psychology, 1966,
20, 362-377.

Reese, H. W. Imagery in paired-associate
learning in children. Journal of Experi-
mental Child Psychology, 1965, 2, 290-
296.

Schulz, R. W., & Lovelace, E. A. Mean-
ingfulness and the associative phase of
paired-associate learning: A methodologi-
cal consideration. Psychonomic Science,
1964, 1, 37-38.

Thorndike, E. L., & Lorge, I. The teach-
er's word book of 30,000 words. New
York: Teachers College, Columbia Uni-
versity, Bureau of Publications, 1944.

Underwood, B. J., & Schulz, R. W. Mean-
ingfulness and verbal learning. Chicago:
Lippincott, 1960.

Wallace, W. H., Turner, S. H., & Per-
kins, C. C. Preliminary studies of human
information storage. Signal Corps Pro-
ject No. 1320, 1957, University of Penn-
sylvania, Institute for Cooperative Re-
search.

Yuille, J. C., & Paivio, A. Latency of
imaginal and verbal mediators as a func-
tion of stimulus-response concreteness and
imagery. Journal of Experimental Psy-
chology, 1967, 75, 540-544.


(Received October 26, 1967) 


Journal of Experimental Psychology 
1968, Vol. 78, No. 3, 442-455 


LONG-TERM PROBABILITY LEARNING WITH A 


RANDOM SCHEDULE OF REINFORCEMENT * 
10 Ss served 25 days in a standard 2-choice probability learning task.
There were 350 trials each day on a random 50:50 noncontingent 
schedule. The sequential dependencies were intensively analyzed to 
test the linear and pattern models of stimulus sampling theory, and an 
averaging model based on a principle of information integration. The 
linear and pattern models did not do very well. The averaging model 
had better success although it too was unable to account for the exten- 


sive alternation responding that developed. 


Long-term experiments in probabil- 
ity learning have both empirical and 
theoretical interest. On the empirical 
side, such work relates to the question 
of whether steady-state behavior is 
essentially the same as in the usual 
short-term experiments. Various 
workers have suggested, e.g., that prob- 
ability matching may be a short-term 
phenomenon. Long-term data are also 
required for single-S analyses which 
have certain desirable properties. 
Thus, the existence and nature of indi- 
vidual response strategies are difficult 
to assess without such data. 

On the theoretical side, consider- 
able disagreement has arisen over the 
interpretation of existing data. Much 
of this work has been done to test stim- 
ulus sampling theory, and Estes 
(1964) considers that the evidence, 
taken as a whole, provides firm sup-
port to the theory. There are various
discrepancies between the data and 
the theory, however, and other investi- 
gators have felt that the nature of 
these discrepancies makes the theory 
fundamentally untenable (e.g., Ander- 
son, 1964a). 

Long-term data are important for 
the resolution of this question. In the 
first place, single-S analyses avoid cer- 
tain technical problems that arise in 
testing models against group data. In 
addition, the above discrepancies may 
represent short-term contamination by 
preexperimental response tendencies. 
For instance, the negative recency ef- 
fect (Jarvik, 1951) is sharply opposed 
to theoretical prediction. However, 
Anderson (1960) and Edwards 
(1961) found that the effect tended to 
adapt out over trials. If the effect is 
transient, it may not be a fundamental 
difficulty for the theory. 

In support of this view, Estes 
(1964) cites the experiment of Fried- 
man, Burke, Cole, Keller, Millward, 
and Estes (1964) which has special 
relevance here. In that experiment, 
80 Ss received three sessions in a non- 
contingent two-choice task. In the 
first two sessions the event frequencies 




shifted every 48 trials over a range of 
values. In most of the third session 
the event frequencies were constant at 
80:20, and the data from this session 
were intensively analyzed. These 
analyses yielded impressive support for 
the theoretical models: probability 
matching was obtained, the fits to the 
sequential dependencies were good, and 
no negative recency effect was visible. 
Friedman et al. suggested that this suc- 
cess resulted from the adaptation of 
preexperimental guessing habits during 
the first two sessions leaving the be- 
havior under the control of the rein- 
forcing events. 

As impressive as this experiment is, 
a serious objection has been raised by 
Anderson (1964a), who pointed out 
that the shifting schedule of the first 
two sessions meant that each event re- 
curred on successive trials more than 
chance. Shifting the event frequen- 
cies every 48 trials induced moderately 
sizable sequential dependence within 
the event sequence, even though the 
events were completely random within 
each 48-trial block. Previous events 
could thus acquire discriminative con- 
trol over the response. According to 
previous data, this would produce a 
repetition response tendency; i.e., a
tendency to predict next the event that 
occurred just previously. Since such 
behavior would be particularly amen- 
able to the given model, its success 
may have only reflected response ten- 
dencies that were trained in by the 
shifting schedule during the two initial 
sessions. That transfer effects can be
extensive and long lasting has been 
demonstrated (Anderson, 1960; Witte, 
1964). Accordingly, other schedules 
might train in different response ten- 
dencies that would require different 
models. 
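
Anderson's objection can be illustrated numerically. The sketch below
assumes, purely for illustration, three equally long blocks with event
probabilities of .2, .5, and .8; the actual values used by Friedman et al.
are not given here. It also ignores the few trial pairs that straddle a
block boundary. Within a block the events are independent, so an event
repeats on the next trial with probability p^2 + (1 - p)^2.

```python
def repetition_probability(p):
    """Probability that two successive independent events are the same."""
    return p * p + (1 - p) * (1 - p)


block_probs = [0.2, 0.5, 0.8]  # hypothetical per-block event probabilities

# Repetition rate expected if every trial used the overall event frequency.
overall_p = sum(block_probs) / len(block_probs)
chance_repetition = repetition_probability(overall_p)

# Repetition rate actually produced by the shifting schedule.
actual_repetition = sum(repetition_probability(p) for p in block_probs) / len(block_probs)

print(round(chance_repetition, 2), round(actual_repetition, 2))
```

Under these assumed values the overall event frequency is 50:50, yet events
repeat on about 62% of successive trials rather than 50%, which is the kind
of sequential dependence that could train in a repetition tendency.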

The present experiment was de-
signed with these considerations in
mind. The Ss were run 25 sessions


of 350 daily trials under a completely 
random 50:50 schedule. Although 
this schedule did not allow a test of 
probability matching, it was considered 
necessary in order to avoid any pos- 
sible discriminative element such as 
in Friedman et al. (1964). For this 
schedule, the sequential dependencies 
provide a powerful test of the theo- 
retical predictions. 


Theory 


Linear and pattern models —These 
are the two stimulus sampling models 
treated by Friedman et al. (1964). 
Both are simple conditioning models 
that had been developed in a framework 
of reinforcement theory. 

Since a 50:50 schedule was used, the 
models had to be tested against the se- 
quential dependencies. For the linear 
model, the theoretical development was 
made by Anderson (1959). Similar 
derivations for the pattern model are 
summarized in Atkinson and Estes 
(1963). These models are well 
known and need not be discussed 
further. 

Foulkes procedure—This approach 
was introduced by Foulkes (1959) and 
applied to experimental data by Feld- 
man and Hanna (1966). Basically, 
it is a procedure of searching for pat- 
terns of events that correlate with dif- 
ferent response probabilities. At one 
level it can be viewed simply as a 
statistical tool to search for significant 
patterns. However, if it is assumed 
that Ss actively discriminate the sig- 
nificant patterns, these may be con- 
sidered as perceptual states, and the 
procedure itself a finite-state model. 
As such, it is a model for distinguish- 
ing significant states and does not 
predict response probability, although 
additional assumptions may be added 
for this purpose. The operation of the 
model can most easily be described in 


use, 
Weighted average model—This 
model arises from a conception of the 
organism as an information integrator. 
The stimulus events are not considered 
as reinforcers, but as pieces of infor- 
mation that bear on the judgments and 
choices. At a quantitative level, it is 
assumed that the integration is per- 
formed by an averaging process. Such 
an averaging model is basically simple 
and has been discussed in previous 
work (Anderson, 1962, 1964b, 1964c). 
Applied to the present task, it may be 
written as 


$$p_n = w_0 I_0 + \sum_{i=1}^{k} w_i s_{n-i} \qquad [1]$$


where the $w_i$ are weights whose sum is 
unity. Here $p_n$ is the response prob- 
ability on Trial $n$, and $w_i$ and $s_{n-i}$ are 
the weight and value of the event on 
Trial $n-i$. For the two-choice task, 
the values of the two events would 
ordinarily be taken as 0 and 1. 
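
Equation 1 can be illustrated with a minimal sketch. The parameter values are hypothetical and chosen only for illustration, and the coding of Event 1 as 1 and Event 2 as 0 is an assumption consistent with the response probability referring to predictions of Event 1.

```python
def predicted_probability(w0, i0, weights, recent_events):
    """Equation 1: p_n = w0*I0 + sum_i w_i * s_{n-i}.

    weights[i-1] is w_i and recent_events[i-1] is s_{n-i}, the value
    (0 or 1) of the event i trials back. All weights, including w0,
    must sum to unity.
    """
    if len(weights) != len(recent_events):
        raise ValueError("need one event value per weight")
    if abs(w0 + sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must sum to unity")
    return w0 * i0 + sum(w * s for w, s in zip(weights, recent_events))


# Hypothetical parameter values (not estimates from the data).
W0, I0 = 0.4, 0.5
WEIGHTS = [0.3, 0.15, 0.1, 0.05]

# Response probability after four Event 1s and after four Event 2s.
p_run_of_ones = predicted_probability(W0, I0, WEIGHTS, [1, 1, 1, 1])
p_run_of_twos = predicted_probability(W0, I0, WEIGHTS, [0, 0, 0, 0])
```

With these illustrative values the two probabilities fall short of 1 and exceed 0, respectively, because part of the total weight is carried by the constant term.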

In previous work with very short 
stimulus sequences, $I_0$ has been con- 
sidered as the strength of an initial or 
neutral response. For the present task 
$I_0$ may be considered analogous to an 
expectation based on the sequence as 
a whole. Accordingly, it will here be 
considered constant although, in gen- 
eral, it would change with changes in 
the schedule. The presence of the $I_0$ 
term has some important theoretical 
implications. In particular, if $w_0 \neq 0$, 
then the asymptote of response over a 
subsequence or run of the same event 
is less than unity. 

To apply the model, an assumption 
of path independence is also made. 
That is, it is assumed that $w_i$, the 
weight associated with the ith previous 
trial, is constant, independent of what 
events occurred on adjacent trials. A 
simple and powerful test of goodness 
of fit is then obtained on the following 
basis (Anderson, 1964c). Since two 
events may occur each trial, there are 
$2^k$ possible subsequences for the $k$ pre- 
vious trials, and these sequences may 
be considered as constituting a $2^k$ fac- 
torial design. Equation 1 then implies 
that, in analysis of variance terms, the 
interactions of this design are zero. 
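
The zero-interaction implication can be checked numerically. The sketch below builds the full factorial table from an additive model of the form of Equation 1 and computes factorial contrasts; the weights are hypothetical, and events are coded 0 and 1 with position 0 as the most recent trial (both are assumptions made for illustration).

```python
from itertools import product


def model_table(w0, i0, weights):
    """Response probability for every pattern of k previous events (coded 0/1)."""
    return {
        cell: w0 * i0 + sum(w * x for w, x in zip(weights, cell))
        for cell in product((0, 1), repeat=len(weights))
    }


def contrast(table, factors):
    """Factorial contrast for the given factor positions.

    Each cell is signed by the product of (+1 if the event is coded 1,
    else -1) over the chosen factors. One factor gives a main effect;
    two or more give an interaction. The signed sum is divided by half
    the number of cells, so a main effect equals a difference of means.
    """
    total = 0.0
    for cell, p in table.items():
        sign = 1
        for f in factors:
            sign *= 1 if cell[f] == 1 else -1
        total += sign * p
    return total / (len(table) / 2)


# Hypothetical weights: w0 = .4, w1..w4 = .3, .15, .1, .05.
table = model_table(0.4, 0.5, [0.3, 0.15, 0.1, 0.05])
main_effect_trial_1 = contrast(table, [0])        # recovers w1
interaction_12 = contrast(table, [0, 1])          # zero under the model
interaction_123 = contrast(table, [0, 1, 2])      # zero under the model
```

Under any additive model of this form each main effect returns the corresponding weight and every interaction contrast vanishes, which is what makes the interactions a test of fit.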

It is of special interest to note that, 
at a purely formal level, the averaging 
model includes the linear and pattern 
models as special cases. The other 
models may be expressed in a form 
analogous to Equation 1 (Anderson, 
1959). The $w_i$ then become functions 
of a single parameter, $\theta$, with $w_0$ 
asymptotically zero. As a conse- 
quence, the above test of fit applies to 
all these models at once. 


METHOD 


The Ss predicted which of two events 
would occur on each of 350 daily trials. 
Each event occurred equally often on a 
completely random schedule. 

Apparatus—The Ss sat in comfortable 
chairs separated by dividers at one end of a 
carpeted, soundproof room. Twelve feet 
distant on the opposite wall a large white 
signal lamp was centered at S’s eye level. 
Two blue event lamps were located slightly 
below and symmetrically about the white 
signal lamp. Large armrests on each chair 
contained silent, double-pole, self-returning, 
knife switches. 

An IBM 526 Summary Punch was used 
to program the stimuli and record the 
responses. Time intervals were programmed 
by electronic timers. The apparatus and E 
were located in an adjoining room and one- 
way glass allowed visual monitoring of Ss. 

Stimuli—Stimulus sequences were gen- 
erated by computer and punched on cards 
for use in the control-recording apparatus. 
A different random sequence of 350 trials 
was used for each session for each squad. 
On each trial one or the other event oc- 
curred and each had constant probability of 
1/2 on each trial. 

At the beginning of a trial, the white sig- 
nal lamp was lit for 1 sec., and the signal 
tone was sounded at the same time. After 
a 2-sec. interval, one of the blue reinforcing 
event lamps came on for 1 sec. There was 
then an intertrial interval of 23 sec. after 
which the signal lamp and tone came on 
again, signalling S to make his prediction 
for the next trial. 

Subjects and instructions—The instruc- 
tions were mainly similar to those used pre- 
viously (Estes & Straughan, 1954; Fried- 
man et al., 1964). The Ss were told to 
push their response switch to the right 
or left to indicate their prediction of which 
blue event lamp would flash. They were 
told to respond to each trial even if it 
seemed like a pure guess, and that it was 
important to keep their attention on the task, 
yet they should not try too hard but rather 
let their reactions take care of themselves. 
These instructions were given on each of the 
first two sessions, and on the initial session 
of each subsequent week. Talking during 
the session was not allowed. 

The Ss were run in squads 5 days a week 
for 5 wk. Four full squads of 4 Ss were 
started in the experiment. Dropping during 
the first week was mildly encouraged and 2 
Ss did so, after two and four sessions, re- 
spectively. In addition, 1 S dropped after 
two full weeks, 2 after three full weeks, and 
1 dropped in the fourth week. Of the re- 
maining 10 Ss, 7 had perfect attendance, 
2 missed one session each, and 1 (S10 be- 
low) missed five sessions evenly distributed 
over weeks. 

The data for the 4 Ss who did not serve all 
5 wk. were analyzed by the weighted aver- 
age model on a day-within-week basis. On 
the whole, their behavior appeared to be of 
the same nature as that of the other 10 Ss. 
For convenience, therefore, only these 10 Ss 
are reported here. 

The Ss were recruited from undergraduate 
psychology courses in the summer of 1964 
and all were under 21 yr. of age. They were 
paid $2 per session plus a bonus of $10 
for perfect attendance. 


RESULTS 


All the data analyses will deal with 
sequential dependencies that show how 
short sequences of events affect re- 
sponding. The first 50 daily trials 
were considered as warm-up, and all 
dependencies were tabulated over the 
last 300 trials of a session. 

The results reported here are on a 
weekly basis, pooled over 1,500 trials 
each week (except for three Ss as 
noted above). With a 50:50 schedule, 
a sequence of four specified events oc- 


curs around 94 times in 1,500 trials. 
Thus, each entry of Table 2 below is 
based on approximately 94 cases. 
Similarly, the rightmost point of each 
run curve of Fig. 1 and 2 is based on 
approximately 47 instances; each pre- 
ceding point on each curve is based on 
twice as many instances. 
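
The approximate cell sizes quoted above follow from simple arithmetic, sketched here. The function name is hypothetical, and the calculation ignores edge effects at the start of each session.

```python
def expected_occurrences(n_trials, pattern_length, p_event=0.5):
    """Expected number of times one specified event pattern occurs.

    Assumes independent trials with a constant event probability, as in
    the completely random 50:50 schedule.
    """
    return n_trials * p_event ** pattern_length


per_four_event_pattern = expected_occurrences(1500, 4)   # 93.75
per_five_event_pattern = expected_occurrences(1500, 5)   # 46.875
```

Each additional specified event halves the expected count, which is why each preceding point on a run curve rests on twice as many instances.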
Homogeneous runs of events—A 
quick view of the general nature of the 
behavior can be obtained from the run 
curves of Fig. 1 and 2. Each curve 
Fig. 1. Weekly run curves for Ss 1-5. 
(The ordinate is the proportion of $E_1$ predic- 
tions for the $E_1$ runs in the left panel, and of 
$E_2$ predictions for the $E_2$ runs in the right 
panel. The numeral by each curve is the 
week number.) 

Fig. 2. Weekly run curves for Ss 6-10. 
(The ordinate is the proportion of $E_1$ predic- 
tions for the $E_1$ runs in the left panel, and of 
$E_2$ predictions for the $E_2$ runs in the right 
panel. The numeral by each curve is the 
week number.) 


plots the proportion of predictions of 
one event as a function of the number 
of previous occurrences of that same 
event. There is one panel for each S, 
and within each panel the left and right 
sides show the run curves for the left 
($E_1$) and right ($E_2$) events. The 
five numbered curves on either side 
give the data for Weeks 1-5, as 
labelled. Except for position prefer- 
ence, the curves for the left and right 
events should be essentially the same. 
It is clear from Fig. 1 and 2 that 
response probability changes consider- 
ably over the course of a run for most 
Ss. The pattern of change, however, 
varies markedly from one S to another. 
Moreover, some individual Ss change 
considerably from one week to the 
next. 

For S6 and S7, the run curves are 
what would be expected on the basis 
of the various conditioning models that 
have been applied to probability learn- 
ing: exponential growth curves with 
an asymptote of unity. The S7 shows 
some initial negative recency that dis- 
appears in the later weeks. For S6, 
there is an upward shift in the first 
point of the run curve that would be 
interpreted as an increase in the condi- 
tioning parameter or learning rate. 

None of the other eight Ss shows 
this simple type of run curve. Indeed, 
S3, S5, and S9 evidently show some 
form of alternation behavior since the 
first point on every run curve is below 
the chance level of 50%. Since this 
occurs already in the initial week, it 
presumably represents a natural re- 
sponse tendency. The alternation 
ramifies over successive weeks for S5 
and S9. Indeed, for S9 the positive 
recency of the early weeks gradually 
but completely vanishes. 

The remaining five Ss show a variety 
of response patterns. The S10 has a 
strong position preference (for E1), 
while S1 appears to be developing an 
alternation tendency in Weeks 4 and 
5. For S4 and S8, the run curves are 
essentially horizontal or show a slight 
downward slope. 

This preview of the data makes clear 
that the simple probability learning 
schedules used in this experiment 
evoke a marked variety of behaviors. 
As a consequence, it is necessary to 
present the empirical results in more 
detail than usual before proceeding to 
the theoretical analysis. 
TABLE 1 


PROPORTION OF ALTERNATION RESPONSES FOR EACH WEEK AND S FOR ALTERNATION 
SEQUENCES OF DIFFERENT LENGTHS 


Event Sequence Event Sequence 
S Week S Week 
11 | 112 | 1121 | 11212 | 112121 11 | 112 | 1121 | 11212 | 112121 
1 1 34 | 24 | 37 | 50 36 6 1 20 | 44 | 42 
2 14 | 17 | 28 | 45 56 2 30 | 54 | 45 i 36 
3 33 | 26 | 38 | 66 68 3 21 | 36 | 32 | 37 | 40 
4 37 | 42 | 40 | 70 79 4 26 | 31 | 36 | 46 | 46 
5 54 | 48 | 39 | 75 77 5 08 | 08 | 13 | 20 | 11 
2 1 50 | 44 | 48 | 44 40 7 1 13 | 30 | 35 | 63 
2 39 | 38 | 48 | 28 46 2 16 | 44 | 28 | 53 32 
3 37 36 47 48 56 3 15 | 46 | 23 | 41 37 
4 27 | 47 | 44 | 57 50 4 14 | 40 | 35 | 58 | 60 
5 44 | 52 | 48 | 58 52 5 07 | 34 | 28 | 52 | 54 
3 1 49 | 68 | 71 | 61 56 8 1 45 | 44 | 50 | 52 | 51 
2 48 | 65 | 58 | 60 54 2 42 | 45 | 44 | 51 | 49 
3 38 | 56 | 53 | 59 42 3 30 | 32 | 36 | 30 | 32 
4 48 64 60 | 66 60 4 20 | 24 | 26 | 23 22 
5 46 | 64 | 58 | 60 62 5 24 | 20 | 17 | 16 | 13 
4 1 49 | 56 | 62 | 54 56 9 1 36 | 70 | 78 | 80 | 94 
2 49 | 53 | 44 | 50 70 2 52 | 72 | 82 | 88 | 82 
3 48 | 45 | 44 | 42 52 3 76 | 89 | 90 | 98 | 96 
4 46 | 38 | 50 | 48 58 4 88 | 94 | 99 | 97 | 99 
5 38 | 34 | 39 | 38 62 5 92 | 96 | 96 | 99 | 98 
5 1 74 | 66 | 78 | 60 60 10 1 26 | 41 | 38 | 33 | 54 
2 55 | 64 | 64 | 58 76 2 44 | 50 | 42 | 50 | 47 
3 47 | 58 | 62 | 58 52 3 42 | 44 | 46 | 52 | 46 
4 56 | 50 | 68 | 60 68 4 46 | 50 | 43 | 48 | 55 
5 70 | 60 | 62 | 59 58 5 50 | 46 | 49 | 49 | 37 


Note.—Entries are proportions of alternation responses; decimal point omitted. 


Alternation curves—The need to 
consider these curves is shown below 
in the main analysis of the four-event 
dependencies. Since these curves are 
analogous to the run curves, they are 
treated briefly at this point. 

Each entry in Table 1 is a proportion 
of alternation responses, conditional on 
the occurrence of the event sequence at 
the head of the column. An alternation 
response (Anderson, 1960) is a pre- 
diction of the stimulus event other than 
the just previous one: predictions of 
Events 2, 1, 2, 1, and 2, in the five 
successive data columns. The entry in 
the rightmost column is pooled over the 
event sequences 112121 and 221212 of 
which there were about 47 total in- 


stances each week. All entries are un- 
weighted means over the two positions 
and so may differ slightly from the cor- 
responding data of Table 2. 

The main feature of Table 1 is the 
prevalence of alternation responding. 
Since alternation responses are tabu- 
lated, an entry greater than .50 is in- 
dicative of alternation responding. This 
behavior may occur primarily in the 
initial weeks alone, as with S6, in the 
later weeks alone, as with S1, or over 
the duration of the experiment, as with 
S3, S5, S7, and S9. Only S8 and S10 
fail to show alternation. Closer inspec- 
tion of the data shows that almost 
every S has a distinct pattern of alter- 
nation responding. The alternation 
TABLE 2 


FOURTH-ORDER STIMULUS DEPENDENCIES FOR EACH WEEK FOR EACH S 


[Table body (columns: Four Preceding Stimulus Events) not recovered.] 


Note.—Entries are probabilities of predicting Event 1, conditional on occurrence of specified four preceding 
events; decimal points are omitted. M = averages over Weeks 2-5. 

data thus corroborate the indications of 
individual response strategies already 
seen in the run curves. 


Four-Event Dependencies 


Basic data.—The basic data of this 
report are the stimulus dependencies of 
Table 2. Each entry is a proportion 
of predictions of Event 1, conditional 
on the occurrence of the four previous 
events listed in the column heading. 
For instance, in Week 1, S1 predicted 
Event 1 on .612 of the trials in which 
Events 2, 1, 1, and 1, in that order, 
had occurred on the four preceding 
trials. 
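
As a minimal sketch of how such conditional proportions can be tabulated, the function below counts predictions of Event 1 after each pattern of four preceding events. The data layout (parallel lists of events and predictions coded 1 and 2) and the toy sequences are assumptions for illustration; the exclusion of warm-up trials used in the experiment is not reproduced here.

```python
from collections import defaultdict


def fourth_order_dependencies(events, responses, order=4):
    """Proportion of Event 1 predictions after each pattern of preceding events.

    events[t] is the event on trial t (1 or 2) and responses[t] is the
    prediction made on trial t before that event is shown. Patterns are
    tuples listed in order of occurrence, most recent event last.
    """
    counts = defaultdict(lambda: [0, 0])  # pattern -> [n_predict_1, n_total]
    for t in range(order, len(events)):
        pattern = tuple(events[t - order:t])
        counts[pattern][1] += 1
        if responses[t] == 1:
            counts[pattern][0] += 1
    return {pat: hits / total for pat, (hits, total) in counts.items()}


# Toy data, far too short for real use.
toy_events = [2, 1, 1, 1, 2, 1, 1, 1, 2]
toy_responses = [1, 1, 1, 1, 1, 2, 2, 1, 2]
toy_table = fourth_order_dependencies(toy_events, toy_responses)
```

In the toy data the pattern 2, 1, 1, 1 occurs twice and is followed by one prediction of Event 1, so its entry is one half.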

Although Table 2 is cumbersome, 
this degree of detail was considered 
necessary because of the marked indi- 
vidual differences, and because most Ss 
showed considerable trend over succes- 
sive weeks. However, Table 2 is best 
approached through two derivative 
analyses. The first gives the serial 
position data, and the second is the 
statistical analysis. 
Serial position data—These data 
give a direct index of the influence of 
each separate previous trial on the 
present response. The first row for 
each S in Table 3 shows the influence 
of the immediately preceding trial. 
Each entry is the difference in response 
probability when Event 1 and when 
Event 2 occurred on the previous trial. 
The second, third, and fourth rows are 
defined in exactly the same way, but 
with respect to the second, third, and 
fourth previous trials. These data were 
all calculated directly from Table 2 so 
that, e.g., the entry of .345 for S1 for 
Week 1 is obtained from the sum of 
the first eight entries minus the sum 
of the second eight entries in the first 
row of Table 2. 
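
A minimal sketch of this calculation follows. It assumes the 16 conditional probabilities are held in a dictionary keyed by the four preceding events, and it takes the difference between the means of the two groups of eight entries; that normalization is an assumption made so that each result reads as a probability difference. The toy table is hypothetical.

```python
from itertools import product


def serial_position_differences(table):
    """Dependency difference for each of the four previous trials.

    table maps a 4-tuple of events (in order of occurrence, each 1 or 2)
    to the probability of predicting Event 1. Element 0 of the result is
    for the immediately preceding trial, element 3 for the fourth previous.
    """
    diffs = []
    for lag in range(1, 5):
        idx = -lag  # position of that trial within the pattern
        after_1 = [p for pat, p in table.items() if pat[idx] == 1]
        after_2 = [p for pat, p in table.items() if pat[idx] == 2]
        diffs.append(sum(after_1) / len(after_1) - sum(after_2) / len(after_2))
    return diffs


# Hypothetical table in which only the two most recent events matter.
toy_table = {
    pat: 0.3 + 0.3 * (pat[-1] == 1) + 0.1 * (pat[-2] == 1)
    for pat in product((1, 2), repeat=4)
}
toy_diffs = serial_position_differences(toy_table)
```

For the toy table the first two differences recover the built-in effects of .3 and .1, and the remaining two are zero.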

Nonzero entries in Table 3 indicate 
a real influence of the specified pre- 
vious trial. As a rough guide to the 
inspection of these data, a single 
entry of between .06 and .08 is sig- 
nificantly different from zero. The 
column means are taken over Weeks 
2-5 and a mean about .025 is signifi- 


TABLE 3 

SEQUENTIAL DEPENDENCY DIFFERENCES FOR EACH WEEK FOR EACH S 


S | Previous trial | Week 1 | 2 | 3 | 4 | 5 | M || S | Previous trial | Week 1 | 2 | 3 | 4 | 5 | M 
| 6 o5 | 451 

1 1 345 2 140 | —068 230 | 6 1 334 183 442 384 7 
EEEE EEIE ai) a) Sa) a) a 

—123| —071| —111| —050 | — = 
4 002 028 076 103 178 096 4 018 072 039 066 007 046 
7 
2 1 050 172| 7 1 476 431 455 436 546 46" 
2 037 dos ose 2al 080| 093 2 268| 246| 246 293 un ars 
3 —010 | —040 | —059 022 035 | —010 3 —074 095 126 pa 100, tA 
4 —027 | —027 009 083 037 026 4 103 114 122 

584 413 

3| 1- | —159| —099 s| —121 | —070 | —054 | 8| 1 070} 131| 375| 563| _ 
g ERTI] iE iaj ael ael | | iit) ooe] oia R ae 
a |043 008 | oot Oe 032 | 029 4 009 | —025 | —045 | 017 | —033 | —022 
$ 1 063 | —006 065 112 252 106 | 9| 1 —120 | —311 Rie EAS Tage ree 
3 os | 023 | 02e | Tose | —066 Z026 S NEME 365| 9] 024]. 006 | —002 
4 | 2088 034 | —028 Zois | 7008 | -001 4 072| 089] 065| o14| 019| 047 
5| 1 |—418| -187 | —os6 | —131 | —306 | —170 | 10| 1 340| 078| 17 O81 _033 or7 
3 Zote 02s 020 H Tue 008 3 oss Os -017 025 000 016 
a |Zoa2| 027| 007] 2033 | Z031 | -008 4 050| 037 | —004 | —048 | —028 | —011 


Note.—Entries are probability differences; decimal point omitted; significant means are in boldface. 
cant. Significant means are indicated 
by boldface type. 

The main features of Table 3 are 
quickly summarized. First, it is evi- 
dent that the immediately preceding 
trial has a strong influence on the pres- 
ent response. For S3, S5, and S9, 
these entries are negative, reflecting the 
alternation behavior already noted. The 
effect of the second previous trial is 
significant for six Ss though much 
smaller in size. The third and fourth 
previous trials have rather small effects, 
on the whole, but are reliable in half 
of the Ss. 

The magnitude of the differences be- 
tween Ss may be illustrated by the 
Week 5 data in which the first-order 
dependency ranges from +.795 to 
—.886. The within-Ss differences are 
also strong as may be seen in the data 
for S1. There, the first-order depend- 
ency is large initially, increases yet fur- 
ther, and then decreases to become 
negative on Week 5. The fourth-order 
dependency is negligible in the initial 
weeks and then increases to sizable 
values by Week 5. 

The entries of Table 3 are theoreti- 
cally proportional to the weight param- 
eters, $w_i$, of Equation 1. Of course, 
the validity of interpreting these entries 
as weights depends on the validity of 
the averaging model. 

Statistical analysis —As remarked in 
the introduction, analysis of variance 
may be used to test the fit of the models 
to the data. Such analyses were run 
for each S on the basic data of Table 2, 
excluding Week 1 in which preexperi- 
mental habits might still be important. 
The data from Weeks 2-5 then form 
a five-way factorial design, with weeks 
and the four previous trials as factors. 

The results of these analyses are 
shown in Table 4 where, for con- 
venience of inspection, the F ratios 
significant at the .05 level are in bold- 
face type. The error mean square in 


the lowest row gives the variability es- 
timate used in the tests. 

For a single week, the 16 entries in 
any row of Table 2 represent a $2^4$ 
factorial design. The four previous 
trials constitute the factors of the de- 
sign, each at two levels as represented 
by the two events. In this design, 
right-left symmetry implies that the 
two-way interaction of any pair of 
trials is zero. Consequently, those 
three-way interactions that involved 
weeks and any pair of trials were ex- 
pected to be negligible. Accordingly, 
they were pooled with the five-way 
interaction to yield an error term on 
21 df. This error term is an over- 
estimate so that the statistical tests are 
somewhat conservative. The amount 
of overestimation appears to be small, 
but even marked overestimation would 
not affect the essential implications of 
the results. More detailed analyses 
were also made, on a day-within-weeks 
basis. These gave about the same 
overall picture of the results and are 
not reported here. 
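
The degrees of freedom of the pooled error term can be checked by simple bookkeeping. The sketch below uses only quantities stated in the text (four weeks, four previous trials at two levels each); the variable names are hypothetical.

```python
from math import comb

N_TRIALS = 4   # previous trials used as factors, each at two levels
N_WEEKS = 4    # Weeks 2-5

cells = N_WEEKS * 2 ** N_TRIALS              # 64 entries per S
total_df = cells - 1                         # 63
week_df = N_WEEKS - 1                        # 3

# Effects among the previous trials: 4 main effects, 6 two-way,
# 4 three-way, and 1 four-way interaction, each on 1 df.
trial_effect_df = 2 ** N_TRIALS - 1          # 15

# Pooled error: weeks x each pair of trials, plus the five-way interaction.
weeks_by_pairs_df = comb(N_TRIALS, 2) * week_df          # 18
five_way_df = week_df                                    # 3
error_df = weeks_by_pairs_df + five_way_df               # 21

# Remaining tested terms: weeks x single trials and weeks x triples.
tested_df = week_df + trial_effect_df + week_df * (comb(N_TRIALS, 1) + comb(N_TRIALS, 3))
```

The tested terms and the pooled error together exhaust the 63 degrees of freedom among the 64 entries.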


The tests of the theoretical formulations 
are based on the trial interactions that will 
be considered below. Before taking them 
up, there are some general features of the 
data that require comment. 

The F ratios in the first four rows of 
Table 4 test the main effects of the four 
previous trials. The algebraic values of 
these effects are just the entries in the 
column of means in Table 3 which have 
already been discussed. Each mean in Table 
3 is tested against zero by the corresponding 
F in Table 4. For example, in the column 
for S1, the F of 302.56 shows that the mean 
of .230 in Table 3 is reliably different from 
zero. Similarly, the F of 22.30 in the third 
row shows that the third previous trial has 
a significant effect on the response, of magni- 
tude -.062 as listed in Table 3. 

The lower portion of Table 4 gives in- 
formation on temporal trends. Significance 
of weeks denotes a shift in the overall re- 
sponse level from one week to the next. It 
is interesting that most Ss show reliable 
changes in response preference although, as 
a rule, these changes are of the order of a 
TABLE 4 


SUMMARY OF SINGLE-S ANALYSES OF VARIANCE OF THE 
STIMULUS DEPENDENCIES OF TABLE 2: WEEKS 2-5 


Source | df | S1 S2 S3 S4 S5 S6 S7 S8 S9 S10 
Trial ea 
1 | 302.56 | 172.36 | 15.74 | 64.03 | 147.22 | 1472.01 | 1200.01 | 1306.94 
2 1 | 35.89 | 49.99 | 137.18 -29 | 3.98 | 137.67 | 414.03 5 138 
3 | 1 | 22:30 106 | 39,98 | 3193 333 | 17.05 | 20.72| 1134 1162 
4 | 1 | 53.25] 3:79] 455 £00 :28 | 15.25] 59.02| 3.50 69 
12 1 2.02 418 3.85 2.98 88 2.93 72 1,90 
13 1 ‘00 ‘01 104 200 | 1112 164 ‘00 <94 03 
14 1 Maersk 129 00 | 1.24 2.52 701 110 1:04 
ashen 134 ‘32 | 9.10 30 | 3:70] 2:23 39 ‘03 ‘00 
24 1 108 :26 103 00 | 1:76 39 168 ‘09 3163 
34 1 14 4.38 | 438 36 | 1.33 119 148 118 1.42 
123 1 | 76.16 | 13.76 | 3.69 42| 647 | 69.16] 13.96 14 
124 1 | 21.06 | 1.07 10 | 204] 2.73 | 1:16] 23.24 397 “06 
134 | 1 | 37:26] 8.48 29 | 1199 142 | 25106] 4.36 33 1.34 
234 | 1 | 3054| 247| 1.93 ‘00 ton 43| 7.73| 2:00 321 
1234 1 18 89 87 1.38 40 1,04 1.28 09 48 
Weeks | 3 53 | 3.03 | 19.38 | 607] 112 | 341| 3,44] 8.31 8.44 
1 | 3 | 10742 | 965 | 10.67 | 17:00 | 1406 | 117.10| 3.94| 84.51 1.86 
2 | 3 | 615 | 15.64 140 (s6 | 14.40 | 1649] 1.50) 4.73 2.52 
3 | 3] 219] S, ‘837 | 4186 | 218] 3.83] 477] 1:33 1.61 
4] 3 5:57 | 316 | 1.20] 1.05 | 1.09 1.57 48} 139 2105 
svi ad fin 3.69 | 2.32 | 1.16 68} 499| 169] 2.67 94 153 83 
124 | 3 1.08 | 1.63 | 122 134 19 1.72 179 (94 186] 2.00 
134 | 3 120 | 2153 166 35} 296] 285] 1.80) 1.81] 143]> 215 
234 | 3 1:70 | 2.39 oa | 116) 311 1.25 179 ‘66 106| 1.73 
MS. | 21 | .00279 | .00274 | .00293 | .00280 | .00314 | .00221 | .00291 | .00209 | .00382 | .00258 


Note.—Entries are F ratios; those significant at .05 level are in boldface. Critical values are 4.32 and 3.07 for 
1/21 and 3/21 df. 
a MSe = error mean square. 


few percent. For instance, the F of 6.07 
for S4 is based on successive weekly means 
of .512, .449, .522, and .484. 

Each two-way interaction involving weeks 
tests whether the influence of one of the 
previous trials varied over time. For exam- 
ple, the F of 107.42 for the W1 interaction 
for S1 tests for reliable differences among 
the corresponding four entries in the first 
line of Table 3, viz., .586, .262, .140, and 
—.068. Similarly, the F of 6.15 for the W2 
interaction shows that the second-order 
dependency differences, .127, .073, .128, and 
-.011, are significantly different from one 
another. 


Theoretical tests—From the theo- 
retical view, the critical terms in the 
statistical analysis are those interactions 
involving previous trials only. Accord- 
ing to the stimulus sampling models, 
and the weighted average model, as 
well, all such interactions should be 
zero. In fact, the two-trial interactions 
in Table 4, such as 12 and 23, are 
nearly all nonsignificant. This gives 
slight support to the models, however, 


since the same would be expected from 
right-left symmetry alone. 

The important tests of Table 4 are 
those that involve three previous trials. 
Contrary to the predictions of the mod- 
els, 6 of the 10 Ss show significant 
three-trial interactions. For 5 of these 
6 Ss, more than one such interaction is 
significant, and the F ratios are gen- 
erally quite sizable. The theoretical 
implications of these discrepancies will 
be discussed in the next section, and the 
empirical locus of the interactions will 
be taken up here. Fortunately, it is 
simple to track these interactions down. 

To illustrate the procedure, consider 
the 123 interaction for S1. To exhibit 
this interaction, the means of Table 2 
are pooled over the fourth previous 
trial and are listed in Table 5. The 
upper and lower halves of Table 5 are 
2 × 2 tables based on the second and 
third previous events. Zero 123 inter- 
action would imply that the pattern of 
entries should be the same in each half 
except for an additive constant. These 
patterns are portrayed by the boldface 
difference scores which show that the 
23 interaction is different, though com- 
plementary, in the two halves of the 
table. 
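
The comparison of the two halves can be reproduced from the eight probabilities listed in Table 5. In this sketch the keys give the three preceding events in order of occurrence (third previous, second previous, most recent); the function names are hypothetical.

```python
# Probabilities of predicting Event 1 for S1, as listed in Table 5.
TABLE_5 = {
    (1, 1, 1): 0.67, (1, 2, 1): 0.50, (2, 1, 1): 0.62, (2, 2, 1): 0.67,
    (1, 1, 2): 0.34, (1, 2, 2): 0.37, (2, 1, 2): 0.53, (2, 2, 2): 0.31,
}


def second_trial_difference(third, recent):
    """Effect of the second previous event, holding the other two fixed."""
    return TABLE_5[(third, 1, recent)] - TABLE_5[(third, 2, recent)]


def interaction_23(recent):
    """23 interaction within one half of the table (fixed most recent event)."""
    return second_trial_difference(1, recent) - second_trial_difference(2, recent)


upper_half = interaction_23(recent=1)   # .17 - (-.05)
lower_half = interaction_23(recent=2)   # -.03 - .22
```

A zero 123 interaction would require the two values to be equal; here they are of similar size and opposite sign.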

Inspection of the table suggests that 
the major source of the interaction lies 
in an unduly low response to the 121 
alternation sequence, and an unduly 
high response to the 212 alternation 
sequence. This diagnosis is confirmed 
by reference to Table 2. Given the 
alternation sequence, 2121, the proba- 
bility of predicting Event 1 is .343. In 
marked contrast, the probability is 
greater than .6 for all seven other 

sequences that end with Event 1. The 
other alternation sequence, 1212, pre- 
sents exactly the same picture: When 
a short stretch of alternation occurs in 
the event sequence, S1 falls in step 
with it. 

For S1, the 123 interaction is mainly 
due to this alternation tendency. The 
same tendency also underlies the other 
three-trial interactions for S1 as can be 
seen by similar breakdowns. In addi- 
tion, this S responds to double alter- 
nations though the tendency is slight 
and is seen only in one or two other Ss. 

The other Ss were treated in the 
same way, and in each case the signifi- 
cant three-trial interactions were local- 
ized in an alternation tendency. To this 
extent, at least, the behavior shows 
considerable uniformity. 

This analysis led to the construction 
of the alternation curves of Table 1, 
discussed above. With one exception, 
those Ss who show alternation behavior 
in Table 1 are the same Ss who show 
significant three-trial interaction. The 
exception, S3, shows an alternation 
tendency that depends only on the 
previous two events. 


Foulkes procedure—The Foulkes (1959) 
procedure tries to identify the stimulus pat- 
terns to which S is presumed to respond. 
To illustrate, suppose that P is a pattern of 
k successive events, and consider two sub- 
sequences, $E_1P$ and $E_2P$, which differ only 
in the (k+1)st previous event. All occur- 
rences of each subsequence are classed ac- 
cording as the next response is a prediction 
of $E_1$ or of $E_2$. These data form a 2 × 2 
table of frequencies, and a significant chi- 
square implies a correlation between the 
present response and the event k+1 trials 
ago. It is then assumed that the pattern to 
which S is responding includes at least the 
Pattern P, and the complete pattern is de- 
lineated by working backward to a nonsig- 
nificant chi-square. 
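
One step of this search can be sketched as a chi-square test on a 2 × 2 frequency table. The frequencies below are hypothetical, and the use of the Pearson statistic without a continuity correction is an assumption; the text does not say which form was used.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]].

    Rows: the subsequence that occurred (E1 P or E2 P).
    Columns: the next response (prediction of E1 or of E2).
    """
    n = a + b + c + d
    row1, row2, col1, col2 = a + b, c + d, a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column needs at least one observation")
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


CRITICAL_05_DF1 = 3.841  # .05 critical value of chi-square on 1 df

# Hypothetical frequencies for one S and one pattern P.
chi2 = chi_square_2x2(30, 17, 15, 32)
extend_pattern = chi2 > CRITICAL_05_DF1
```

When the statistic exceeds the critical value, the pattern is extended one event further back and the test is repeated; the search stops at the first nonsignificant result.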

The modifications introduced by Feldman 
and Hanna (1966) were used here, with com- 
puter tabulation and testing for each S. The 
main finding was the prevalence of alterna- 
tion responding, already discussed, which 
showed up as large chi-squares when com- 
paring subsequences such as 1212 and 2212. 
However, little else was obtained from these 


TABLE 5 
LOCUS OF 123 INTERACTION FOR S1 


Events | Probability | Difference || Events | Probability | Difference 
111    | .67         |            || 211    | .62         | 
       |             | .17        ||        |             | -.05 
121    | .50         |            || 221    | .67         | 
112    | .34         |            || 212    | .53         | 
       |             | -.03       ||        |             | .22 
122    | .37         |            || 222    | .31         | 


Note.—Entries are probabilities of predicting Event 1, conditional on occurrences of three specified preceding 
events. Events are listed in order of occurrence from left to right. In the upper half, Event 1 is most recent in 
each case, and the events of the second and third previous trials form a 2 × 2 design. 

analyses. The procedure was then extended 
to search for patterns based on responses and 
stimuli over three previous trials. Although 
S10 showed considerable response repetition 
toward the end of the experiment, there was 
otherwise little correlation between previous 
and present responses, in general agreement 
with previous work (Anderson, 1960; Ed- 
wards, 1961). 

In summary, the Foulkes procedure was 
not very useful here. Possibly it is more 
appropriate when specific patterns are built 
into the stimulus sequence as in Feldman 
and Hanna (1966). The overall analysis 
of variance, however, yielded a more detailed 
and better organized account of the present 
data. At the same time, each chi-square 
test corresponds to a single degree-of-free- 
dom test in the analysis of variance which 
can, therefore, be applied more generally. 


Discussion 
Theoretical Implications 


The relevant data have now been pre- 
sented and need only be summarized in 
relation to the theoretical models. Both 
the linear model and the pattern model 
derived from stimulus sampling theory 
make the same predictions for the present 
data. Accordingly, they will be consid- 
ered together. 

Linear and pattern models—These 
models disagree with the data in almost 
every respect. First, they predict that 
the run curves of Fig. 1 will be simple 
growth curves approaching unity. This 
is evident only for S6 and S7 as has been 
noted. The failure of the other Ss to 
conform to prediction cannot be attributed 
to a low learning rate except in one or 
two cases. 

Second, the sequential dependency dif- 
ferences of Table 3 also contradict the 
theory. As previously shown (Anderson, 
1959), the four entries within any column 
for a given S have the theoretical values 


$$\theta,\quad \theta(1-\theta),\quad \theta(1-\theta)^2,\quad \theta(1-\theta)^3$$


where $\theta$ is the learning rate. If the first- 
order dependency is used directly as an 
estimate of $\theta$, then the mean entries in 
Table 3 for S1 should be related as 


.23, .18, .14, 11, 


which differ markedly from the observed 
values, 


.23, .08, -.06, .10. 


The fits for the other Ss are no better, in 
agreement with previous findings (Ander- 
son, 1964a). 
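
The predicted values for S1 can be reproduced with a short sketch. It takes the first-order dependency of .23 as the estimate of the learning rate, as in the text, and computes the geometric series for the four previous trials; the function name is hypothetical.

```python
def linear_model_dependencies(theta, n_lags=4):
    """Theoretical dependency differences: theta * (1 - theta)**(i - 1), i = 1..n_lags."""
    return [theta * (1 - theta) ** i for i in range(n_lags)]


# Learning rate estimated by the mean first-order dependency for S1.
predicted = [round(v, 2) for v in linear_model_dependencies(0.23)]

# Mean entries for S1 as quoted in the text.
observed = [0.23, 0.08, -0.06, 0.10]

discrepancies = [round(o - p, 2) for o, p in zip(observed, predicted)]
```

The model requires a monotone geometric decline with constant sign, whereas the observed values drop sharply, change sign at the third previous trial, and rise again at the fourth.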

Moreover, these estimates of $\theta$ are ac- 
tually negative for S3, S5, and S9. Pos- 
sibly, negative values of $\theta$ could be 
rationalized by redefinition of the re- 
sponse. However, the positive recency 
in the run curves for these Ss would 
then be difficult to interpret. 

Finally, six Ss show sizable and sig- 
nificant interactions, contrary to the pre- 
dictions that may be obtained from the 
models. Indeed, S6 and S7, who show 
theoretically orderly run curves, are 
among the more discrepant in this respect. 
These interactions arose from alternation 
tendencies that are particularly difficult to 
handle theoretically. 

Although it is thus evident that the 
linear and pattern models do not account 
for the behavior, the theoretical implica- 
tions are naturally limited by the 50:50 
schedule that was used. As noted earlier, 
this schedule was chosen to avoid any 
possible discriminative stimulus compo- 
nent. Unfortunately, such a schedule car- 
ries the risk of producing adverse mo- 
tivational effects. Moreover, the lack of 
any way of doing reasonably well may 
have potentiated the large individual dif- 
ferences as well as the marked temporal 
variation within the individual Ss. The 
use of a 75:25 schedule may have been 
preferable from the theoretical point of 
view, and caution should be used in inter- 
preting the present findings. 

Nevertheless, the data do add to the 
evidence that probability learning may 
not be a suitable situation for testing 
simple models derived from stimulus sam- 
pling theory. One situation that includes 
elements of probability learning and that 
produces stable behavior over long pe- 
riods of time is simple psychophysical de- 
tection. Recent work suggests that the 
simple conditioning models developed for 
probability learning provide very reason- 
able interpretations of response bias in 
detection experiments (Atkinson & Kin-
chla, 1965; Friedman, Carterette, Naka- 
tani, & Ahumada, 1968). 

Averaging model.—This formulation 
does somewhat better than the linear and 
pattern models. First, the failure of the 
run curves to approach unity is easily 
accounted for. Indeed, the theoretical 
asymptote for a run of Event 1 is simply 
w₀I₀ + (1 − w₀). If I₀ = ½, then the
asymptote for a run is 1 − w₀/2. Sim-
ilarly, the asymptote for a run of Event
2 would be w₀/2.
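
The asymptotes just stated can be illustrated numerically. The sketch below is not the author's code. It assumes that the weights of the averaging model sum to one, so that a long run of one event carries total weight 1 − w₀, and that Event 1 has scale value 1 and Event 2 scale value 0. The function name and the example value of w₀ are hypothetical.

```python
def run_asymptote(w0, i0=0.5, event=1):
    """Asymptotic probability of predicting Event 1 after a long run of `event`.

    w0 is the weight of the initial impression and i0 its scale value.
    The run itself is assumed to carry the remaining weight, 1 - w0.
    """
    if not 0.0 <= w0 <= 1.0:
        raise ValueError("w0 must lie in [0, 1]")
    run_value = 1.0 if event == 1 else 0.0
    return w0 * i0 + (1.0 - w0) * run_value


w0 = 0.4  # hypothetical weight, for illustration only
print(run_asymptote(w0, event=1))  # equals 1 - w0/2 when i0 = 1/2
print(run_asymptote(w0, event=2))  # equals w0/2 when i0 = 1/2
```

Any positive w₀ therefore holds the run curve below unity, and a value of I₀ other than ½ shifts both asymptotes in the same direction.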

Position preferences can also be handled 
in this way since they correspond to 
values of I₀ different from ½. That an
adequate model needs to allow for such 
preferences is seen in the significance of 
weeks in the main analyses for most Ss. 
An especially strong preference is shown 
by S10 who predicts Event 1 on .67 of 
the trials. 

The sequential dependency differences 
of Table 3 present no problem in them- 
selves. Indeed, the differences for pre- 
vious Trials 1 through 4 in any column 
are estimates proportional to w₁ through
w₄, respectively. These values are, of
course, free parameters for the averaging 
model. In this respect, it has a distinct 
advantage over the simple conditioning 
models that needs to be kept in mind in 
any comparison. However, this greater
freedom is inherent in an approach based 
on information integration rather than 
simple conditioning, and the data indicate 
that it is necessary. 

Nevertheless, the averaging model is
not able to account completely for the
sequential dependency data. For some
Ss, of course, the fit was good. In view
of the amount of data, it is not unreason- 
able to suppose that these Ss did follow 
the averaging model. However, at least 
half of the Ss showed strong interactions 
in the test of goodness of fit. 
These interactions were largely local-
ized in alternation responding to alterna- 
tion subsequences. This degree of uni- 
formity affords some hope that a simple 
theoretical resolution of the discrepancy 
may be possible. 

An earlier two-response hypothesis 
(Anderson, 1960) may provide a basis 
for resolving the discrepancy. This pos-
tulated two separate response tendencies, 
alternation and repetition, that combined 
to produce the observed response. The 
earlier attempt to develop this hypothesis 
within the framework of a linear condi- 
tioning model was not successful. Sub- 
sequent work has provided considerable 
support for the existence of the two re- 
sponse tendencies, however. Accordingly, 
it may be worthwhile trying to incor- 
porate them into an averaging-type model. 


Empirical Implications 


In the main, the present long-term data 
are consistent with and extend previous 
short-term work with probability learn- 
ing. The tendency for the immediately 
preceding event to carry much greater 
weight on the present response than yet 
earlier events has already been discussed. 
Several other such results deserve brief 
comment in relation to previous work. 

A striking feature of the behavior was 
the extent of alternation responding. 
Such behavior has been noted in short- 
term data (Anderson, 1960, 1966) but 
long-term training seems to have brought 
it out much more strongly. Since such 
behavior is not differentially reinforced, 
its extensive development is puzzling. 

The tendency of the run curves to ap- 
proach asymptotes less than unity is in 
general agreement with previous work. 
Actually, short-term experiments are sel- 
dom conclusive in this respect because of 
insufficient data for runs of more than 
a few trials. However, one short-term 
experiment used repetition-type sequences 
that provided very long runs (Anderson, 
1960). In this experiment, the tendency 
for run curves to asymptote below unity 
was very clear. It has also been observed 
in medium-term data by Witte (1964). 
In the present data, it may be noted, the 
tendency cannot reasonably be attributed 
to preexperimental habits of negative 
recency. It may be suggested, therefore, 
that it is characteristic of probability 
learning. In any case, it deserves close 
theoretical attention.

Individual differences in probability 
learning have been noted (Anderson & 
Grant, 1957) but little studied. In the
present data they are extremely large; 
however, they may represent differences 
in amount rather than differences in kind, 
since a major locus of the differences 
seemed to be the alternation responding 
to alternation sequences. This is prob- 
ably not a sufficient basis to account for 
the behavior, but the two-response hy- 
pothesis mentioned above may be more 
successful. 

Even within a single S, there were 
marked changes in response over time. 
Indeed, even after 5 wk. in the experi- 
ment, it could hardly be said that Ss had 
reached a stable state. Of course, the 
50:50 schedule provided maximal oppor- 
tunity for these individual variabilities to 
manifest themselves. Schedules with un- 
equal frequencies, or conditional prob- 
ability schedules, may produce greater 
uniformity in the overt behavior, and this 
may be desirable for certain purposes. 
However, an adequate behavior theory
must be able to allow for the individual 
differences that do exist. 

SERIAL ACQUISITION AS A FUNCTION OF NUMBER
OF SUCCESSIVELY OCCURRING LIST ITEMS ¹


JAMES F. VOSS 
University of Pittsburgh 


A 16-unit serial list was presented for 40 trials under 5 experimental 
conditions. In each condition each item occurred at its respective 
serial position on 20 of the 40 trials and an “X” occurred in lieu of 
the item on the remaining 20 trials. The independent variable was 
how many items of the list occurred successively on any one trial: 
1, 2, 4, 8, or 16; e.g., in Cond. 1, the sequence on Trial 1 was item- 
X-item-X-etc. and on alternate trials it was X-item-X-item, etc. 
For Cond. 2, the sequence was item-item-X-X-item, etc. Thus, 
each item occurred equally often at its respective serial position but 
the length of chain of successive items was 1, 2, 4, 8, or 16 units. 
The results indicated that correct responses, conditional probability 
of adjacent item responses, and pattern of intrusions varied as a 
function of the experimental conditions. The findings were interpreted 
as showing that serial acquisition consists of placing items along a 
temporal dimension with the functional stimulus being a compound 
stimulus of each item, i, serving as a cue, and the list locus of
item i + 1. Results of other studies also were evaluated in the
context of the present findings.


The associative structure of a serial
list has been manipulated in single-
stage experiments by varying the con-
ditional probability of successive items
(Voss, 1966), the normative associa-
tive strength of adjacent items (Die-
thorn & Voss, 1967), or the strength of
a cluster of related items (Weingartner,
1964; Wright & Bernstein, 1965). In
two-stage experiments, attempts to
manipulate the associative structure of
serial lists have consisted of designs in
which serial learning was preceded by
acquisition of a related paired-asso-
ciate list (e.g., Young, 1959) or a re-
lated serial list (e.g., Battig, Brown,
& Schild, 1964).

1 The experiment reported in this paper

Associative structure in the present
experiment was manipulated by vary-
ing the number of successively occur-
ring list items. More specifically, a
16-unit serial list was presented for 40 
trials with the modification that each 
item occurred on only 20 trials and an 
“X” occurred in place of each item on 
the remaining 20 trials. Five experi- 
mental conditions were employed in 
which the number of items occurring 
successively was 1, 2, 4, 8, or 16; 
e.g., in one condition, Items 1, 3, 5,
. . . , 13, 15 occurred on one-half of
the trials and the remaining items oc- 
curred on the other one-half of the 
trials. At the other extreme, all 16 
items were presented on one-half of the 
trials and no items were presented on 
the other one-half of the trials. It was 
hypothesized that if chaining is im- 
portant in serial learning, then perform- 
ance should improve as the number of 
successively occurring adjacent items 
increases; on the other hand, if the ab-
solute position hypothesis holds, then
there should be no difference in acquisi-
tion as a function of condition because
each item in each condition occurs
equally often at its respective serial
position.


METHOD 


Procedure.—A 16-unit modified serial list
was presented for 40 trials at a 2-sec. rate
with an 8-sec. intertrial interval. The modi-
fication was that each item was presented on
only 20 of the 40 trials, with an X presented in
place of the item on the remaining 20 trials.
The final 2 sec. of the 8-sec. intertrial in- 
terval consisted of presentation of a cue X. 

Five experimental conditions (EC) were 
employed in which 1, 2, 4, 8, or 16 items were 
presented successively on any particular trial. 
Each condition consisted of alternate A and 
B trials with one-half of the items occur-
ring on each trial; e.g., in Cond. 1, Items
1, 3, 5, . . . appeared on the A trials and 2,
4, 6, . . . appeared on the B trials. For one-
half of the Ss in each EC, Trial A was 
presented first; for the remaining one-half of 
the Ss, Trial B was presented first. In 
terms of the conditional probability of oc- 
currence of successive items, Cond. 1, 2, 4, 8, 
and 16, respectively, involved conditional 
probabilities of .00, .50, .75, .875, and 1.00.²
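
The conditional probabilities listed above can be reproduced from the trial layout. The following is a minimal Python sketch, assuming that chains of the given length alternate between A and B trials starting at Item 1, and that the first item of a trial is treated as following the sixteenth (the simplifying assumption the author states in a footnote). The function names are illustrative.

```python
def items_on_trial(chain_len, trial, list_len=16):
    """Serial positions (1-based) shown as items on an 'A' or a 'B' trial."""
    offset = 0 if trial == "A" else 1
    return {pos for pos in range(1, list_len + 1)
            if ((pos - 1) // chain_len) % 2 == offset}


def conditional_probability(chain_len, list_len=16):
    """P(item i + 1 is shown | item i is shown), pooled over A and B trials."""
    shown_items = 0
    shown_pairs = 0
    for trial in ("A", "B"):
        shown = items_on_trial(chain_len, trial, list_len)
        for pos in shown:
            successor = pos % list_len + 1  # position 16 wraps back to position 1
            shown_items += 1
            shown_pairs += successor in shown
    return shown_pairs / shown_items


for k in (1, 2, 4, 8, 16):
    print(f"Cond. {k:2d}: items on A trials = {sorted(items_on_trial(k, 'A'))}, "
          f"conditional probability = {conditional_probability(k):.3f}")
```

Under these assumptions the value for a chain of length k is (k − 1)/k, which gives the five probabilities reported.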

The anticipation method was used and Ss 
were carefully instructed so that they under- 
stood they were to learn one list even 
though the complete list may not occur on 
any particular trial. The Ss also were in- 
structed to try to anticipate the next item of 
the list whenever an X was presented and 
also to try to anticipate the next list item 
even though they expected an X to occur. 
Following the instructions Ss were asked to 
describe in their own words what they were 
to do. 

Materials and apparatus.—The following
list of 16 high-frequency words was pre-
sented via Carousel projector: wish, street,
justice, ocean, woman, lamp, stove, eagle,
smooth, health, wagon, king, shoes, dark,
fruit, earth.

Subjects.—The Ss were undergraduate stu- 
dents at the University of Pittsburgh. The 
N of each of the five EC was 12, with one- 
half of the Ss in each condition serving in 
one of the two trial alternation conditions. 
The Ss were assigned to the 10 conditions 
via a table of random numbers. 


2 The conditional probabilities presented
assume that the first item on any trial fol-
lows the sixteenth item of that trial. This
unrealistic assumption is for simplification
and has only a small effect on the conditional
probability values.


RESULTS ³


Correct response frequency.—The
significantly differing mean correct re- 
sponse frequencies for Cond. 1, 2, 4, 8, 
and 16 were 188.3, 216.6, 281.7, 308.3, 
and 412.0, respectively, F (4, 50) = 
5.01, p < .01. Application of Duncan's 
range test indicated that Cond. 1 and 16 
differ significantly, p < .01. In addi-
tion, Cond. 16 performance was signifi-
cantly superior to Cond. 4 performance,
p < .05, and Cond. 4, 8, and 2 perform-
ance was significantly superior to
Cond. 1, p < .05. The analysis of vari-
ance also indicated a significant serial
position (SP) source of variation, F
(15, 750) = 21.15, p < .01, and SP in-
teracted significantly with EC, F (60,
750) = 2.58, p < .01.
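
The degrees of freedom reported above can be checked against the design. The sketch below assumes, although the text does not say so, that the between-Ss error term was computed within the 10 cells formed by crossing EC with the trial alternation condition. The variable names are illustrative.

```python
n_conditions = 5       # EC: 1, 2, 4, 8, or 16 successive items
n_alternations = 2     # Trial A first vs. Trial B first
n_per_condition = 12   # Ss in each EC
n_positions = 16       # serial positions

n_subjects = n_conditions * n_per_condition
df_ec = n_conditions - 1
# Assumed error term: Ss within the 10 EC x alternation cells.
df_between_error = n_subjects - n_conditions * n_alternations
df_sp = n_positions - 1
df_sp_by_ec = df_sp * df_ec
df_within_error = df_sp * df_between_error

print(f"EC:      F ({df_ec}, {df_between_error})")
print(f"SP:      F ({df_sp}, {df_within_error})")
print(f"SP x EC: F ({df_sp_by_ec}, {df_within_error})")
```

Under that assumption the three pairs of degrees of freedom agree with those reported for EC, SP, and their interaction.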

Figure 1 presents the SP curves for 
the five ECs. Inspection of Fig. 1 
indicates (a) the SP curves for Cond. 
1 and 16 are strikingly similar; (b) 
Cond. 8 yielded a low point at SP9, 
thus indicating the existence of two 
subcurves, SP1-8 and SP9-16; (c) the 
low points at SP9 and SP13 indicate 
that Cond. 4 consists of two four-unit 
subcurves for SP9-12 and 13-16, and a 
somewhat irregular curve from SP1-8; 
and (d) Cond. 2 yielded a relatively 
smooth curve from SP1 to SP6, but

3 In no case did the alternation condition
yield a significant main effect nor did it
interact significantly with any other variable.
It is, therefore, not discussed in this section.
Fig. 1. Mean correct responses as a func-
tion of serial position for the five experi- 
mental conditions. 
TABLE 1


CONDITIONAL PROBABILITIES OF CORRECT AND INCORRECT RESPONSES AT
SERIAL POSITIONS i AND i + 1


Experi-    Conditional
mental     Probability                          Serial Position i + 1
Condition   i   i+1     2    3    4    5    6    7    8    9   10   11   12   13   14   15   16

    1       C    C    .57  .70  .60  .72  .57  .62  .58  .60  .47  .63  .65  .52  .52  .53  .65
            I    C    .15  .23  .08  .18  .12  .14  .10  .17  .11  .13  .14  .15  .19  .20  .24
    2       C    C    .83  .84  .81  .77  .83  .58  .89  .61  .89  .61  .92  .59  .93  .55  .92
            I    C    .21  .25  .22  .24  .23  .08  .20  .10  .20  .12  .26  .13  .28  .16  .34
    4       C    C    .80  .89  .88  .61  .74  .86  .84  .59  .88  .75  .89  .52  .91  .80  .95
            I    C    .23  .13  .12  .20  .17  .16  .10  .05  .24  .08  .18  .06  .27  .09  .16
    8       C    C    .78  .79  .86  .94  .83  .90  .82  .58  .93  .76  .73  .80  .84  .89  .87
            I    C    .07  .09  .16  .17  .08  .20  .13  .05  .29  .11  .19  .21  .15  .15  .12
   16       C    C    .85  .91  .87  .96  .88  .94  .79  .92  .77  .86  .90  .89  .87  .91  .93
            I    C    .17  .22  .15  .21  .13  .19  .06  .21  .11  .18  .20  .17  .16  .16  .19


Note.—C = correct; I = incorrect.


performance from SP7 to SP16 was 
poorer for the first item of each pair 
of items. 

Conditional probability data.—Table
1 presents the conditional probability
data for each pair of SP at each EC.
The two rows in each section of the
table present, respectively, the probabil-
ity of a correct response (C) at SPi
+ 1, given a correct response (upper
row) or an incorrect response (lower
row), at SPi.

One finding of note is that in Cond. 
2, 4, and 8, the point at which a break 
in the chain occurred generally is char- 
acterized by a relatively low C-C con- 
ditional probability. In Cond. 2, this 
result occurred for each pair of items 
after SP4; for Cond. 4 and 8, the result 
held for each chain. 


A second finding is that the major
difference between Cond. 1 and 16 is in the
C-C data. Although more correct re-
sponses occurred in Cond. 16, the data
show higher C-C conditional probabili- 
ties in Cond. 16 but approximately 
equivalent I-C probabilities in the two 
conditions. This finding and the con- 
ditional probability results of Cond. 2, 
4, and 8 indicate that presentation of an 
item at SPi yields a greater likelihood 
of a correct response at SPi + 1. 
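
The two conditional probabilities discussed here can be computed from trial-by-trial scoring. The sketch below is illustrative only. The data layout (one list of booleans per trial, True for a correct anticipation) and the toy data are assumptions and do not come from the experiment.

```python
def conditional_correct(responses, i):
    """Return (P(C at SP i+1 | C at SP i), P(C at SP i+1 | I at SP i)).

    `responses` is a list of trials; each trial is a list of booleans
    indexed by serial position, and `i` is the 1-based serial position.
    """
    after_correct = [trial[i] for trial in responses if trial[i - 1]]
    after_incorrect = [trial[i] for trial in responses if not trial[i - 1]]

    def proportion(values):
        return sum(values) / len(values) if values else float("nan")

    return proportion(after_correct), proportion(after_incorrect)


# Hypothetical scoring for two adjacent serial positions over five trials.
toy = [[True, True], [True, False], [False, True], [False, False], [True, True]]
cc, ic = conditional_correct(toy, 1)
print(f"C-C = {cc:.2f}, I-C = {ic:.2f}")
```

A C-C value well above the I-C value at a given pair of positions corresponds to the pattern described for adjacent items within a chain.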

Intrusions.—The first column of
Table 2 presents a summary of the in-
tralist intrusion frequency data for each
EC. In addition, the second and third
columns present the percentage of
mean intralist intrusions to list items or
to X, respectively; and the fourth and
fifth columns present the percentage of


TABLE 2


MEAN INTRALIST INTRUSION FREQUENCY, PERCENTAGE OF INTRUSIONS TO ITEMS AND X,
AND PERCENTAGE OF ANTICIPATORY AND PERSEVERATIVE ERRORS
FOR EACH EXPERIMENTAL CONDITION


Experimental   Mean intrusion   % to     % to    % Antici-   % Persev-
Condition      frequency        items     X      patory      erative

     1              92.1         59.1    40.9      69.7        30.3
     2              65.0         41.3    58.7      62.1        37.9
     4             103.0         38.0    62.0      51.0        49.0
     8             100.7         33.7    66.3      58.5        41.5
    16              67.1         26.1    73.9      70.2        29.8
anticipatory and percentage of persev-
erative errors, respectively. The two 
breakdowns are independent. An 
analysis of variance performed on the 
intrusion frequency data yielded no sig- 
nificant difference as a function of EC, 
F (4, 50) = 1.05. 

With respect to whether intralist in- 
trusions occurred to an item or to an 
X, an analysis of variance was con- 
ducted on the arc sin transformed
percentage of intrusion to X data with 
the finding that the percentage of in- 
trusions to X increased significantly 
over Cond. 1-16, F (4, 50) = 9.40, p 
< .01, with a significant linear trend, 
F (1, 50) = 34.1, p < .01. 
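
The arc sin transformation is commonly applied to proportions before an analysis of variance. The text does not state which variant was used, so the sketch below assumes the simple form arcsin of the square root of the proportion (in radians). It is applied here only to the group percentages of intrusions to X, for illustration, whereas the reported analysis would have used the scores of individual Ss.

```python
import math


def arcsine_transform(percentage):
    """Arc sine square-root transform of a percentage (result in radians)."""
    if not 0.0 <= percentage <= 100.0:
        raise ValueError("percentage must lie in [0, 100]")
    return math.asin(math.sqrt(percentage / 100.0))


# Group percentages of intrusions to X for Cond. 1, 2, 4, 8, and 16.
percent_to_x = {1: 40.9, 2: 58.7, 4: 62.0, 8: 66.3, 16: 73.9}
for condition, pct in percent_to_x.items():
    print(f"Cond. {condition:2d}: {pct:5.1f}% -> {arcsine_transform(pct):.3f}")
```

The transform is monotonic, so the increase in intrusions to X across conditions is preserved on the transformed scale.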

The arc sin transformed percentage of 
anticipatory errors data were submitted 
to an analysis of variance. Although 
the analysis revealed a nonsignificant 
EC source of variation, F (4, 50) = 
2.40, a significant quadratic component 
was found, F (1, 50) = 4.43, p < .05. 
This finding indicates that anticipatory 
errors tend to vary in a bow-shaped 
manner with the lowest percentage at 
Cond. 4. 

In order to analyze the intralist in- 
trusions in more detail, the data were 
tabulated according to the list locus of 
intrusion item, relative to the position 
where the intrusion occurred (Peter-
son, Brewer, & Bertucco, 1963). Spe-
cifically, for each item, i, the intrusions
from the i + 1 item, the i + 2 item,
. . . , i + 15 item and intrusions from
the i − 1, i − 2, . . . , i − 15 item were
tabulated. These tabulations, i.e., i +
1, i + 2, etc., were then summed for all
items. For example, for the first item,
the intrusions from SP2 (i + 1), SP3
(i + 2), . . . , SP16 (i + 15) were
tabulated. For the second item, the
intrusions tabulated were the SP1 (i −
1), SP3 (i + 1), SP4 (i + 2), . . . , SP16
(i + 14). The same tabulations
were performed for the third, fourth,
. . . , sixteenth items. The intrusion
frequencies for each i − 15, i − 14,
. . . , i + 15 point then were summed
over all of the items and divided by the
number of items used in the summing
procedure. There were 15 scores in i
− 1 and i + 1, 14 scores in i − 2 and
i + 2, etc.
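
The tabulation described above can be sketched as follows. This is an illustrative reading of the procedure, not the author's code. Each intrusion is represented by a hypothetical pair giving the serial position at which it occurred and the serial position of the list item given in error, and the total at each relative position is divided by the number of items that can contribute to it.

```python
from collections import Counter


def mean_intrusions_by_offset(intrusions, list_len=16):
    """Mean intrusion frequency at each relative position i - 15, ..., i + 15.

    `intrusions` is an iterable of (position, source) pairs, both 1-based:
    where the intrusion occurred and where the intruding item belongs.
    """
    counts = Counter(source - position for position, source in intrusions
                     if source != position)
    means = {}
    for offset in range(-(list_len - 1), list_len):
        if offset == 0:
            continue
        n_scores = list_len - abs(offset)  # 15 scores at +/-1, 14 at +/-2, etc.
        means[offset] = counts[offset] / n_scores
    return means


# Hypothetical intrusions, for illustration only.
example = [(1, 2), (2, 3), (5, 9), (5, 9), (10, 9)]
means = mean_intrusions_by_offset(example)
print(means[1], means[4], means[-1])
```

Positive offsets correspond to anticipatory errors and negative offsets to perseverative errors.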

Figure 2 presents the mean intrusion 
frequency for SP relative to the ith SP. 
The following points appear note- 
worthy: (a) Cond. 16 intrusions, pre- 
dominantly anticipatory, decrease as a 
function of distance from the ith item. 
Moreover, the Cond. 1 intrusion dis- 
tribution is quite similar to Cond. 16. 
(b) In Cond. 4, the anticipatory intru- 
sion frequency tends to be greatest at 
the i+ 4, i +8, and i+ 12 SP, thus 
strongly supporting the notion that Ss 
in Cond. 4 learned the list in chains of 
four, and intrusions resulted from con- 


Fig. 2. Mean intrusion frequency as a
function of relative distance from the ith
SP for each experimental condition.
(Abscissa: SP relative to ith SP; ordinate:
mean intrusion frequency.)


fusing items from the same relative 
position of other four-unit chains. (c) 
The Cond. 2 intrusion pattern from i + 
1 to i + 6 suggests a tendency for Ss to 
confuse items as a function of the rela- 
tive position of the next three pairs of 
items. (d) Condition 8 is characterized 
by intrusions coming from similar posi- 
tions of the second 8-unit chain, espe- 
cially i +8. The relatively high intru- 
sion frequency at the i — 5, i — 9, and 
i — 13 SP of Cond. 4 and the relatively 
high intrusion frequency at the i— 9 
SP of Cond. 8 suggest the occurrence
of confusion between an item and the 
item which precedes its same relative 
position in another chain. 


Discussion 


The present results provide evidence 
against the absolute serial position hy- 
pothesis of serial learning (e.g., Young, 
Hakes, & Hicks, 1967). Specifically, the 
occurrence of performance differences as 
a function of EC when each item was 
presented equally often at its respective 
SP suggests that if Ss used the absolute 
SP as one aspect of a functional stimulus, 
its effect was negligible or its influence 
on learning was obscured by more power- 
ful effects. 

With respect to the chaining hypothe- 
sis, the significant effect of EC and the 
conditional probability data may be in- 
terpreted as supporting the chaining hy- 
pothesis. In particular, the data suggest 
the superior performance in Cond. 16 to 
Cond. 1 may be attributed to the presenta- 
tion of adjacent items. Although the data 
are reasonably convincing in their support 
of chaining, an alternative interpretation 
is presented which is in greater agree- 
ment with the present data as well as 
other particular results.

In order to develop the interpretation, 
it first is necessary to make a distinction 
between the cue function and the associa- 
tive function of an item. The former is 
the property of an item, i, which serves 
as a cue for S to respond, i.e., to try to 
anticipate the next item, i+ 1; the latter 
is the property of an item, i, to develop an 
association with its succeeding item, i + 1.
The importance of this distinction lies in 
the assumption that an item, i, may serve 
as a cue for a response without an as- 
sociation necessarily developing between 
the i and i+ 1 items. Furthermore, this 
distinction implies that when i is pre- 
sented, S may try to state item i+1 by 
using any process at his disposal and not 
necessarily by forming an association be- 
tween items i and i+ 1. 

Stage analyses of serial learning
(Slamecka, 1964; Voss, 1966) have sug-
gested that Ss first try to learn the list
items and their respective SP, then try to 
place them at their proper locus. Assum- 
ing that the acquisition of the items per 
se has occurred, it is argued that the pres- 
ent results as well as other findings sug- 
gest that item placement consists primarily 
of ordering a list of items along a tem- 
poral dimension and not of learning a 
sequence of associations, and, further- 
more, that the functional stimulus in item 
placement primarily consists of: (a) S’s 
knowledge of the relative list locus of an 
item, i+ 1 and (b) the cue property of 
item i. Moreover, if upon presentation 
of i, S anticipates i+ 1, presentation of 
i+ 1 serves to confirm his response. In 
other words, S learns i + 1 and, at the
same time, its general list locus. How- 
ever, S needs something more to indicate 
where specifically to place i+ 1. Item 
i provides this information in its cue 
capacity. 

Evidence for such a position is found in 
the intrusion data. Specifically, in Cond. 
1 and 16, intrusions decreased as distance 
from an item’s correct position increased; 
in Cond. 2 and especially Cond. 4 and 8, 
the pattern of intrusions can only be in- 
terpreted in terms of a confusion between 
an item’s correct locus and the same locus 
in other subchains. Furthermore, a re-
mote association explanation of the in- 
trusion data is not adequate because if the 
intrusions in fact were based upon remote 
associations, then the Fig. 2 curves of all 
five conditions should have been highly 
similar, or, based on contiguity of pres- 
entation, intrusions should be from within 
the same chain. But clearly, such was 
not the case. The intrusion data, there- 
fore, strongly indicate that Ss were try-
ing to place items along a dimension and 
errors resulted from confusion of relative 
dimensional positions. 

The conditional probability data may be 
interpreted as showing that list items are 
superior to Xs as cues. Thus, the rela- 
tively high correct-correct conditional 
probabilities found with adjacent items 
are taken to indicate the superiority of 
cue function of an item as opposed to X. 
Moreover, in Cond. 1 the item-X-item pro- 
cedure either did not provide an adequate 
cue for a response or, given a response, X 
did not provide adequate confirmation. A 
similar argument may be used as an in- 
terpretation for the correct-correct condi- 
tional probability data of Cond. 4 and 8 
at the first unit of subchains; i.e., on any 
given trial the first unit of a subchain was 
preceded by an X and the X was a poor 
cue.

In addition to the present findings, the 
results of four other types of experiments 
may be interpreted as supporting the 
present position. First, Slamecka (1967)
found that ordering the output of items
along a temporal dimension or along a
spatial dimension yielded equivalent re-
sults. Moreover, Slamecka argued, since
the former consisted of contiguous tem- 
poral presentation of the items and the 
latter did not, the crucial aspect of serial 
learning is placing the items along a 
dimension and not forming sequential 
associations. 

The second line of research consistent 
with the present findings is that of ex-
periments designed to manipulate tem-
poral or spatial perceptual characteristics
of the task; e.g., starting point varia-
tion retards acquisition (e.g., Ebenholtz,
1963), elimination of intertrial interval
retards acquisition (e.g., Breckenridge,
Hakes, & Young, 1965), and different
spatial displays of stimuli influence ac-
quisition (e.g., Asch, Hay, & Diamond,
1960). 

A third type of study providing evi-
dence for the importance of placing items
along a perceptual dimension is found in 
the work of Ebenholtz (1963, 1965, 1966a, 
1966b). Using numerals, colors along 
a white-gray-black dimension, lines vary- 


ing in length, or the items of a previously 
learned serial list as stimuli in a PA task, 
it was found that ease of learning in- 
creased as the dimension extremes were 
approached. 

The fourth body of literature consists 
of experiments designed to determine 
whether associations are formed between 
adjacent items in serial learning. A large 
number of Serial → PA transfer studies
(e.g., Young, 1959) have shown little or
no transfer effect, although, as Postman 
and Stark (1967) note, a small degree of 
positive transfer in early trials has been 
found consistently. Recently, two studies 
(Postman & Stark, 1967; Young, Hakes, 
& Hicks, 1967) have been reported in 
which Ss were informed of the relation 
between the serial and PA lists with the 
result that transfer effects were found in 
the respective uninstructed groups and 
such effects increased in the respective 
instructed groups. 

Two comments may be made regarding 
the Serial → PA transfer studies and, in
particular, the role of instructions in such
studies. First, if the Serial → PA design
yields positive transfer, the conclusions
to be reached are ambiguous. Specifically,
there is a difference between trying to 
determine how serial learning takes place 
and determining how the effects of serial 
learning are used. Positive transfer 
could mean that Ss were able to use in 
the PA task what they learned in the 
serial task by some process other than 
chaining. Moreover, the previously men- 
tioned findings with respect to instruc- 
tions could readily be interpreted as show- 
ing that Ss did not in fact learn the serial 
task by chaining because if they did, there 
really should be no difference between the 
instructed and uninstructed experimental 
groups.

The second comment refers to an issue 
raised by Postman and Stark, “the ques- 
tion arises of whether a score much closer 
to 100% should not be expected on the 
assumption of association chaining [p. 
351].” The results and interpretation of 
the present experiment provide an alterna- 
tive reason to the three offered by Post- 
man and Stark for the lack of 100% 
transfer in the Serial → PA design. Spe-
cifically, when an item, i, is considered as
a cue and one part of a compound stim- 
ulus, and not as analogous to the single 
stimulus of a paired associate, the degree 
of transfer likely would not be great be- 
cause the other important component of 
the compound stimulus, the list locus of 
the next item, is not present in the PA 
task. Such a view likely may be studied 
experimentally by using serial learning 
procedures analogous to the PA stimulus 
selection studies (Brown & Rubin, 1967). 

Finally, two comments are offered in 
relation to other work. First, normative 
associative chains (e.g., Diethorn & Voss, 
1967; Saufley, 1967) may facilitate ac-
quisition in that each item of a normative
chain has a greater cue value and
likely is virtually a sufficient cue for oc-
currence of the i + 1 response. List locus
of i + 1 thus should be relatively less im-
portant as a component and results do
suggest such findings (Diethorn & Voss,
1967). Second, it is possible, especially 
in a difficult section of the list, that pre- 
ceding items of a list, i.e., i, i − 1, i − 2,
etc., may constitute the cue rather than 
only the ith item (Horowitz & Izawa, 
1963). 

OPERANT CONDITIONING OF GSR AMPLITUDE ¹

An experimental group of 40 Ss was given a 5¢-associated light
as positive reinforcement contingent on the amplitude of a GSR fol-
lowing a lever press. A corresponding group of 40 control Ss had
their reinforcement schedules yoked to that of their experimental
partners. The Ss were told that reinforcement was dependent on the
accuracy of their estimate of the conclusion of a 1-min. interval,
as indicated by their lever press. Mean GSR amplitude following
the lever press was significantly greater for the experimental (con-
tingent-reinforcement) Ss than for the noncontingent controls, indi-
cating successful operant conditioning of a nonspontaneous autonomic
response.


Contrary to theoretical predictions 
(Keller & Schoenfeld, 1950; Kimble, 
1961; Konorski & Miller, 1937), suc- 
cessful attempts to modify autonomic 
behavior with operant-conditioning 
techniques have been reported in a 
number of recent investigations (e.g.,
Fowler & Kimmel, 1962; Grings & 
Carlin, 1966; Kimmel & Baxter, 1964; 
Kimmel & Hill, 1960; Kimmel & Kim- 
mel, 1963; Shearn, 1962). However, 
reports of negative findings are also 
prevalent (e.g., Mandler, Previn, & 
Kuhlman, 1962; Mednick, 1964; 
Mowrer, 1938; Stern, 1967). 

The GSR has been a frequent choice 
for investigation. In many of the 
studies, the procedure has been to rein-
force spontaneously occurring re-
sponses, with rate of response (per unit
time) as the measure of conditioning. 
The results obtained with this pro- 
cedure, even when reported as demon- 
strating successful modification of the
autonomic response, vary considerably 
from those generally observed when re- 

1 This research was conducted at Indiana 
University, Department of Psychology, and 
supported by Contract Nonr (15), Office of 
Naval Research, Roger W. Russell, project 
director. The authors are indebted to R. W. 
Russell for helpful advice and to other
project members for aid in data reduction
and analysis. 


inforcement is made contingent upon 
skeletal muscular responses. Thus, 
Kimmel and Hill (1960) only found a 
significant increase in response rate in 
the interval immediately following ces- 
sation of reinforcement (early extinc- 
tion) rather than the period during 
which reinforcements were delivered. 
Two studies have been reported in 
which the responses to be reinforced 
were nonspontaneous. Kimmel and 
Baxter (1964), employing a classical 
avoidance technique in which occur- 
rence of a response during the CS in- 
terval resulted in the nonoccurrence of 
the shock UCS, reported that there was 
a significant increase in average GSR 
amplitude for both experimental and 
yoked-control groups. Further, the 
average amplitude of response for the 
experimental group was found to be 
significantly greater than that of the 
controls. Grings and Carlin (1966), 
using a similar technique, found the ex- 
pected increase in frequency of re- 
sponse to the CS when production of 
the response resulted in avoidance of 
shock, and also found that this fre- 
quency decreased when avoidance was 
made contingent on nonresponse. 
Paradoxically, however, these investi- 
gators observed a steady decline in the 
amplitude of the GSR over the trials 
in which avoidance of shock was con- 
tingent upon the production of a re- 
sponse, although there was an even 
greater reduction in amplitude for trials 
in which nonresponse resulted in avoid- 
ance. 

The type of stimuli employed in some 
of the previously mentioned studies as 
“positive” and “negative” reinforcers 
appears to contribute to the difficulties 
of interpretation and replication. Kim- 
mel and Hill (1960) chose “pleasant” 
and “unpleasant” odors as opposite re- 
inforcing stimuli, but reported that no 
differentiation could be made between 
them in their effects on the dependent 
autonomic variables. Kimmel and his 
associates have also employed dim 
lights as positive reinforcers for auto- 
nomic activity. However, the appeti- 
tive nature of these stimuli is not ob- 
viously apparent, and other investiga- 
tors using them have failed to obtain 
conditioning (e.g., Stern, 1967). 

The purpose of the present study was 
to examine the effect of positive rein- 
forcement on the amplitude of the GSR 
in a situation where the reinforcer was 
clearly rewarding and where GSRs 
were reliably produced, but without re- 
course to a noxious UCS. In a pre- 
vious study (Doehring, Helmer, & 
Fuller, 1964) it was found that there 
was a high probability that a GSR 
would occur in the interval immedi- 
ately following a lever press made by 
Ss to indicate their estimate of the con- 
clusion of a time interval. In the cur- 
rent study, positive (monetary) rein- 
forcement was made contingent upon 
the amplitude of this GSR. The tech- 
nique also gave Ss an interesting task 
and provided E with discrete and con- 
veniently spaced trials. 


METHOD 


Subjects—The Ss were 83 volunteer un- 
dergraduates taking courses in the Depart- 
ment of Psychology at Indiana University. 

Apparatus—The GSR was recorded on an 
Esterline-Angus recti-graph driven by a 
Fels Dermohmmeter (Model 22A, Yellow 
Springs Instrument Co.). A current of 70 
μa. dc was imposed across zinc-manganese- 
dioxide electrodes (Davis, Siddons, & Stout, 
1954) attached to S’s left palm and upper 
arm. Contact with the palmar surface was 
made through a 5% saline-soaked cotton ball 
of approximately 1 sq. in., while the refer- 
ence electrode on the upper arm was placed 
over a saline-soaked rubber sponge of ap- 
proximately 12 sq. in. 

The Ss were seated in a large, comfortable 
chair in an electrically shielded, sound-in- 
sulated, and air-conditioned room adjacent 
and connected by an intercom to the con- 
trol room in which E and the recording 
equipment were located. A masking white 
noise at 60 db. SPL was delivered to S 
through earphones throughout the session. 
The E was able to observe S by means of 
an inspection port located in the wall of the 
testing chamber. 

A manually operated switching circuit al- 
lowed E to indicate trial onset, reinforce- 
ment, and nonreinforcement by turning on, 
respectively, a small yellow, green, and red 
light. The lights were mounted on the wall 
of S’s room about 4 ft. in front of S, and 
the onset of any light activated an event-re- 
corder pen in the margin of the GSR record. 
A telegraph key mounted on the right arm 
of S’s chair served as the lever, and each 
press was both indicated by a light in E’s 
room and recorded by the event-marker pen. 

Procedure—All Ss were given misleading 
instructions which stated that E’s purpose 
was to determine how well S could estimate 
the passage of time without mechanical aids. 
The Ss were asked to remove all timepieces, 
and were told that the electrodes would be 
used to measure the subjective moment of 
decision to respond to the estimated end of 
each time interval. The Ss were then told 
that their task was to estimate the passing 
of a number of 1-min. intervals, the esti- 
mate for each interval (trial) beginning at 
the onset of the yellow light and terminating 
when S pressed the lever. Correct and in- 
correct estimates would be followed within a 
few seconds by green and red lights, respec- 
tively, and each green-light onset would be 
worth 5¢ to S. The Ss were asked to keep 
their left hands and arms as relaxed as pos- 
sible, but were also told to keep awake in 
order to watch the lights. Either excessive 
movement or going to sleep was to result 
in the session being terminated and S being 
paid only $1.00. After E left S’s room, a 
period of 5 min. was allowed for adaptation 
and stabilization of the GSR before the first 
yellow light was given. The first 3 of the 
30 trials were reinforced (green light) for 
all Ss, and the interval between one trial 
and the beginning of the next was varied 
unsystematically over 10-45 sec. The session 
was ended before the thirtieth trial for three 
Ss, who were eliminated for one of the 
following reasons: (a) excessive movement, 
(b) sleep, or (c) noncooperation, as 
evidenced by consistently repeated time esti- 
mates of less than 20 sec. 

For the 27 trials after the first 3, half 
the remaining Ss (experimentals) were re- 
inforced (green light) if their GSR ampli- 
tude following a lever press reached a cri- 
terion, while smaller GSRs were not rein- 
forced (red light). The criterion for each 
experimental (contingent-reinforcement) S 
was half his mean response size (in ohms) 
on the first 3 trials, where the response was 
a GSR initiated within 6 sec. following a 
lever press. The decision concerning rein- 
forcement was made by E from 2 to 10 sec. 
following the lever press, and time-estima- 
tion accuracy, therefore, did not determine 
reinforcement. 
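
The reinforcement rule for the experimental Ss can be sketched in a few lines of code. This is only an illustration: in the study E made each decision by hand from the GSR record, and the function names and the sample amplitudes below are hypothetical.

```python
from statistics import mean


def criterion_from_baseline(first_three_ohms):
    """Criterion = half the mean GSR size (ohms) over the 3 initial trials."""
    if len(first_three_ohms) != 3:
        raise ValueError("expected exactly 3 baseline responses")
    return 0.5 * mean(first_three_ohms)


def light_for_trial(gsr_ohms, criterion):
    """Return 'green' (reinforced) if the GSR reached criterion, else 'red'."""
    return "green" if gsr_ohms >= criterion else "red"


# Hypothetical amplitudes (ohms), for illustration only.
baseline = [900.0, 1100.0, 1000.0]
crit = criterion_from_baseline(baseline)
lights = [light_for_trial(a, crit) for a in (650.0, 300.0, 500.0)]
```

Treating a response exactly equal to the criterion as reinforced is an assumption; the text says only that the amplitude had to "reach" the criterion.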

Reinforcement for the remaining, con- 
trol Ss (noncontingent reinforcement) was 
determined by yoking each S to the rein- 
forcement schedule of an experimental S. 
The pairings were determined by order of 
arrival, the first and second arrival of each 
pair being assigned, respectively, to the con- 
tingent and noncontingent conditions. The 
yoking equated not only the number of rein- 
forced trials, but also their position in the 
series. 
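
A minimal sketch of the pairing and yoking just described, assuming Ss are identified by hypothetical labels in order of arrival. The point it illustrates is that the control S's lights are copied trial by trial from the partner and never depend on the control S's own GSR.

```python
def assign_conditions(arrival_order):
    """Pair Ss by arrival: (contingent S, noncontingent S) for each pair."""
    pairs = []
    for i in range(0, len(arrival_order) - 1, 2):
        pairs.append((arrival_order[i], arrival_order[i + 1]))
    return pairs


def yoked_schedule(experimental_lights):
    """Control S gets the same light on the same trial as the partner."""
    return list(experimental_lights)


# Hypothetical S labels and lights, for illustration only.
pairs = assign_conditions(["S01", "S02", "S03", "S04"])
exp_lights = ["green", "green", "green", "red", "green"]
ctl_lights = yoked_schedule(exp_lights)
```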


RESULTS 


The GSR following any pen-re- 
corded event (lever press or light) was 
taken as the difference between the be- 
ginning and end of the first continuous 
decrease in resistance, provided this 
began within 6 sec. after event onset. 
The criterion for a nonzero GSR 
varied between 50 and 500 ohms ac- 
cording to the sensitivity settings on the 
dermohmmeter, which were in turn de- 
pendent on S’s basal resistance level. 
All nonzero GSRs were transformed 
into conductance change (ΔC) in 
micromhos. To control for changes 
in electrodermal reactivity over trials 
(habituation and/or sensitization), as 
well as for between-Ss reactivity dif- 
ferences, the conductance change fol- 
lowing each yellow light (ΔC_yl) was 
subtracted from that following the lever 
press (ΔC_lp). 
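
The scoring can be illustrated as follows. The sketch assumes that each conductance change is computed from the resistance at the start and at the end of the first continuous drop, using the physical relation that conductance in micromhos is 1,000,000 divided by resistance in ohms; the report does not spell out this step. The function names and resistance values are hypothetical.

```python
def conductance_change_micromhos(r_start_ohms, r_end_ohms):
    """Conductance change for a resistance drop from r_start to r_end.

    Conductance in micromhos is 1e6 / resistance in ohms.
    """
    return 1e6 / r_end_ohms - 1e6 / r_start_ohms


def adjusted_score(lever_press, yellow_light):
    """Lever-press response minus yellow-light response on the same trial."""
    return (conductance_change_micromhos(*lever_press)
            - conductance_change_micromhos(*yellow_light))


# Hypothetical (start, end) resistances in ohms, for illustration only.
dc_lp = conductance_change_micromhos(50_000, 40_000)
dc_yl = conductance_change_micromhos(50_000, 48_000)
score = adjusted_score((50_000, 40_000), (50_000, 48_000))
```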

Figure 1 shows the mean conductance- 
change differences (ΔC_lp − ΔC_yl) of 
the experimental (contingent-reinforce- 
ment) and control (noncontingent-re- 
inforcement) groups on trials following 
reinforced (r) trials and those follow- 
ing nonreinforced (nr) trials. Be- 
cause the score distributions were 
markedly skewed, nonparametric tests 
were used to evaluate statistical effects. 
As Fig. 1 indicates, performance on nr 
trials in both groups appeared to be 
more variable than that on r trials, so 
the effect of reinforcement contingency 
was first tested by comparing the per- 
formance of the two groups on r 
trials only. The Mann-Whitney U 
test (Siegel, 1956) indicated signifi- 
cantly superior performance of the ex- 
perimentals relative to the controls, 
z = 2.01, p < .05. The more conven- 
tional between-groups comparison in- 
volving performance on all trials 
yielded a reduced, though still sig- 
nificant, superiority for the experi- 
mentals over the controls, z = 1.77, 
p < .05. The effect of reinforcement 
itself was determined by comparing the 
response on r trials with the response 
on nr trials separately within each 
group. Wilcoxon matched-pairs 
signed-ranks tests indicated that per- 
formance on r trials was significantly 
superior to that on nr trials for the 
experimental group, z = 2.13, p < .02, 
but for the controls the (noncontin- 
gent) reinforcement effect did not ap- 
proach significance, z = .685, p > .20. 
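
For readers unfamiliar with the z values reported for the Mann-Whitney test, the large-sample approximation can be sketched as below. This is a generic textbook form without a correction for ties; whether it matches the exact computation used here is an assumption, and the sample scores are hypothetical.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def mann_whitney_z(group_a, group_b):
    """Large-sample z for the Mann-Whitney U test (no tie correction)."""
    n1, n2 = len(group_a), len(group_b)
    ranks = average_ranks(list(group_a) + list(group_b))
    r1 = sum(ranks[:n1])              # rank sum of group_a
    u1 = r1 - n1 * (n1 + 1) / 2       # U statistic for group_a
    mean_u = n1 * n2 / 2
    sd_u = sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    return (u1 - mean_u) / sd_u


# Hypothetical adjusted scores for two small groups, for illustration only.
z = mann_whitney_z([0.9, 1.4, 2.1, 0.3], [0.2, 0.8, -0.1, 0.5])
```

A positive z here means that the first group tends to have the higher ranks.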

The efficacy of the time-estimation 
task in the production of responses for 
reinforcement was determined by com- 
puting the number of nonzero responses 
to the 30 lever presses in each group. 
The median number of nonzero re- 
sponses was 28 (93%) for the experi- 
mental group and 27 (90%) for the 
control group. 

Since all Ss received identical treat- 
ment on the first 3 trials (automatic re- 
inforcement), these trials could serve 
as a test for the equivalence of the two 
groups with regard to initial autonomic 
reactivity. The Mann-Whitney test 
between the groups over these trials did 
not approach significance, z = .06, p 
> .20. 

In order to test for the possibility of 
habituation and/or sensitization, the 
mean response amplitude to the yellow 
light was measured over the 30-trial 
series for both groups. The means are 
shown in Table 1 in five-trial blocks. 
Since the distribution of these scores 
was not markedly abnormal, a para- 
metric Groups × Blocks analysis of 
variance (split plot) was used, and the 
only significant effect was that of trials, 
F (5, 390) = 4.75, p < .001, with a 
monotonic decrease over blocks to the 
fourth block. 


TABLE 1 


MEAN GSR AMPLITUDES IN CONDUCTANCE- 
CHANGE UNITS (ΔC) TO YELLOW LIGHTS 
OVER TRIALS OF EXPERIMENTAL AND 
CONTROL GROUPS 


                     Blocks of 5 Trials 
Group           I      II     III    IV     V      VI 
Experimental    1.86   1.24   1.08   0.90   1.05   0.99 
Control         1.82   1.34   1.13   1.29   1.44   1.16 


Discussion 


The significantly greater mean ampli- 
tude of responding in the experimental 
(contingent-reinforcement) group rela- 
tive to the control group provides evidence 
for appetitive operant conditioning of the 
GSR. This evidence is strengthened by 
the finding that the initial response levels 
of the two groups over the first three 
trials, when the groups were treated 
identically, did not differ significantly. 
The fact that mean amplitude on trials 
following reinforcement differed signif- 
icantly from those following nonrein- 
forcement for the experimental (contin- 
gent) group, but not for the control group, 
may be taken as additional evidence for 
operant conditioning. 

The highly reliable change in response 
amplitude over the yellow-light trials 
underlines the importance of using the 
(ΔC_lp − ΔC_yl) subtraction method to 
control for changes in electrodermal reac- 
tivity over trials. Since the obtained 
change was a decline in responding 
(habituation), it is clear that the tend- 
ency of the performance curves in Fig. 1 
to rise over trials should not be inter- 
preted as an absolute increase in electro- 
dermal responding to the lever press. 
Nor should the reduction in response 
magnitude reported by Grings and Carlin 
(1966) be regarded as “paradoxical” or 
incompatible with successful operant con- 
ditioning of relative amplitude increase, 
provided that a habituation factor is 
recognized to operate during the training 
session.² 

There was no formal assessment of the 
degree to which the experimental Ss were 
aware of the true reinforcement con- 
tingencies, but the spontaneous remarks 
of all Ss suggested that they were misled 
by the time-estimation instructions. The 
questions at the end of the session in- 
dicated concern for the precise criterion 
for time-estimation accuracy employed by 
E rather than any doubts as to the 
relevance of the task. Especially if more 
systematic investigations indicate that this 
method does successfully mask the true 
contingencies, it is of obvious value for 
attacking the problem of awareness. On 
the other hand, the present results hardly 
constitute evidence for a robust condi- 
tioning phenomenon considering that 80 
Ss were needed to obtain differences at 
only marginal levels of significance. A 
within-Ss design may provide a more 
powerful way to demonstrate operant 
modification of electrodermal behavior. 
Amplitude increase and decrease, respec- 
tively, could be reinforced following two 
different lever presses by which S would 
indicate the estimated passing of two dif- 
ferent periods of time. Such a differen- 
tial response-conditioning technique might 
demonstrate with fewer Ss the conclusion 
based on the performance of the 80 Ss 
of the present experiment: that the non- 
spontaneous GSR can be operantly condi- 
tioned with an appetitive reinforcer. 


2 The subtraction of the yellow-light re- 
sponse was also designed to control for be- 
tween-Ss reactivity differences, and it will 
be noted that, in Table 1, the group ΔC_yl 
mean for the controls slightly, though non- 
significantly, exceeds that for the experi- 
mentals. That the subtraction procedure 
was important even for controlling this non- 
significant difference is indicated by the fact 
that the superiority of the experimentals 
over the controls with respect to unadjusted 
GSRs following lever presses reached only 
near significance, p < .1. 


EFFECT OF LIST ORGANIZATION ON SHORT-TERM 
PROBE RECALL 


ROBERT C. CALFEE AND RICHARD E. PETERSON 


University of Wisconsin, Madison 


Lists of 8 words, 4 words in each of 2 categories, were presented 
at a 1-sec. rate in a blocked (High) or random (Low) arrangement, 
with category names available prior to presentation or during a probe 
serial test. Control lists consisted of 8 unrelated words. Recall 
increased with list organization and category availability. Investi- 
gation of the error patterns (intra- and extracategory errors, omis- 
sions, and extralist intrusions) and serial position curves led to the 
conclusion that organization facilitated storage by providing better 
retrieval cues, by making it easier to identify the conceptual structure 
of the list, and through the storage of the structured lists as 2 
independent sublists. 


The importance of organizational fac- 
tors in the free recall of meaningful 
words has been well documented over 
the past few years. Tulving (1966) 
has argued that successive presentations 
of a list of words produce better recall 
only if S is able to augment the struc- 
ture of the list, that repetition per se 
has no effect. While this statement 
may be too strong, there is little ques- 
tion that recall of more than a few 
words usually involves a partitioning of 
the list into subsets of related items and 
that during recall S works with a small 
set of key terms which serve as re- 
trieval cues. These keys may be either 
category names or sets of associatively 
linked items (Cofer, 1965, 1966). 

There is also evidence that an organ- 
ized presentation may facilitate storage 
of information. Cofer, Bruce, and 
Reicher (1966), using a list of 40 
words consisting of 10 items in each of 
4 categories, found that blocked presen- 
tation (all items in a category presented 
contiguously) produced higher recall 
and clustering scores for high-frequency 
words than did random presentation. 
Surprisingly, blocking did not facilitate 
recall of low-frequency words. Post- 
experimental inquiry indicated that 
most Ss were able to identify all 4 
categories following either blocked or 
random presentation, so that the su- 
perior recall could not be attributed 
solely to retrieval cues. Cohen (1966) 
presented Ss a list of 70 words, 20 
categories of 3 or 4 words each, and 
found higher recall with blocked presen- 
tations. Dallett (1964) also found that 
blocked presentation of a 12-item list 
led to better recall than random presen- 
tation under standard recall conditions, 
but there was no difference if S was 
given the category names prior to 
presentation and during test. 

While study of the effects of cate- 
gorical organization has been largely 
restricted to long-term memory, the 
ability to chunk or organize unrelated 
elements is obviously important in 
short-term memory as well (e.g., Mur- 
dock, 1961). The present study inves- 
tigated the effects of categorical organi- 
zation on short-term memory for lists 
of meaningful words using a serial 
probe test. Lists of eight words were 
presented in either blocked or random 
fashion. Independent evaluation of in- 
put and output facilitation was achieved 
by varying factorially whether or not 
S was given category names prior to 
presentation of a list or during the test. 
If organizational factors operate pri- 
marily through improved retrieval, then 
availability of category names at recall 
should lead to equivalent performance 
under blocked and random presenta- 
tion. On the other hand, if an organ- 
ized list is stored more efficiently be- 
cause of the availability of category 
names during storage, then giving Ss 
the category name prior to presentation 
should lead to higher recall scores. 


METHOD 


Stimulus materials—Sixty-four lists of 
four words related by some form of cate- 
gorical communality were prepared, 29 se- 
lected from Underwood and Richardson 
(1956), 28 from Cohen, Bousfield, and Whit- 
marsh (1957), and 15 new sets. The cate- 
gories covered a variety of nominal (e.g., 
insect, tool) and adjectival (hard, round) 
classifications. All words occurred at least 
once per million words according to the 
Thorndike-Lorge (1944) count. 

Pairs of four-word categories were ran- 
domly selected to form 32 eight-word experi- 
mental lists. In addition, eight control lists 
of eight unrelated words each were con- 
structed by selecting words to provide as 
close a match as possible to the experimental 
lists on Thorndike-Lorge frequencies. 

Design and subjects—The experiment con- 
sisted essentially of a 2 × 2 × 2 factorial 
design plus a control condition. The varia- 
bles were (a) High and Low organization- 
presentation procedure (blocked or random 
in Cofer’s terminology), (b) availability of 
category name prior to presentation, and 
(c) availability of category name during 
test. Specific cells in the 2 × 2 arrangement 
of the availability variables will be denoted 
by an ordered pair of the words Yes and No, 
so that No-Yes would indicate a condition 
in which names were available during test 
but not prior to presentation. Each of the 
eight cells in the design was replicated four 
times within a session for each S, for a 
total of 32 experimental lists. In addition, 
there were 8 control lists, for a total of 40 
lists per session. 
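
The list counts follow directly from the design, as this short enumeration shows. The variable names and the tuple layout are illustrative assumptions, not the authors' notation.

```python
from itertools import product

ORGANIZATION = ("High", "Low")
NAME_PRIOR = ("Yes", "No")       # category name given before presentation
NAME_AT_TEST = ("Yes", "No")     # category name given during test
REPLICATIONS = 4                 # each cell occurs four times per session
N_CONTROL = 8                    # control lists per session

# The eight cells of the 2 x 2 x 2 factorial design.
cells = list(product(ORGANIZATION, NAME_PRIOR, NAME_AT_TEST))

# One session, before counterbalancing of order: 32 experimental + 8 control.
session = [cell for cell in cells for _ in range(REPLICATIONS)]
session += [("Control", None, None)] * N_CONTROL
```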

Four different presentation orders were 
prepared using Latin squares to counter- 
balance any temporal effects over the 40 
lists. The order in which the eight words 
in a list were tested for recall was also 
balanced with regard to the order in which 
the words had been presented by a series of 
Latin squares. 
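
The report does not say which Latin squares were used. As a sketch of the counterbalancing idea only, a cyclic square is one simple construction in which every symbol appears once in each row and once in each column; the function names are hypothetical.

```python
def cyclic_latin_square(n):
    """n x n square in which row r is 0..n-1 rotated left by r places."""
    return [[(r + c) % n for c in range(n)] for r in range(n)]


def is_latin_square(square):
    """True if every symbol occurs exactly once in each row and column."""
    n = len(square)
    symbols = set(range(n))
    rows_ok = all(len(row) == n and set(row) == symbols for row in square)
    cols_ok = all({square[r][c] for r in range(n)} == symbols
                  for c in range(n))
    return rows_ok and cols_ok


# Four presentation orders over four blocks of lists (labels 0-3).
square = cyclic_latin_square(4)
```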

High lists were presented in a 4-4 order; 
i.e., all the words in one category were 
presented, then all words in the second cate- 
gory. Low lists were presented in one of 
these arrangements: 2-3-2-1, 1-2-3-2, 2-1-2-3, 
3-2-1-2. (The last arrangement means 3 
words in Category A, 2 in B, 1 in A, and 
the last 2 in B.) These orders eliminate 
single or double alternation patterns, which 
in a pilot study appeared to be used by some 
Ss. Within these limits, arrangement of the 
eight words in a list was random. 
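
The High and Low arrangements can be expressed as run-length patterns over the two categories. The sketch below builds one list from such a pattern; the word sets are hypothetical stand-ins (the actual stimulus words are not listed here), and the function name is an assumption.

```python
import random

HIGH_PATTERN = (4, 4)
LOW_PATTERNS = [(2, 3, 2, 1), (1, 2, 3, 2), (2, 1, 2, 3), (3, 2, 1, 2)]

# Hypothetical four-word categories, for illustration only.
YELLOW = ["LEMON", "BANANA", "CANARY", "BUTTER"]
VEHICLE = ["TRUCK", "WAGON", "TAXI", "SLED"]


def build_list(cat_a, cat_b, pattern, rng):
    """Order 8 words so run lengths follow `pattern`, alternating A, B, A, B.

    Within the run-length constraint, word order is random.
    """
    a, b = list(cat_a), list(cat_b)
    rng.shuffle(a)
    rng.shuffle(b)
    pools = [a, b]
    ordered = []
    for run_index, run_length in enumerate(pattern):
        pool = pools[run_index % 2]
        for _ in range(run_length):
            ordered.append(pool.pop())
    return ordered


rng = random.Random(0)
low_list = build_list(YELLOW, VEHICLE, LOW_PATTERNS[3], rng)  # 3-2-1-2
high_list = build_list(YELLOW, VEHICLE, HIGH_PATTERN, rng)    # 4-4
```

Each Low pattern assigns exactly four positions to each category, which is why no pool is ever exhausted early.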

In addition to the 40 lists just described, 
five pretraining lists (a control list and four 
organized lists) were prepared from different 
word sets. High and Low blocking was 
used twice, and each of the category-availa- 
bility conditions once. 

The Ss were 20 students from introduc- 
tory psychology classes at the University of 
Wisconsin whose participation fulfilled class 
requirements. Each S was run through a 
different sequence on each of 2 days. 
Testing was done in groups of five Ss, and 
assignment of the groups to sequences was 
such that each of the four sequences was 
tested equally often on both days. 

Procedure—After the five Ss in a group 
had been seated and given test booklets, 
instructions were given in which Ss were 
told that they had to try to recall lists of 
eight words, that some of the lists would 
contain categorical words, and that some- 
times the category names would be given 
either before the list was presented or while 
it was being tested. The testing procedure 
was explained in some detail. Previous an- 
swers were covered, and guessing was en- 
couraged. The five pretraining lists were 
presented and there was a brief pause dur- 
ing which any questions were answered. 
Then the 40 lists were presented by a tape 
recorder without interruption. 

Each list was presented in the following 
fashion: The category name was presented 
if appropriate (e.g., “some words in the list 
pertain to something yellow, others to a 
type of vehicle”), there was a 5-sec. pause, 
and then the word list was read at a rate of 
one word every second with the serial posi- 
tion (SP) preceding, e.g., 1-BELT, 2-HAR- 
POON, 3-ROPE, 4-BATON, etc. Testing be- 
gan 1 sec. after the eighth word. The 
SP to be recalled was given with the cate- 
gory if appropriate (e.g., 2-Pointed), after 
which S had 5 sec. for recall. After the 
last test, S was told to turn to the next 
page in the answer booklet, and 5 sec. later 
the next list was begun. 


RESULTS 


Mean recall—Analyses were based 
on mean number of words correctly re- 
called for a condition over all eight lists 
per S, four in each of two sessions (16 
control lists, eight per session). Aver- 
aging over order was justified, since 
all variables were balanced for list 
order. In addition, there was no evi- 
dence of either practice or proactive 
interference effects; the mean propor- 
tion of errors in 10-list blocks was .45, 
.41, .44, and .42. 

The mean number of words recalled 
per list for each of the presentation con- 
ditions is presented in Fig. 1. High 
lists were better recalled than Low, 
F (1, 19) = 56.9, p < .001, and recall 
was facilitated if category names were 
given prior to presentation, F (1, 19) 
= 35.2, p < .001, or during test, F (1, 
19) = 23.0, p < .001. There was a 
significant interaction between list or- 
ganization and category availability 
during test, F (1, 19) = 10.6, p < .001; 
it was more helpful to have category 
names during a test of a Low list than 
a High list. There was also a tendency 
for category names prior to presenta- 
tion to improve recall more for Low 
than High lists, but the interaction was 
not statistically reliable. No other 
sources of variance in the basic 2 × 2 
× 2 design were significant. 

FIG. 1. Mean number of words correctly 
recalled from each eight-item list and mean 
number of words recalled on first test of 
eight lists. (Ordinate: mean number of words 
recalled; abscissa: category instructions, 
No-No, No-Yes, Yes-No, Yes-Yes; curves: High 
organization, Low organization, and control.) 


To complement the results above and 
allow comparison with performance on 
control lists, a second analysis of vari- 
ance was carried out based on all nine 
conditions as a single variable. A 
Duncan range test indicated that the 
conditions not connected by lines below 
differed at the .05 level of significance: 


C LNN LYN LNY HNN HNY LYY HYN HYY. 


The more interesting features of the 
range test were the following: (a) 
performance did not differ significantly 
between control and Low lists with no 
category information, but both were 
lower than any other conditions; and 
(b) recall of High lists was not sig- 
nificantly higher than recall of Low 
Yes-Yes lists. 

There was a gradual drop over test 
positions within a list in probability of 
correct recall for all conditions. In 
particular, whether or not the category 
was available during test did not have 
any noticeable effect on the decline in 
performance over tests. The absolute 
difference in recall probability between 
the first and last words tested ranged 
from .2 to .3. The effects of the or- 
ganizational variables differed to some 
extent over successive tests. Figure 1 
also shows recall performance on the 
first word tested, normalized on mean 
number correct per S (eight lists) to 
allow comparison with the other curves. 
High lists were still better remembered, 
but performance on High and Low lists 
was at roughly the same level for every 
category-availability condition except 
No-No. In this condition, Low lists 
were not recalled nearly as well as 
High lists; instead, performance was at 
about the same level as control lists. 

A more detailed picture of the effects 
of the organizational variables was ob- 
tained by classifying each error as an 
omission or as an intralist or extralist 
error. Intralist errors in organized 
lists were further divided into intra- or 
extracategory errors. (In conditions 
where the category name was available 
during test, extracategory errors were 
extremely rare, of course; a total of 
seven were found.) The mean errors 
per list falling into each of these classi- 
fications are shown in Table 1. Extra- 
list errors constituted about 10-15% of 
all errors in each condition. List dif- 
ferentiation was excellent. Less than 
5% of the extralist errors (about .5% 
of all errors) were intrusions from pre- 
vious lists. Obvious associative sub- 
stitutions accounted for 63% of the 
extralist errors, while 32% were un- 
classifiable. In control lists 51% of 
the errors were omissions, considerably 
higher both proportionately and abso- 
lutely than in High and Low lists in 
which omissions accounted for 28% 
and 31% of the errors, respectively. 
In High Yes-Yes lists, Ss recalled al- 
most seven of the eight words in the 
list, most of these in correct position. 
(Mean omissions and extralist re- 
sponses were both about .5 words per 
list.) Intracategory substitutions con- 
stituted 55% of the errors in High lists, 
but only 42% in Low lists. This differ- 
ence was maintained for all category- 
availability conditions; e.g., even in the 
Yes-Yes condition, 62% of the errors 
in High lists were intracategory, com- 
pared to 52% in Low lists. 
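
The error taxonomy used above can be stated as a small decision procedure. The sketch assumes a blank answer counts as an omission and that each list word carries a known category label; the function name and the example words are hypothetical.

```python
def classify_response(response, target, category_of):
    """Classify one probe response against the word actually presented.

    `category_of` maps every word in the presented list to its category.
    """
    if response is None or response == "":
        return "omission"
    if response == target:
        return "correct"
    if response not in category_of:
        return "extralist"          # word was not in the presented list
    if category_of[response] == category_of[target]:
        return "intracategory"      # wrong list word, same category
    return "extracategory"          # wrong list word, other category


# Hypothetical list fragment, for illustration only.
categories = {
    "LEMON": "yellow", "BANANA": "yellow",
    "TRUCK": "vehicle", "WAGON": "vehicle",
}
labels = [
    classify_response(r, "LEMON", categories)
    for r in ("LEMON", "", "BANANA", "TRUCK", "ORANGE")
]
```

For control lists, which have no categories, only the omission, intralist, and extralist distinctions apply.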

Serial position curves.—Figure 2 
presents the probability of correct re- 
call as a function of the SP of a word 
in the presentation sequence for each of 
the nine list conditions. Both analy- 
ses of variance for mean recall also 
included SP as a variable. The analy- 
sis based on eight conditions showed 
that the SP curve was different for 
High and Low lists, F (7, 133) = 
10.95, p < .001, and that there was an 
interaction between SP, organization, 
and availability of the category name 
during test, F (7, 133) = 5.46, p < 
.001. This latter interaction reflected 
the deeper bow in the curve when the 
category was not available during tests 
of Low compared with High lists. Sev- 
eral of the higher-order interactions 
between SP and list conditions were 
significant at the .05 level, but the 
effects seemed to be idiosyncratic. 


TABLE 1 


CLASSIFICATION ANALYSIS OF MEAN 
ERRORS PER LIST 


                               Type of Error 
Condition     Omission   Intracategory   Extracategory   Extralist 
High 
  Y-Y           .57         1.62          [illegible]       .43 
  Y-N           .87         1.50             .10            .38 
  N-Y           .74         1.80          [illegible]       .37 
  N-N          1.10         1.36             .20            .40 
Low 
  Y-Y           .87         1.46          [illegible]       .44 
  Y-N          1.30         1.20          [illegible]       .37 
  N-Y          1.21         2.04          [illegible]       .40 
  N-N          1.47         1.00            1.10            .46 
Control        2.25      [illegible]        1.62            .58 


FIG. 2. SP curves as a function of presen- 
tation order for each condition. (Abscissa: 
serial position.) 

The most interesting result concerns 
the difference between SP curves for 
High and Low lists. The Low curves 
displayed the typical bow, but each of 
the High curves had a hump in the 
middle due to better recall of the first 
item in the second sublist, and in some 
cases slightly better recall of the last 
item in the first sublist. This descrip- 
tion was supported by orthogonal poly- 
nomial analysis. The linear compo- 
nents were relatively small, though sig- 
nificant for Low lists, F (1, 266) = 
5.2, p < .05. The quadratic component 
accounted for most of the remaining 
variance in Low lists, F (1, 266) = 
215.5, p < .001. The residual term was 
not significant. High lists were best 
described as a concatenation of two 
four-word lists with different mean re- 
call levels (F = 8.9), significant over- 
all linear (F = 29.1) and quadratic (F 
= 67.6) components, with different 
linear slopes for the first and second 
sublists (F = 54.0); for each of these 
contrasts, df = 1, 266, and p < .001. 
The variance remaining after these ef- 
fects had been extracted from High 
lists was not significant. 
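
To make the trend analysis concrete, the standard orthogonal polynomial coefficients for eight equally spaced positions are shown below, applied to a hypothetical bowed SP curve. The recall probabilities are invented for illustration and are not taken from Fig. 2; the function name is an assumption.

```python
# Standard orthogonal polynomial coefficients for 8 equally spaced levels.
LINEAR_8 = [-7, -5, -3, -1, 1, 3, 5, 7]
QUADRATIC_8 = [7, 1, -3, -5, -5, -3, 1, 7]


def contrast(means, coeffs):
    """Weighted sum of the position means for one trend component."""
    if len(means) != len(coeffs):
        raise ValueError("means and coefficients must have equal length")
    return sum(m * c for m, c in zip(means, coeffs))


# Hypothetical symmetric bowed curve (recall probability by SP).
bowed = [0.8, 0.6, 0.5, 0.4, 0.4, 0.5, 0.6, 0.8]
linear_part = contrast(bowed, LINEAR_8)        # near zero: no overall slope
quadratic_part = contrast(bowed, QUADRATIC_8)  # positive: U-shaped bow
```

A curve with a hump in the middle, like those described for High lists, would leave systematic variance beyond these two components, which is why additional sublist contrasts were needed there.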


Discussion 


In this study of short-term serial mem- 
ory, probe serial recall of a list of words 
was better if the words were presented in 
a highly organized fashion, blocked ac- 
cording to categories. Recall of lists that 
were not so organized was better if Ss 
were given category names prior to pres- 
entation or during testing. The result 
suggests that in the highly organized lists 
Ss were more likely to detect the category 
names during presentation and were able 
to use these names as retrieval cues dur- 
ing recall. To the extent that this inter- 
pretation is correct, within time intervals 
characteristic of short-term memory, Ss 
identified the conceptual class of the stim- 
ulus words and used this information to- 
gether with list structure to improve re- 
call. Organizational variables which have 
been shown to be important in more typ- 
ical free-recall studies (i.e., long-term 
memory) also affected short-term memory. 

However, in addition to retrieval ef- 
fects, several results point to superior 
storage of words in more organized lists. 
First, there is uniformly higher recall of 
items for High lists under all conditions 
of category availability. The superior re- 
call of High lists in the No-Yes condition 
is especially relevant since, on the face of 
it, retrieval cues were identical for both 
types of lists. Second, consider the avail- 
ability of words as reflected in number of 
omissions and extralist intrusions. There 
were fewer omissions in High lists than 
Low for all comparable category-avail- 
ability conditions, and the proportion of 
omissions was substantially higher in con- 
trol lists than in the worst case of cate- 
gorized lists, namely, Low lists with no 
category information. Availability of 
category names prior to presentation gen- 
erally led to fewer omissions. One might 
argue that the labels were retained dur- 
ing the presentation of the list and served 
primarily to facilitate retrieval. How- 
ever, the fact that recall was highest in 
the Yes-Yes condition suggests that cate- 
gory availability prior to presentation af- 
fected storage as well as (or instead of) 
retrieval. Incidentally, regarding retrieval 
cues, the authors were surprised to find 
that category information during test had 
no effect on extralist intrusions. When 
a category label was given during a probe 
and no suitable word was at hand in 
short-term memory, Ss might have 
adopted a strategy of guessing on the 
basis of subordinate associates, leading 
to a greater number of extralist errors.
The majority of the extralist intrusions 
were obvious associations to items in the 
list being tested, rather than intrusions 
from preceding lists as Loess (1964) 
found.

The shape of the SP curves for the 
High lists also indicated that storage was 
a function of list structure. For all
category-availability conditions, the High
curves resembled two four-word serial 
recall curves, each with its own primacy 
and recency effect. Primacy and recency 
effects have been attributed, respectively, 
to the opportunity for greater rehearsal 
of the first items in a list, and a reduction 
in interference between presentation and 
test of the last items. The occurrence of 
recency in the first sublist and primacy in 
the second sublist would seem to indicate 
that the two sublists were stored inde- 
pendently, so that rehearsal of items in 
one list produced little forgetting in the 
other list. Shorter lists are generally for- 
gotten at a slower rate (e.g., Atkinson & 
Shiffrin, 1968); hence recall should improve
with independent storage. There
are undoubtedly limits on the extent to
which such independent storage is possible.
For example, Cohen (1966) and
Tulving (1966) both presented evidence
that as the number of four-item categories
in a free-recall list is increased, the
number of words recalled per category
decreases. 

The processing of the conceptually re- 
lated lists used in this study may con- 
stitute a type of chunking. At certain 
levels, it seems almost impossible to prevent
chunking; e.g., when high-association
nonsense syllables are used as
stimuli, Ss appear to encode them as 
words. At higher levels, such as the ab- 
stract categorical structures used in this 
study, it may be necessary to point out to 
S the underlying organization (cf. Cofer,
1965, on the role of set in producing 
clustering in free recall). In the present 
study, the set was induced both by in- 
structions and by the mixed-list design. 
Once S has made arrangements, so to 
speak, to analyze incoming words for 
category membership, retrieval of con- 
ceptual and associative information about 
an item from long-term memory and the 
processing of this information to deter- 
mine the list structure are operations that 
can be carried out rapidly, at rates of an 
item every second or so. If S does have 
at his disposal a variety of more or less 
complex processing strategies, then any 
model of the operation of human memory 
based on a fixed processing sequence will 
eventually prove inadequate for many 
purposes. For example, acoustic or 
(more properly) phonetic confusion er- 
rors have been observed in the recall of 
letters and words over a period of several 
seconds. This finding has been taken to 
mean that the initial encoding and storage 
of items for a few seconds is phonetic. 
While the present study does not bar the 
possibility of short-term phonetic storage 
in some circumstances, it does suggest 
that other types of encoding operations 
may also occur, and that these operations 
come into play if S is properly set and if 
it is to his advantage to use them. These 
last remarks bear on the processing of 
structured word lists more generally, in 
particular the comprehension of sentences.
CONTEXTUAL STIMULI AND PROACTIVE INHIBITION


KENT DALLETT AND SANDRA G. WILCOX
A special box was constructed making it possible for Ss to learn and
recall in either of 2 distinctive environments, the other environment 
being a conventional memory drum. 3 experiments were conducted, 
in each of which Ss served for several daily sessions, each day learning 
2 lists of 25 words by serial anticipation, after being tested by free 
recall for the 2 lists learned the day before. In each experiment, 4 of 
the Ss were subjected to a change in environment during the experi- 
ment, and the change was maintained until the end of the experiment. 


Recall became worse with successive days. A change of context
improved recall or retarded its decline.


When human forgetting is studied 
in the laboratory, proactive inhibition 
(PI) has turned out to be one of the 
most powerful determinants of the 
amount forgotten. As Underwood 
(1957) first pointed out, an S who is 
tested 24 hr. after learning his first 
list of verbal materials remembers 70-
80% of the items correctly, while an
experienced S who has learned more 
than six such lists will remember only 
20-30%. The most common explanation
of PI is that during the retention
interval the traces of recently learned 
materials are somehow interfered with 
by the traces of materials learned pre- 
viously. However, this explanation 
forces one to ask why it is that the
effects of a few lists of words learned 
in the laboratory are not masked or 
dwarfed by the cumulative effects of 
many years of preexperimental learn- 
ing. Evidently, if it is “prior learning” 
which is responsible for PI, one must 
also suppose some selector which re- 
duces or eliminates interference from 
materials previously learned in differ- 
ent settings, without preventing the 
interference which results from prior 


This investigation was supported by Pub-
learning in the same setting. Such
situational dependence has been dem- 
onstrated in experiments on retroactive 
inhibition in which interference comes 
from subsequent rather than from prior 
learning (Bilodeau & Schlosberg, 1951 ; 
Greenspoon & Ranyard, 1957). We 
have attempted to show this sort of 
situational dependence in PI. 


METHOD 


Changing contextual stimuli—In order to 
produce a rapid and thorough change of con- 
textual stimuli, a special box was con- 
structed. The S could then be tested in one 
of two environments: seated before a conventional
memory drum, or standing with
his head in the box. These environments 
are designated box and drum. In the drum 
environment, S sat before a black plywood 
partition mounted on a wooden table, which 
displayed the face of a Stowe memory drum. 
Behind partition and drum sat the female E 
whom S had encountered on first entering 
the experimental room. The E was out of 
sight, but S knew of E’s presence and had 
just been given instructions by E. The 
learning materials appeared, typewritten, in 
the window of the memory drum. In the 
box environment, S stood with his head in- 
side a large box, his neck in a foam-rubber- 
padded U-shaped cutout in the floor of the 
box, with a curtain to eliminate peripheral 
stimuli behind him. The room was darkened, 
and the interior of the box lighted with 
flashing red and green lights. The inner 
dimensions of the box were about 3 ft. × 3 ft.
at the end where S’s head was, and tapered to
about 1 ft. × 2 ft. approximately 33 ft. in
front of S’s head. None of the walls were 
parallel. The inside of the box was painted 
white, with green and black lines added, con- 
verging to a false vanishing point which did 
not coincide with the perspective of the 
walls. Some of these painted lines were 
“connected” to black strings hung across the
interior of the box. Half of one wall was 
covered by furry red patches made of a 
nylon bathmat. After constructing the box 
and using it, it seemed to Es that a great 
deal of its effect came from the changing 
illumination; a red 40-w. bulb flashed at a
rate of approximately 80/min, while a green 
40-w. bulb flashed at approximately 18/min. 
These were the only lights in the box. The 
Ss generally agreed that the box was highly 
unusual, and on two occasions Ss had to be 
excused because of nausea. To further in- 
crease the difference between the box and 
drum environments, S wore earphones in the 
box and heard instructions and learning 
materials given by a male E via tape recording.
The two environments differed in
visual stimuli, in method of presentation of 
the learning materials, in S’s posture, in 
auditory stimuli, and in E. (Although while 
in the box Ss undoubtedly realized that the 
“live” E was somewhere nearby.) The 
change could be brought about in 1 min. or 
less: From the drum S was taken behind a
plywood screen and given earphones, his head 
was inserted in the box, the height of the 
box was adjusted, room lights were ex- 
tinguished, and the appropriate instruction 
tape started. Instructions and lists were 
recorded on a Uher tape recorder at a speed 
of 15/16 ips and were stored in the form 
of tape loops, making it possible to change 
materials quickly without rewinding. 

A validation study—To determine whether
the box environment was as effective a 
means of stimulus change as the separate 
rooms used by Greenspoon and Ranyard 
(1957), an attempt was made to replicate 
their Cond. “AAA” and “ABA” which involved
the learning of two lists of CVC nonsense
syllables, with recall of the first list
under the conditions of first-list learning. 
The two groups differed in that Group AAA 
had the same environment throughout the 
experiment, while Group ABA had the second
(IL) list in a changed context. We
used Greenspoon and Ranyard’s materials
and, as far as possible, duplicated their
experiment with the box and drum environments.
Since the box involved aural presentation,
and the CVCs took some time to
spell, a 3-sec. rate of presentation was used
on the drum and a 4-sec. rate in the box.
Two minutes transpired between original
learning (OL) and interference learning
(IL), with 3 min. between IL and recall.
The Ss were 24 students who were recruited 
from classes and by notices on bulletin boards, 
and were paid $1.50. The learning data 
showed no differences aside from a significant 
practice effect. In recall, the two groups 
differed significantly, F (1, 20) = 14.14, p <
.001, and the mean recall scores were comparable
to those of Greenspoon and Ranyard
(1957). Their ABA and AAA groups re- 
called 7.1 and 3.5 items, respectively, while 
the corresponding groups in the present study 
recalled 7.7 and 5.1 items. Their degree of 
stimulus change may have been slightly more 
effective than ours, and, of course, the effec- 
tiveness of contextual change might vary 
with different materials and procedures. 
Materials—For the PI experiments, the 
learning materials were 10 serial lists of 25 
nouns, all from the A or AA classes of the 
Thorndike-Lorge (1944) frequency count. 
Serial lists were used because it is easier to 
present serial lists aurally, and also because 
the unclear nature of the focal stimulus in 
serial learning might indicate that serial 
learning is more dependent on context than 
is, say, paired-associate learning. The lists 
were always presented at a rate of 3 sec/ 
item with 3 sec. between repetitions of the 
list. A twenty-sixth item, “start,” was added 
to the beginning of each presentation. In 
each experiment the lists were used in such 
a way that no S in a changed or unchanged 
group had the same sequence of lists as 
another S in the same group, while he had a 
sequence which was identical to a sequence 
learned by an S in the other group. 
Subjects—The Ss were recruited from 
classes or by notices on bulletin boards and 
were paid $2.00 an hour for their participa- 
tion. They were always assigned to condi- 
tions by prearranged schedule. If the first list 
required more than 30 trials to learn, S was 
replaced. Three Ss in the 4-day experiment 
and four in the 3-day experiment were re- 
placed for exceeding this cutoff. Six of 
these learned the failed list on the drum. 
In addition, one S was replaced in the 4-day
experiment who learned the first list within 
the 30-trial limit, but had not mastered the 
second list by the end of the hour. In the 
4-day experiment, which was run first, two 
additional Ss were replaced who showed no 
PI on Day 2. This was not done in the 3-day
experiment. In the final 5-day experiment
the criterion of learning was reduced, and 
no Ss had to be replaced. 

Procedure and design—When S appeared 
for his first session, he was greeted by a 
female E (S. G. Wilcox) and installed in the 
box or at the drum. If he was assigned to 
the drum, the instructions were given by the 
live E, while if he was in the box, all of his 
instructions were delivered by the male E 
on tape. The first list was learned, and
after 1 min. the second list was learned in 
the same setting. When S returned for the 
second and subsequent sessions, he was first 
placed in the environment of the previous 
day’s learning and asked to give all the 
words he remembered from both lists, in any 
order. Four minutes were allowed for this. 
Then S was kept in the same setting or 
changed to a different one for the learning 
of two more lists. The final session, of
course, consisted only of the recall test. 

There were three experiments. The first 
was a 4-day experiment, with each S learn- 
ing six lists. Half the Ss had a change of 
environment before starting the third day’s 
learning of Lists 5 and 6, and returned to 
the new environment for recall on Day 4. 
The design involved two starting locations 
(box, drum) and change vs. no change.
These were factorially combined with 5 Ss 
in each of the four cells. The second ex- 
periment was a similar 3-day experiment, 
with change introduced on Day 2. The 
design comprised four groups of six Ss. In
each of these experiments, learning was to 
a criterion of one perfect trial. The final 
experiment lasted 5 days, half the Ss 
undergoing change on Day 3, half on Day 4. 
The Ss changed on Day 3 continued in their 
new setting until the end of the experiment. 
Four groups of three Ss were used. For 
this experiment the learning criterion was 
reduced to 22 correctly anticipated items on 
a single trial. 
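
The three designs just described can be tabulated in a few lines. The following is only an illustrative sketch: the class and field names are hypothetical, and the rule that two lists are learned on every day except the final recall-only day is taken from the Procedure paragraph above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Design:
    """One PI experiment, as summarized in the Procedure and design section."""
    name: str
    days: int             # total sessions, including the final recall-only day
    change_days: tuple    # day(s) on which a changed group switched environments
    n_groups: int
    ss_per_group: int

    @property
    def lists_learned(self) -> int:
        # Two lists are learned on every day except the last (recall only).
        return 2 * (self.days - 1)

    @property
    def n_subjects(self) -> int:
        return self.n_groups * self.ss_per_group


# Values as stated in the text; the names are illustrative only.
DESIGNS = [
    Design("4-day", days=4, change_days=(3,), n_groups=4, ss_per_group=5),
    Design("3-day", days=3, change_days=(2,), n_groups=4, ss_per_group=6),
    Design("5-day", days=5, change_days=(3, 4), n_groups=4, ss_per_group=3),
]


def summary():
    """Map each experiment to (lists learned per S, total number of Ss)."""
    return {d.name: (d.lists_learned, d.n_subjects) for d in DESIGNS}
```

Under these assumptions the 4-day experiment comes out at six lists per S, which matches the figure given in the text.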


RESULTS AND DISCUSSION


For each of the three experiments, an 
analysis of variance of trials to cri- 
terion in learning was carried out. The 
results were the same in each case; 
there was no significant overall differ- 
ence between groups which underwent 
a change and groups which did not; 
there was a significant effect of days 
which reflected the usual improvement 
in learning with successive days of 
practice, and there was no interaction
of groups with days in learning, indi- 
cating that the introduction of a change 
of environment did not significantly re- 
duce Ss’ capacity for learning. Al- 
though learning scores within days were 
not analyzed separately, the data clearly 
show a within-day improvement in 
learning which is partly lost between 
days; this may be a practice effect 
which is easily forgotten or it may re- 
sult from what is called “warm-up.” 
Trials to criterion for each list are 
presented in Fig. 1. Starting locations 
(box, drum) did not lead to significant 
or consistent differences in learning or 
in recall, and will not be discussed: only
the data for change vs. no change are 
presented. 

Although change of context did not 
produce significant differences in learn- 
ing, it did significantly affect recall in 
the 4-day and 5-day experiments. In 
the 4-day experiment, there was no 


FIG. 1. Trials to criterion in learning for
the 3-day, 4-day, and 5-day experiments (top
to bottom). Ordinate: mean trials to criterion;
abscissa: lists, grouped by day.
significant overall difference between
groups, F (1, 18) = .22, a significant 
decrease in recall with successive days, 
F (2, 36) = 22.09, p < .01, and a sig- 
nificant interaction of Groups X Days, 
F (2, 36) = 3.18, p < .05. The nature 
of the interaction is revealed in Fig. 2, 
where it will be seen that after the 
change was introduced recall improved. 
There were some features of the data 
which made it desirable to experiment 
further, however. First, it will be seen 
that the data for Day 1 display a curi- 
ous interaction between groups and 
lists. This happened in neither of the
subsequent two experiments. Also, it 
will be seen that the unchanged group 
increases in recall after the change, 
albeit slightly. This might have been 
due to Ss’ anticipating the end of the 
experiment, but it might also have re- 
sulted from the fact that two Ss who 
showed no PI in recalling the lists 
learned on Day 2 were replaced. In 
the other two experiments, terminal 
increases in performance were not seen. 

In the 3-day experiment, change was 
introduced on the second day. The 
only significant difference in recall was 
FIG. 2. Mean number recalled in the
4-day experiment.

FIG. 3. Mean number recalled in the 3-day
experiment (filled circles) and in the 5-day
experiment (open circles).


an effect of days, F (1, 22) = 24.51, p 
< .01, while the overall difference be- 
tween groups was not significant, F (1, 
22) = 1.44, nor was the interaction, 
F (1, 22) = 1.19. Mean recall scores
appear in Fig. 3. This outcome might 
indicate that a change in environment 
is ineffective in reducing cumulative PI 
if it is introduced too soon. Conse- 
quently, the 5-day experiment involved 
two groups of six Ss each, one group 
undergoing a change on Day 3, while 
the other had its change on Day 4. The 
analysis of variance showed no over- 
all differences between groups, F (1, 
10) = .001, a significant decrease in 
recall with successive days, F (3, 30) 
= 22.12, p<.01, and a significant 
interaction of Groups X Days, F (3, 
30) = 3.40, p<.05. Recall scores 
for each list are presented in Fig. 3. 
While change does not always produce 
an absolute improvement, it does lead 
to recall which is consistently better 
than the recall of the unchanged Ss. 

How did the contextual change reduce
PI? One possibility is suggested
by a version of interference theory 
which maintains that PI is the direct 
result of some form of experimental
extinction which takes place during the 
learning of material whose retention is 
later interfered with by recovery of the 
extinguished associations. From this 
one might deduce that following a 
change of context, fewer associations 
from prior learning would intrude and 
consequently fewer would be extin- 
guished, leading to a reduction of the 
interference which results from recov- 
ery. Unfortunately, there are too few 
overt errors to allow this hypothesis to 
be tested directly. Combining the data 
for all days and for all experiments, 
there were only 18 interlist intrusions 
from the same day’s prior list, and 16 
intrusions from previous days. In con- 
trast, there were 195 intrusions of 
words which had appeared on none of 
the lists, and 5,393 errors which con- 
sisted of misplaced words from the 
same (current) list. Errors in recall, 
however, seem to confirm the supposi- 
tion that interference from prior lists 
was reduced by change of context. The 
errors made in recalling the two lists 
immediately after change were totaled 
for all Ss in all experiments. These 
totals are based on 34 Ss per condition, 
since Ss of the 5-day experiment are 
counted twice. For changed conditions, 
there were 54 errors from the previous 
day’s lists, as compared to 111 such 
errors in the unchanged conditions. 
The error scores were quite variable: 
many Ss gave none, while each of the 
totals includes a different single S who 
gave more than 20 errors from one list, 
perhaps as a result of an error in list 
identification. From more remotely 
prior lists, changed and unchanged Ss 
gave only 9 and 8 errors, respectively, 
while extraexperimental intrusions 
yielded respective totals of 25 and 12. 
The error totals in recall, of course, 
have little bearing on the recovery hy- 
pothesis, but they are consistent with 
the data on correctly recalled items in 


suggesting that PI has been reduced 
by contextual change. 
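
The recall-error totals quoted above can be put on a per-S basis with a short calculation. This is a sketch only: the variable names are hypothetical, and the breakdown of the 34 Ss per condition into 10 (4-day), 12 (3-day), and 6 counted twice (5-day) is an inference from the design description that happens to reproduce the stated total.

```python
# Recall-error totals for the two lists learned immediately after the change,
# as reported in the text: (changed conditions, unchanged conditions).
ERRORS = {
    "previous day's lists": (54, 111),
    "more remote prior lists": (9, 8),
    "extraexperimental intrusions": (25, 12),
}

# Assumed breakdown of Ss per condition: 4-day (10) + 3-day (12)
# + 5-day (6 Ss, counted twice). The text states only the total of 34.
SS_PER_CONDITION = 10 + 12 + 2 * 6


def errors_per_subject(source):
    """Return mean errors per S as (changed, unchanged) for one error source."""
    changed, unchanged = ERRORS[source]
    return changed / SS_PER_CONDITION, unchanged / SS_PER_CONDITION
```

Because each total includes one S who gave more than 20 errors from a single list, these means are heavily influenced by a single case and should be read with that in mind.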

With respect to the possibility that 
prior learning is simply not available 
after contextual change, and hence can- 
not interfere with retention, data are 
available from a supplementary experi- 
ment which fail to show that change 
of context reduces the conventionally 
tested retention of a single list. Four 
groups of 10 Ss each learned one list 
of 20 adjectives and were tested after 1 
wk. using an anticipation test of reten- 
tion which should be more sensitive 
than free recall to momentary lapses in 
the availability of an item. The four 
groups comprised a 2 X 2 factorial de- 
sign, in which one factor was the 
original environment of learning and 
the second was change vs. no change.
Mean recall scores varied from a low of 
8.5 items for the unchanged drum 
group to a high of 11.6 items for the un- 
changed box group. The changed 
groups, drum to box and box to drum, 
had respective scores of 10.1 and 9.0 
items. None of these differences were 
statistically significant, the largest F 
ratio being 2.13 for recall in the box vs. 
recall on the drum. Of course, it is 
possible that after several lists have 
been learned, contextual dependence in- 
creases, and such an hypothesis would 
also explain why the difference between 
change and no change was not sig- 
nificant in the 3-day PI experiment. If 
this were true, one might have expected 
larger effects than were obtained in 
the 4-day and 5-day PI experiments, 
although the data are really insufficient 
to answer this question of relative mag- 
nitudes. 

While the data do not suggest the 
exact mechanisms involved in PI, they 
at least suggest that a part of the PI 
usually obtained is due to the fact that 
successive sets of materials are learned 
in contexts which are similar to one
another. It is only reasonable that the 
present data should suggest, as they
seem to do, that unchanging task-rele- 
vant aspects such as the method of 
learning are more important compo- 
nents of situational similarity than ir- 
relevant contextual stimuli. What is 
surprising is that irrelevant context is 
as important as it is in reducing inter- 
list interference. Perhaps one should 
turn over the coin and consider the re- 
call of early lists as being somehow en- 
hanced by novelty, rather than thinking 
in terms of an inhibition which grows 
with experience in learning. In the 
absence of some agreed-upon “reference 
level” of retention, however, enhance- 
ment and inhibition hypotheses will 
continue to be indistinguishable. And 
furthermore, the appeal of a “novelty” 
explanation follows largely from the 
assumption that the contextual changes 
were functionally irrelevant to S’s task, 
and thus were unlikely to affect the 
locus or form of memory storage for 
verbal lists. In actuality we do not 
know that the contextual changes 
were generally viewed as “irrele- 
vant” by anyone but E. Instead, 
Ss might have thought the contextual 
changes were fundamental and memo- 
rable aspects of an experience whose 
purpose and future course were still 
completely unclear. 


With or without functional rele- 
vance, the change of context might be 
said to have enhanced retention by dif- 
ferentiating the lists which preceded 
the change from those which followed. 
While differentiation has been sug- 
gested before in connection with PI 
(e.g., Underwood & Ekstrand, 1967), 
the present data do not suggest how it 
might operate to reduce forgetting. In- 
deed, “differentiation” would appear to 
be little more than a name for the 
phenomenon which has been demonstrated.
ORGANIZED LISTS
2 experiments were performed which show that organization of the
learning material facilitates recall, but has little effect upon recognition 
performance. In the Ist experiment Ss learned lists of 40 words, 10 
each from 4 conceptual categories or unstructured control lists. In 
the 2nd experiment the learning material consisted of CVCs which 
were constructed so that letter combinations were highly predictable 
in the experimental condition and unpredictable in the control groups. 
Intralist similarity was also varied by using either 5 or 10 different 
consonants to construct each set of CVCs. High intralist similarity 
depressed performance for both recall and recognition, but did not 
interact with list structure. The differential effect of organization 
upon recall and recognition was interpreted as supporting a 2-process
theory of recall and recognition.


Free-recall performance depends 
upon the organization of the learning 
material. The better organized the
material is, be it in terms of an asso- 
ciation net, conceptual categories, or 
redundancy, the better recall becomes. 
Indeed, it appears that the major prob- 
lem which S faces in a recall experi- 
ment is to organize the items for him- 
self so that they can be retrieved later. 
If a list is already objectively organized,
this task is facilitated. 

No conclusive investigations of the 
effects of organization upon recognition
have as yet been reported. Nevertheless,
this problem is of some theoretical
interest. It is often assumed
that recognition and recall provide alternative
measures of habit strength or
associative strength. Recognition is
thought to be the more sensitive measure
of the two, and a response threshold
is evoked to account for the difference.
According to this one-process
theory, one would expect organization, 
like most other experimental variables,
to affect recognition and recall in simi- 
lar ways. An alternate theory holds 
that recognition and recall involve dif- 
ferent processes, and that recall in- 
cludes a recognition stage. Müller 
(1913) distinguished between a re- 
trieval mechanism which makes mem- 
ory traces available and a decision 
mechanism. The S first reproduces 
a response covertly and then judges 
whether the response is the correct one 
by evaluating its familiarity, much as 
the familiarity of a test item is judged 
in a recognition experiment. An overt 
response is made only if the familiarity 
exceeds a criterion. 

The two-process theory failed to in- 
terest psychologists until recently. 
Adams (1967) has argued for a closely 
related two-process interpretation of 
recall and recognition, and Peterson 
(1967) has shown that a distinction be- 
tween search and judgment in recall 
can help to integrate a large number of 
experimental observations. The two- 
process theory has also been advocated 
by Dale (1967) who found that re- 
sponse availability correlated highly 
with recall, but not with recognition 
performance. The present report explores
a further implication of the
theory. If retrieval and recognition 
processes can be separated as neatly 
as the theory implies, organization of 
the learning material can have no effect 
upon recognition, since organization 
facilitates retrieval and only recall in- 
volves retrieval. 


EXPERIMENT I 


Cofer, Bruce, and Reicher (1966) 
have investigated the effect of organiza- 
tion in terms of conceptual categories 
and manner of presentation upon free 
recall. Highly organized lists of 40 
words were constructed by combining 
the 10 top associates from four concep- 
tual categories such as “vehicle,” 
“sport,” “furniture,” etc., from the 
norms of Cohen, Bousfield, and Whit- 
marsh (1957). Corresponding low- 
organization lists were constructed 
from infrequent associates to four dif- 
ferent concepts. Both level of organi- 
zation and blocked vs. random presen- 
tation significantly affected recall. Ex- 
periment I was in part a replication of 
the two most extreme groups of Cofer 
et al.: recall performance of the high- 
organization blocked-presentation group 
was compared with performance of the 
low-organization random-presentation 
condition. In addition, recognition
memory for the same two experimental 
conditions was investigated. 


Method 


Subjects—Forty undergraduate students
participated as part of a course requirement. 

Material—All items were selected from 
the norms of Cohen et al. (1957). Two 
high-structure lists were constructed. For 
the recall list, the 10 most frequent words 
from four of the categories of Cohen et al. 
were selected and arranged in blocks by
category. For the recognition list, the 20 
most frequent words from four different cate- 
gories were determined. One-half of these 
were designed as study items and the others 
as distractors. Blocked presentation was
used for the 40 study items. All 80 items in
random order constituted the test material.


WALTER KINTSCH


Two low-structure lists, one for recogni- 
tion and one for recall, were constructed in a 
corresponding manner, except that the words 
chosen for each category were always the 
least frequent responses, and the 40 study 
items were arranged in random order for 
presentation. 

Different categories were selected for each 
list. Double words, plurals, misspellings, 
and proper nouns were excluded. The lists 
did not differ significantly in average word 
frequency (G count of Thorndike & Lorge, 
1944). 

Experimental design—Each S received all 
four experimental treatments, which con- 
sisted of the factorial combination of recog- 
nition-recall and high-low structure. The 
order of administration of the four conditions
was counterbalanced over blocks of
four Ss. 
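
One way to counterbalance four treatments over blocks of four Ss is a cyclic Latin square, in which each treatment occupies each serial position once per block. The sketch below is an assumption for illustration; the text does not say which counterbalancing scheme was actually used, and the function names are hypothetical.

```python
# The four treatments: recognition vs. recall crossed with high vs. low structure.
CONDITIONS = [
    ("recall", "high"),
    ("recall", "low"),
    ("recognition", "high"),
    ("recognition", "low"),
]


def latin_square_orders(conditions):
    """Build a cyclic Latin square: row r is the condition list rotated by r."""
    n = len(conditions)
    return [[conditions[(row + col) % n] for col in range(n)] for row in range(n)]


def order_for_subject(subject_index, conditions=CONDITIONS):
    """Assign an order to S number `subject_index` (0-based), cycling by block."""
    orders = latin_square_orders(conditions)
    return orders[subject_index % len(orders)]
```

With 40 Ss this gives 10 complete blocks, so every treatment appears equally often in every serial position. A cyclic square does not balance which treatment immediately precedes which, so it is only one of several schemes the description would fit.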

Procedure—Each word was printed on a
3 × 5 in. card. The E presented the 40 study
cards for each list manually from behind a
wooden screen at a rate of 2 sec. per card.
The S read the word aloud as it was shown. 
Retention tests were given immediately after 
the items of each list were presented. For 
the recall condition, S was given a lined 
sheet and asked to write down all the words 
which he could remember, irrespective of 
order. For the recognition condition a 
mimeographed test sheet was used which con- 
tained the 40 study items and the 40 cor- 
responding distractor items in random order. 
The S circled those items which he believed 
to be old items. The maximum response 
time was 2 min. for all tests. The S was 
not informed about the nature of the reten- 
tion test until after he had learned a list. 


Results 


The main results of the experiment 
are summarized in Table 1. The recall 
data agree with the results reported by 
Cofer et al. The difference between 
recall in the high-structure and low- 
structure conditions was almost identi- 
cal in the two experiments. The over- 
all somewhat poorer performance of the 
present Ss may be related to the fact 
that they learned four lists per session, 
while Cofer et al.’s Ss learned only a
single list.

TABLE 1

MEAN RETENTION SCORES FOR EXPERIMENTAL CONDITIONS IN EXP. I

                    Correct Response             Incorrect Response
Condition           Recall        Recognition    Recall      Recognition
High Structure      21.7 (24.3)   34.8           .3 (.3)     3.3
Low Structure       14.8 (17.1)   32.2           .7 (.3)     2.2

                    Difference Score
Condition           Recall               Recognition
High Structure      21.5 (24.0)          31.5
Low Structure       14.1 [illegible]     30.0

Note.—The numbers in parentheses give the results
from the comparable groups in the Cofer et al. (1966)
experiment.

The difference between the number
of correct responses and incorrect responses
was obtained for each S for
both recognition and recall. These
differences were analyzed by an analysis
of variance. Recognition was significantly
superior to recall, F (1, 108)
= 406.5 (p < .01). The effect of
organization in recall was also highly
significant, F (1, 108) = 66.4 (p < .01).
The effect of organization in
recognition was not statistically significant,
F (1, 108) = 2.7 (p > .05).
Order of presentation was also investigated
in the same analysis, but failed to
reach statistical significance.

Taking the difference between cor- 
rect and incorrect recognitions as a 
measure of recognition performance 
implies a high-threshold model for 
recognition. Several studies have 
shown such a model to be inadequate. 
A better estimate of memory strength
in recognition may be obtained by using
a memory model analogous to the
theory of signal detection. Thus, the
tables of Elliott (1964) were used to
obtain values of d’ for each S. The
average d’ for the high-structure list
was 2.79. The average d’ for the
low-structure list was 2.71. The t value for
this difference is .6 for 39 df.
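
As an illustration of the signal-detection measure, the following sketch computes d’ from a hit rate and a false-alarm rate. It assumes the equal-variance yes-no model, d’ = z(hits) − z(false alarms); the function name `d_prime` is hypothetical, and the rates are taken from the group means of Table 1 (40 old and 40 new items per test). Because the values reported above were obtained from Elliott's tables for each S and then averaged, figures computed from group means should not be expected to reproduce 2.79 and 2.71 exactly.

```python
from statistics import NormalDist


def d_prime(hit_rate: float, false_alarm_rate: float) -> float:
    """Equal-variance signal-detection sensitivity: z(H) - z(FA).

    Both rates must lie strictly between 0 and 1.
    """
    z = NormalDist().inv_cdf  # inverse of the standard normal CDF
    return z(hit_rate) - z(false_alarm_rate)


# Group means from Table 1, expressed as proportions of 40 items.
# These are illustrative inputs, not the per-S data used in the paper.
high_structure = d_prime(34.8 / 40, 3.3 / 40)
low_structure = d_prime(32.2 / 40, 2.2 / 40)

print(f"high structure: {high_structure:.2f}")
print(f"low structure:  {low_structure:.2f}")
```

The two values differ only slightly, which is in line with the conclusion that list structure has little effect on recognition sensitivity.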


It is clear that organization in terms
of conceptual categories is not an important
variable in recognition. At
the same time, it has a very pronounced
effect on recall. The only effect on
recognition is a slight shift in the recognition
criterion. More correct as well
as more incorrect responses were given 
for the high-structure list than for the 
low-structure list. Thus, for reasons 
unknown, Ss seem to relax their re- 
sponse criterion somewhat if the list is 
well structured. The converse finding, 
that fewer recall intrusions occur with 
high- than with low-structure lists 
(Table 1), is easily accounted for by a 
retrieval hypothesis. 


EXPERIMENT II 


Although the results of Exp. I unam- 
biguously support the two-process hy- 
pothesis, a second experiment was per- 
formed to establish their generality. 
Less integrated learning material was
used (CVCs) as well as a different 
definition of organization. Miller 
(1958) has shown that organization 
induced through artificial rules facili- 
tates the recall of meaningless letter 
combinations. Similar learning mate- 
rial and similar rules were used in the 
present experiment. 

Beginning with Hollingworth (1913) 
it has been accepted that part of 
the usually observed superiority of 
recognition over free recall is due to 
the whole-part problem. In order to 
recall an item correctly, all of it must 
be stored in memory, or at least enough 
of it to permit reconstruction. On 
the other hand, it is possible to recog- 
nize an item correctly by responding 
only to a small part of it. There is no 
doubt that response integration contrib- 
utes to the difference between recall 
and recognition. However, a perform- 
ance difference remains even when part 
learning is controlled (McNulty, 1965). 
A comparison of experiments using
words and nonsense syllables might 
permit one to evaluate the importance 
of part learning and its interaction with 
the organization of the learning mate- 
rial. 

Intralist similarity as well as list 
structure was varied in Exp. II. In 
the experiment just reported the two 
are confounded, in that the high-struc- 
ture list has much higher intralist simi- 
larity than the low-structure list. 


Method 


Subjects—Forty-two undergraduate students
were paid to serve in two experimental
sessions. They learned four lists per session, 
plus a warm-up task. The order of list 
presentation was counterbalanced over Ss. 

Material and experimental design—A pool 
of 50 CVCs was constructed for each of the 
eight experimental conditions. The experi- 
mental conditions were obtained by combining 
the factors recognition-recall, high-low struc- 
ture, and high-low similarity. The high- 
similarity items were constructed first. Sets 
of 50 low-structure CVCs were generated 
from five different consonants and five 
vowels in such a manner that each conso- 
nant followed every other consonant either 
two or three times, each time with a different 
vowel. Two such sets were constructed, one 
for recognition and one for recall, each time 
with different consonants. The sequential 
dependencies which did exist between letters 
in these sets were so small and complex 
that Ss could hardly make any use of them.
For high-structure items strong sequential 
dependencies between the consonants of the 
CVCs were introduced. Each consonant was 
combined with only two other consonants, 
but with all five vowels. Thus, S could 
learn relationships such as if a syllable starts 
with a C, the second consonant can only be a
D or a K.
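
The construction of the high-structure items can be sketched as follows. This is only an illustration under stated assumptions: the text names C, D, and K, so the remaining two consonants (M, P), the vowel set, and the cyclic rule that decides which two consonants may follow each first consonant are all hypothetical. The sketch does show why the rule yields a pool of exactly 50 items: 5 first consonants × 2 permitted second consonants × 5 vowels.

```python
from itertools import product

# Only C, D, and K are mentioned in the text; M and P are placeholders.
CONSONANTS = ["C", "D", "K", "M", "P"]
VOWELS = ["A", "E", "I", "O", "U"]


def high_structure_pool(consonants, vowels):
    """Build CVCs in which each first consonant takes only two second consonants.

    Hypothetical rule: a consonant may be followed only by the next two
    consonants in a fixed cycle, so C is followed by D or K.
    """
    n = len(consonants)
    pool = []
    for i, first in enumerate(consonants):
        followers = [consonants[(i + 1) % n], consonants[(i + 2) % n]]
        for last, vowel in product(followers, vowels):
            pool.append(first + vowel + last)
    return pool


pool = high_structure_pool(CONSONANTS, VOWELS)
print(len(pool), pool[:5])
```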

Low-similarity items were obtained by 
transforming one-half of the consonants of 
the high-similarity items into consonants 
different from the original five. For in- 
stance, every other C in the high-similarity 
condition was changed into a G; every other 
D was changed into an F, etc. 

Meaningfulness values (Noble, 1961) for 
the eight sets of items did not vary signifi- 
cantly (F <1). 

From each 50-item set a subset of 20 items 
was selected at random and designated as
study items. The remaining items were used 
as distractors in the recognition tests. 

Procedure—All items were presented by 
means of a memory drum at a 2-sec. rate.
Presentation order was random for all con- 
ditions. Immediately after presentation 
either a recall or a recognition test was ad- 
ministered. Recall was in writing. For the 
recognition test, mimeographed sheets were
used which contained the 20 study items plus 
10 distractor items. A time limit of 2 min. 
was observed. 

Three trials were given for each list. 
Order of presentation varied from trial to 
trial. A different set of distractor items 
was used for each trial. 


Results 


The results of Exp. I were com- 
pletely confirmed. Structure signifi- 
cantly facilitated recall, but not recog- 
nition. High intralist similarity de- 
pressed both recall and recognition, but 
similarity did not interact with either 
the testing procedures or list structure. 

Table 2 shows the mean number of 
correct and incorrect responses aver- 
aged over trials. In Fig. 1 the differ- 
ence scores (correct minus incorrect 
responses) are plotted as a function of 
learning trials. An analysis of vari- 
ance of the difference scores revealed 
highly significant effects of similarity, 
but no Similarity × Recognition-Recall
interaction (F < 1). On the other
hand, the Structure × Recognition-Recall
interaction was highly significant
statistically, F (1, 943) = 31.3 (p < .01):
the average recall score for the
high-structure lists was 6.5 vs. only 4.7 
for the low-structure lists, while recog- 
nition performance was almost identical 
for the high- and low-structure lists 
(11.7 vs. 11.8). Performance changes 
as a function of learning trials were 
also statistically significant and are 
shown in Fig. 1. The facilitating effect
of structure in recall increases
with trials as S learns more about 
the rules which govern the composition 
of the set of learning items. 

Recognition was also analyzed by determining
d’ scores for each S. Mean
d’ for high- and low-structure lists was
1.5 and 1.6, respectively. Thus, there
is certainly no evidence for better retention
of the structured lists. The mean
d’ values for the low- and high-similarity
lists were 1.9 and 1.2, respectively.
This difference is statistically
significant, F (1, 452) = 90.2 (p < .01).

Note the reversal of the incorrect response
scores in Table 2 as a function
of list structure. High structure produces
more false recognitions than low
structure, but fewer recall intrusions.
This difference was highly significant
statistically, F (1, 943) = 34.7 (p < .01),
and replicates a finding commented
upon in the discussion of
Table 1.


TABLE 2

MEAN RETENTION SCORES FOR EXPERIMENTAL CONDITIONS IN EXP. II
AVERAGED OVER THREE TRIALS

                      Correct Response          Incorrect Response
Condition             Recall    Recognition     Recall    Recognition
High Similarity
  High Structure      8.0       13.8            2.7       3.3
  Low Structure       6.9       13.2            3.5       2.5
Low Similarity
  High Structure      9.3       14.6            1.6       1.8
  Low Structure       8.2       14.7            2.1       1.6


TABLE 3

RECALL INTRUSIONS IN EXP. II AS A FUNCTION OF NO. OF LETTERS IN
INTRUSION RESPONSE WHICH DIFFERED FROM A WELL-FORMED EXPERIMENTAL ITEM

                      High Structure              Low Structure
No. of Different      High          Low           High          Low
Letters               Similarity    Similarity    Similarity    Similarity
0                     155           83            137           27
1                     159           84            285           224
2                     0             6             2             4
Total                 314           173           424           255
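
The difference scores behind the Structure × Recognition-Recall interaction can be recomputed from the cell means of Table 2. The sketch below is a worked check rather than the authors' analysis: it averages correct-minus-incorrect scores over the two similarity levels, and the dictionary layout and function name are assumptions made for illustration. The results agree with the averages quoted in the text (6.5 vs. 4.7 for recall, 11.7 vs. 11.8 for recognition) to within rounding of the tabled means.

```python
# (correct, incorrect) means from Table 2, keyed by
# (similarity, structure, test). Values are copied from the table.
TABLE_2 = {
    ("high", "high", "recall"): (8.0, 2.7),
    ("high", "high", "recognition"): (13.8, 3.3),
    ("high", "low", "recall"): (6.9, 3.5),
    ("high", "low", "recognition"): (13.2, 2.5),
    ("low", "high", "recall"): (9.3, 1.6),
    ("low", "high", "recognition"): (14.6, 1.8),
    ("low", "low", "recall"): (8.2, 2.1),
    ("low", "low", "recognition"): (14.7, 1.6),
}


def mean_difference(structure: str, test: str) -> float:
    """Average (correct - incorrect) over similarity levels for one cell."""
    scores = [
        correct - incorrect
        for (_, st, t), (correct, incorrect) in TABLE_2.items()
        if st == structure and t == test
    ]
    return sum(scores) / len(scores)


for test in ("recall", "recognition"):
    high = mean_difference("high", test)
    low = mean_difference("low", test)
    print(f"{test:12s} high={high:.2f} low={low:.2f} advantage={high - low:+.2f}")
```

The structure advantage is sizable for recall and essentially absent for recognition, which is the pattern the interaction term captures.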

A total of 1,166 recall intrusions 
were made. Table 3 shows their dis- 
tribution over the four experimental 
conditions. Both high structure and 
low similarity decrease the number of 
intrusions appreciably. Furthermore, 
when lists are structured, about half of 
all intrusions come from the set of 
items used as distractors in the experi- 
ment, while the intrusions for unstruc- 
tured lists tend to be less closely related 
to the set of experimental items. 
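
The statement that about half of the intrusions for structured lists are well-formed experimental items can be checked directly against the counts in Table 3. The following sketch is illustrative only; the dictionary layout and the helper name are assumptions, while the counts are those of the table.

```python
# Intrusion counts from Table 3, keyed by (structure, similarity);
# inner keys give the number of letters differing from a well-formed item.
TABLE_3 = {
    ("high", "high"): {0: 155, 1: 159, 2: 0},
    ("high", "low"): {0: 83, 1: 84, 2: 6},
    ("low", "high"): {0: 137, 1: 285, 2: 2},
    ("low", "low"): {0: 27, 1: 224, 2: 4},
}


def well_formed_share(cell: dict) -> float:
    """Proportion of intrusions that are themselves well-formed items."""
    return cell[0] / sum(cell.values())


total = sum(sum(cell.values()) for cell in TABLE_3.values())
print("total intrusions:", total)
for (structure, similarity), cell in TABLE_3.items():
    share = well_formed_share(cell)
    print(f"{structure}-structure, {similarity}-similarity: {share:.2f}")
```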


Discussion 


The results of the two experiments are 
clear: Organization of the learning mate- 
rial facilitates recall, but not recognition, 
independent of whether words or nonsense 
syllables are learned or of intralist sim- 
ilarity. The data are in agreement with 
a theory which assumes that recall in- 
volves a retrieval phase and that an or- 
ganized list is easier to retrieve than an 
unorganized list, but that recognition does 
not involve retrieval and is therefore 
independent of list organization. In 
recognition S appears to respond simply 
on the basis of the familiarity of each 
item as it is presented. However, the 
familiarity of an item refers to the item 
in the particular context in which it has 
been studied. In most recognition experi- 
ments context is not changed between 
study and test. If it is changed, performance
decrements can be expected (Müller,
1913).

Note that the recognition results re- 
ported here refer to item recognition, not 
to class recognition. A task involves 
item recognition if S must decide which
items of a homogeneous set he has ex- 
perienced before. A task involves class 
recognition if distractors are used which 
are not members of the set of study items. 
If recognition on the basis of class mem- 
bership is permitted, better organized lists 
will, of course, be easier to recognize. 
If a highly organized list (e.g., animal 
names) is presented for study, and the 
recognition test uses names of kitchen 
utensils as distractor items, a large effect 
due to class recognition will be obtained. 
Lachman and Tuttle (1965) described an 
experiment which shows a similar effect. 
They compared recognition and recall 
with two kinds of materials: words 
arranged in a grammatical paragraph 
and the same words in random order. 
Recognition tests were given by mixing 
other words with the study words and 
presenting them one by one for a yes-no 
test. Large differences in favor of the 
organized-presentation method were ob- 
served. However, these differences may 
be attributed to the use of distractor 
items which were not related to the study 
items in terms of the organizational vari- 
able. Although both the study words 
and the distractors were high-frequency 
words, only the former were organized 
in a meaningful paragraph, and the dis- 
tractors were unrelated to this organiza- 
tion. The Ss thus could respond on the 
basis of class recognition; the study para- 
graph established a certain set, and Ss 
could exclude all words which did not 
fit into this context. 

It has been observed previously that
learning by the method of recognition is a 
one-stage process and that learning by the 
method of free recall is a two-stage proc- 
ess (Evans & Dallenbach, 1965; Kintsch 
& Morris, 1965). In addition, it has been 
shown that recognition learning is a sub- 
process of recall learning. This argu- 
ment was based entirely upon an analysis 
of the formal properties of the learning 
data which are obtained with the two
procedures. Müller’s two-process theory
of recall suggests that the two stages
which were identified by mathematical
analysis may be interpreted as the retrieval
of possible responses and the recognition
of the correct response. Decision
procedures for choosing a response
in a recognition task have been investigated
previously (Kintsch, 1967). An
adequate model of the retrieval process is 
not available at this time. Such a model 
must deal with the facilitation of retrieval 
through organization of the learning 
material as well as with the subjective 
organization which a list undergoes dur- 
ing free-recall learning. 
TRANSFER-ACTIVATED RESPONSE SETS:
EFFECT OF OVERTRAINING AND PERCENTAGE OF ITEMS
SHIFTED ON A VERBAL DISCRIMINATION SHIFT ¹


COLEMAN PAUL, CHARLES CALLAHAN, MARILYN MERENESS
AND KENNETH WILHELM


Adelphi University 


To study overtraining effects upon a discrimination shift, Ss received 
either high or low degrees of training on a 3-alternative verbal dis- 
crimination task. During shift training either 25%, 50%, 75%, or 
100% of the original 12 triads had an alternate member of the triad 
designated as correct. Results were consistent with previous findings 
in discrimination reversal and suggested the operation of a generalized 
response set to suppress originally reinforced responses.


In two previous verbal discrimina- 
tion reversal experiments (Paul, 1966, 
1968) Ss were presented 12 pairs of 
words until the “correct” member of 
each pair was learned. Following 
either 0% or 50% postcriterion train- 
ing (PCT) trials, discrimination re- 
versal conditions were instituted. For 
various subgroups of Ss, 100%, 75%, 
50%, or 25% of the 12 pairs received 
a reversed or shifted reinforcement con- 
tingency. PCT facilitated reversal per- 
formance, and reversal performance 
varied as a nonmonotonic function of 
the percentage of items shifted (PIS). 
The rapid (virtually one-trial) achieve- 
ment of reversal criterion in the 100% 
reversal condition suggested the opera- 
tion of a generalized set to “reverse” 
all first-task selections. The occur- 
rence of a relatively high percentage 
of errors on the nonreversed items in 
the 75% condition seemed consistent 
with the notion that such a generalized 
response set was operating. 

1 This research was supported, in part, by
National Science Foundation Grants GB2782
and GB5481.

In a two-alternative, forced-choice
task, a set to reverse all first-task responses
is indistinguishable from a set
to suppress first-task responses—if indeed
there is a difference between
these sets. A reversal set cannot operate
in a three-alternative task (with
the alternatives unrelated), whereas a 
suppression set can. A three-alterna- 
tive task has the further analytic ad- 
vantage in that it permits observation 
of two types of errors during second- 
task performance: the error of giving 
the formerly correct response (S+ 
error) and the error of selecting the 
never-reinforced alternative. In the 
previously mentioned experiments 
(Paul, 1966, 1968) the PCT group 
tended to make somewhat fewer S+ 
errors (the only possible type of error 
in the forced-choice, two-alternative 
task) than did the criterion group; 
thus it appeared that the facilitating ef- 
fect of overtraining upon reversal per- 
formance stemmed from a reduction of 
competition from Task 1 responses, 
but the evidence suggesting this inter- 
pretation was less than convincing. 

The present experiment was de- 
signed to obtain evidence bearing upon 
the operation of a generalized response 
set in discrimination-shift performance, 
and to explore more fully the mecha- 
nisms by which PCT facilitates shift 
performance. In addition, this experi- 
ment extends the empirical findings 
previously reported to the three-alternative
task.


METHOD 


Subjects.—Eighty undergraduate students,
naive to verbal learning experiments, served 
as Ss. 

Procedure.—Lists: Thirty-six three-letter 
words conforming to the CVC arrangement 
were selected from the 99-100% association 
values of Archer (1960). The words were 
divided into 12 sets of three words each 
(triads). Except for efforts to minimize 
formal similarity within triads (e.g., no members
of a triad shared the same first or second
letter) and for the restriction that each
word be used only once, the words were 
assigned randomly to triads. Three orders 
of the triads were arranged so that each 
word in a triad occupied, from left to right, 
the first, second, and third positions equally 
often, and so that the serial positions of the 
triads were unsystematic. To somewhat 
increase the generality of the experiment, 
three lists, differing in the specific triads, 
were made up as described above using the 
original pool of 36 words. 

The triads were printed on individual 4 ×
6 in. index cards which were displayed by
means of a manually operated cardholder.
The cards were changed approximately every
4 sec., and there was a 4-sec. intertrial interval.

The Ss were assigned to the 8 conditions
(2 degrees of learning × 4 percentages of
items shifted) according to a randomized-blocks
procedure. All Ss were given instructions
appropriate for the verbal discrimination
task: they were informed that
the “correct” word was arbitrarily selected,
and that if they guessed correctly, E would
say “correct,” but if they guessed incorrectly,
E would say nothing. The Ss were assigned
so that each cell contained n Ss before any
cell contained n + 1 Ss. The Ss in the criterion
group learned to a criterion of one
errorless trial and then received one of the
four percentage-of-reversal treatments.
Overlearning group Ss achieved the one
errorless trial criterion in original learning
and then received 50% additional trials. If
the criterion trial was an odd number (e.g.,
9), then overtraining was taken as 50% of
the next higher even number (e.g., 10).
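
The rule for the number of overtraining trials amounts to a small piece of arithmetic, sketched below. The function name is hypothetical; the rule itself (50% of the criterion trial, rounding an odd criterion trial up to the next even number first) is the one stated above.

```python
def overtraining_trials(criterion_trial: int) -> int:
    """Number of postcriterion trials under the 50% overtraining rule.

    An odd criterion trial is first raised to the next higher even
    number, so that 50% of it is a whole number of trials.
    """
    if criterion_trial < 1:
        raise ValueError("criterion_trial must be a positive integer")
    base = criterion_trial if criterion_trial % 2 == 0 else criterion_trial + 1
    return base // 2


# The example from the text: a criterion trial of 9 is treated as 10.
print(overtraining_trials(9))
```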

The experimental transition from original
learning to shift learning was continuous
(e.g., no lists had to be changed) and simply
started with the appearance of the first item
on the next order of the list. No indication
of the onset of shift condition was
given. Correct items in the shift phase were
determined as follows: For the 25% shift
condition 3 randomly selected triads of the
12 triads received shifted reinforcement
contingencies (i.e., a formerly incorrect member
of a triad, which had been followed by
nonreinforcement when given as a response,
was reinforced in the shift phase). In the 50%
shift condition 3 items, in addition to the 3
shifted in the 25% condition, were shifted.
In the 75% condition 3 more randomly selected
items from the remaining 6 were
shifted, and in the 100% shift condition all
the items were shifted.
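
The selection of shifted triads is nested: each higher percentage keeps the triads shifted at the lower percentage and adds three more. A minimal sketch of one way to produce such nested sets is given below. The function name, the use of a seeded random generator, and the numbering of triads from 0 to 11 are assumptions for illustration and are not described in the text.

```python
import random


def nested_shift_sets(n_triads: int = 12, step: int = 3, seed=None) -> dict:
    """Return the shifted triads for the 25%, 50%, 75%, and 100% conditions.

    Triads are shuffled once; each condition takes a longer prefix of the
    same random order, so every set contains the previous one.
    """
    rng = random.Random(seed)
    order = rng.sample(range(n_triads), n_triads)
    percentages = (25, 50, 75, 100)
    return {pct: set(order[: step * k]) for k, pct in enumerate(percentages, start=1)}


shift_sets = nested_shift_sets(seed=1)
for pct, triads in shift_sets.items():
    print(pct, sorted(triads))
```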


RESULTS 


A simple randomized-design analysis 
of variance indicated that the eight sub- 
groups did not differ with respect to 
trials to criterion on the original task, 
F (7, 72)=1. The eight means 
ranged from 11.2 to 15.8. Analysis of 
errors and analyses using degree of 
learning (DL) and PIS as “dummy 
variables” indicated no difference
among the groups. 

The overlearning group reached shift 
criterion in fewer trials than did the 
criterion group, F (1, 72) = 4.68, p< 
.05; the main effect of PIS was signifi- 
cant, F (3, 72) =8.19, p<.01. As 
can be seen in Fig. 1, the number of 
trials to reversal criterion increased 
from the 25% condition to the 75% 
condition and then decreased. Despite 
the appearance of the curves, the DL ×
PIS interaction was not significant, F
(3, 72) < 1. Analysis of errors to
shift criterion yielded the same out- 
comes as the trials measure except that 
the main effect of DL fell just short of 
the .05 level of significance. 

The errors to shift criterion can be 
broken down into errors on shifted 
items and on nonshifted items. These 
errors were transformed to an error/opportunity
percentage (e.g., Errors
on Shifted Items/[Number of Items
Shifted × Trials to Shift Criterion]).
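
The error/opportunity transformation can be written out as a short function. The sketch below assumes that the result is expressed as a percentage (multiplied by 100); the function name and the example numbers are hypothetical and are not data from the experiment.

```python
def error_opportunity_pct(errors: int, n_items: int, trials: int) -> float:
    """Errors as a percentage of the opportunities to make an error.

    Opportunities are the number of items of the given kind (shifted or
    nonshifted) multiplied by the number of trials to shift criterion.
    """
    opportunities = n_items * trials
    if opportunities <= 0:
        raise ValueError("n_items and trials must both be positive")
    return 100.0 * errors / opportunities


# Hypothetical S in the 25% condition: 3 shifted items, 6 trials to
# criterion, 9 errors on the shifted items.
print(error_opportunity_pct(errors=9, n_items=3, trials=6))
```

Dividing by opportunities puts Ss who needed different numbers of trials, and conditions with different numbers of shifted items, on a common scale.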
Fig. 1. Mean number of trials to shift
criterion as a function of degree of original
learning and percentage of items shifted.


Fig. 2. Mean error percentages for shifted
and nonshifted items during the shift phase.

Fig. 2 indicates that as the PIS increases
there is a decrease in the percentage
of errors on the shifted items.
Only the main effect of PIS was significant
in the analysis, F (3, 72) =
13.2, p < .01. Figure 2 also indicates
that as PIS increases so does the percentage
of errors on nonshifted items.
The main effect of PIS was significant,
F (2, 54) = 39.6, p < .01, and so was
the DL × PIS interaction, F (2,
54) = 3.20, p < .05. It is apparent
that the overtrained group had a
greater percentage of errors than the
criterion group under the 75% condition,
but that these groups did not
differ under the other two conditions.
The larger percentage of errors on 
nonshifted items in the overtrained 
group, compared to the criterion group, 
is reasonable in view of the following 
two assumptions: First, the 75% condition
activates a response set to suppress
all previously correct responses,
thereby increasing the errors on nonshifted
items. Second, the suppression
of previously correct responses
requires that the discrimination of these 
responses from the other two response 
alternatives be maintained or retained 
during the shift phase. It seems 
reasonable that the overtrained group 
could better remember and therefore 
suppress the old correct responses; 
thus, the errors would be increased on 
those items for which the previously 
correct response remained correct. A 
similar or comparable difference be- 
tween the overtrained and criterion 
groups is not expected under the 25% 
and 50% conditions because no homo- 
geneous response set to suppress 
originally correct responses is activated 
in these conditions; this point will be 
pursued later. The overtrained-criterion
group difference with respect to
errors on nonshifted items is not ex- 
pected to be duplicated on shifted items 
because effective suppression of pre- 
viously correct responses only partly 
facilitates performance on these items. 
The correct response still must be 
selected from the two remaining alter- 
natives. 

If PCT indeed serves to maintain
the discriminability of previously cor- 
rect responses and thus increases effec- 
tive suppression of them during the 
shift phase, then the overtrained group 
should give fewer Task 1 correct re- 
sponses as errors (S+ errors) on the 
shifted items than the criterion group. 
Differences in S+ errors should occur 
after the onset of shift conditions is detected,
after one or two trials. For
each S a percentage of S+ error score 
(S+ errors/total errors on shifted 
items) was computed for Shift Trials 
1 and 2 (Block 1) and Trials 3 and 
4 (Block 2). Figure 3, which gives the 
means of percentage of S+ errors, sug- 
gests that the drop in S+ errors from 
Block 1 to Block 2 is sharper for the 
overtrained groups than for the criterion
groups. This DL × Blocks interaction
was significant, F (1, 72) =
35.46, p < .01. The F ratios for the
main effects of DL, PIS, and blocks
were 6.17, 12.56, and 198.46, respectively,
and each was significant at the
.05 level or less. The PIS × Blocks
interaction and the triple interaction,
with respective F ratios of 2.69 and
2.64 (3, 72 df for both), were the only
other interactions to approach significance
(.10 > p > .05). A final point
of interest is the effect of PCT upon
perseveration of originally correct responses
during the initial stages of the
shift task. Only the 100% condition
is presented because it seems the least
likely to be influenced by the other complicating
effects present in the nonhomogeneous
conditions. Since analyses
of the number of S+ errors on the
first shift trial and the number of successive
S+ errors yielded the same
conclusion, only the latter is presented.


Fig. 3. Mean percentages of S+ errors on
the first two trial blocks of shift performance.


The criterion group made fewer (X̄ =
5.3) S+ errors than did the overtrained
group (X̄ = 9.2) before selecting
an alternative other than S+;
standard deviations were 3.91 and
4.89, respectively. The difference was
significant, t(18) = 2.53, p < .05.
Thus it appears that it took slightly 
longer for members of the overtrained 
group to start to shift their responses, 
but once started the shift was com- 
pleted more effectively than in the 
criterion group. This observation is, 
at least superficially, consistent with the 
effects of PCT upon habit reversal in 
some studies using rats (Paul, 1965). 


Discussion 


The results of this experiment are gen- 
erally consistent with the previous experi- 
ments on verbal discrimination shifts 
(Paul, 1966, 1968). PCT facilitated shift 
performance (relatively rather than as as- 
sessed against a nonspecific transfer base 
line), and PIS was nonmonotonically 
related to trials (and errors) to shift 
criterion in the current three-alternative 
task as well as in the previous two- 
alternative ones. An additional con- 
sistency was the observation that per- 
centages of errors on shifted items de- 
creased with increases in PIS, and per- 
centages of errors on nonshifted items 
increased with increases in PIS. 

There are observations that suggest the 
operation, in the 75% and 100% condi- 
tions, of a generalized tendency to sup- 
press first-task responses. The distribu- 
tion of errors, in the 75% condition, over 
the shifted and nonshifted items suggests 
that this condition activates a tendency to 
suppress all previously reinforced first- 
task responses, a tendency which would 
account for the relatively small percentage 
of errors on the shifted items and the 
concomitant large percentage of errors on 
the nonshifted items. In the 100% PCT 
condition the median number of S+ errors 
on the second shift trial was zero. It is 
worth noting that in a two-alternative 
task zero S+ errors means an errorless
trial criterion has been achieved. Thus 
the virtual absence of S+ errors on the 
second trial is consistent with Paul's 
(1966, 1968) finding that reversal cri- 
terion was reached in two (median) trials 
in the 100% PCT condition and also sug- 
gests the operation of a generalized set 
to suppress first-task responses. 

The effective suppression of previously 
reinforced responses requires that the dis- 
crimination of these responses from the 
other two alternatives (within the triad) 
be maintained during shift performance. 
If it seems reasonable to think that the 
maintenance of this discrimination in- 
creases with degree of first-task learning, 
one can note that PCT did facilitate shift 
performance and, as suggested by Fig. 3, 
did so by sharply reducing S+ errors. 
One might consider that the steep drop 
in S+ errors over the first two blocks of 
shift trials suggests a faster extinction 
rate for the overtrained groups. How- 
ever, if extinction implies a “weakening” 
of responses, then such an interpretation 
does not seem warranted for several rea- 
sons. First, in the 75% condition the 
drop in S+ errors on shifted items is ac- 
companied by a drop in S+ responses on 
nonshifted items, where S+ responses are 
still being reinforced. Second, in the 
50% condition PCT did not appear to 
affect the rate with which S+ errors de- 
clined. Thus, the data seem more con- 
sistent with the notion of a transfer- 
activated response set to suppress first- 
list responses than with an analysis that 
treats each triad separately. 

An experiment by Raskin, Boice, 
Rubel, and Clark (1968) found a rela- 
tively large number of errors on non- 
shifted items in a condition somewhat 
similar to the present 75% condition.
They interpreted their finding (and a 
similar one by Paul, 1966) in terms of the 
“frequency theory” of discrimination 
(Ekstrand, Wallace, & Underwood, 1966), 
stating: 


It appears that as the percentage of pairs
requiring the less frequent item as a response
[reversed or shifted pairs] increased,
there was a stronger tendency for Ss to respond
with the less frequent item for all
pairs. This tendency probably produced
a greater number of errors on the nonreversed
pairs, since the correct response
to those pairs was the more frequent item
[p. 528].


Since the present results may be subject 
to the same interpretation, it seems worth- 
while to state some difficulties with this 
interpretation. First, the frequency theory,
as the writers understand it, is item
specific and attempts to account for the
differentiation of the S+ from the S−
for each pair of discriminanda; no mechanism
is stated to account for interpair or
generalized intralist effects.² Further,
from the “frequency theory” analysis of 
the 75% condition by Raskin et al. 
(1968) one might expect that in the 25% 
condition a large number of errors would
occur on the shifted items since Ss should 
tend to give the more frequent response 
to all items. The evidence (see Fig. 2), 
while not inconsistent with this expecta- 
tion, does not seem to strongly support 
it. The 25% and 75% conditions are 
complementary in that both require S to
discriminate three triads from nine triads 
—either three shifted from nine non- 
shifted or the converse. Despite the for- 
mal identity of this requirement of the 
task, performance under the two condi- 
tions was asymmetrical—with the 75% 
condition being the more difficult. Tenta- 
tively, the writers are inclined to think 
that the 75% condition activated a re- 
sponse set to suppress first-task responses, 
a set which increased the difficulty of dis- 
criminating the appropriate occasions for 
giving first-task responses. On the other 
hand, the number and distribution of 
shifted items in the 25% condition were 
insufficient to activate any generalized 
response set. Thus, in this condition the 
shifted items were learned individually 


2 Ekstrand et al. (1966) do mention “fre- 
quency rules” in a figurative sense; the bases 
for the “formation and operation” of possible 
rules are not discussed. 
and with minimal interference from a
transfer-activated response set to give 
first-task responses. It is likely that the 
frequency theory can be modified to over- 
come the above-mentioned difficulties, 
and, thus, frequency could be viewed as a 
specific basis for a transfer-activated re- 
sponse set. 


3 This interpretation is reminiscent of the
italicized half of Külpe’s statement: “The
force with which a determining tendency acts 
is not only greater than the familiar repro- 
ductive tendencies, it also derives from a dif- 
ferent source and its effectiveness is not tied 
to associative relations [Mandler & Mandler, 
1964, p. 216].” 
ROLE OF AFFECT IN SHORT-TERM MEMORY
FOR PAIRED ASSOCIATES 1


CALVIN F. NODINE AND JAMES H. KORN
Short-term memory (STM) for “minimal” paired associates (PA)
was investigated by presenting Ss 2 successive picture-trigram (Exp. 
I) or picture-letter (Exp. II) PA units. Affective content of 1 
stimulus term was pleasant (P) and the other was unpleasant (U). 
After either a 3-, 9-, or 15-sec. retention interval, during which Ss 
engaged in number tracking, 1 of the 2 PA units was tested for recall. 
Recall of U units was inferior to P units for both response classes. 
However, with letters, recall of P was higher than U at immediate 
recall (3 sec.), but affect did not differentially influence recall at 9 
and 15 sec. With trigrams, differential recall extended from 3 to 15 
sec., with U inferior to P at all retention intervals. These findings
suggest that the differential influence of affect on STM varies directly
with task difficulty, which can be specified in the PA task by response
availability.


Recent evidence (e.g., Kleinsmith & 
Kaplan, 1963, 1964; Walker & Tarte, 
1963) points to a rather complex rela- 
tionship between affect and long-term 
memory (LTM). In these studies, 
paired-associate (PA) learning was in- 
vestigated under varying degrees of 
arousal (measured in terms of skin 
resistance) with recall being measured 
after intervals from 2 min. to 1 wk. 
Learning under conditions of low 
arousal resulted in a typical forgetting 
curve in which immediate recall was 
high and decreased with time. In con- 
trast, learning under high arousal ex- 
hibited low immediate recall, which 
remained relatively unchanged over 
time. These results were interpreted 


1 This investigation was supported by
Grants MH 11974-02 and MH 07722 from the 
National Institute of Mental Health, United 
States Public Health Service. The authors 
would like to express their appreciation to 
Judith V. Korn for her assistance in col- 
lecting and scoring the data. The authors 
are also indebted to Barbara F. Nodine and 
Herbert Clark for their critical readings of 
the manuscript. This paper was presented at 
the meeting of the Psychonomic Society, 
Chicago, October 1967. 


as supporting a theory of perseverative 
consolidation of neural traces. Accord- 
ing to this theory (Walker, 1958), 
learning under high arousal results in 
a more intensely active neural trace 
which, during consolidation, is tem- 
porarily less available for recall than a 
trace formed during learning under 
low arousal. However, because of 
greater consolidation, the high-arousal 
trace results in greater permanent mem- 
ory than the low-arousal trace. 
Although all of the Michigan studies 
measured recall after 2 min., they pro- 
vide little information on short-term 
memory (STM) for individual items. 
This arises from the fact that learning 
and testing of the entire PA list pre- 
ceded the 2-min. retention test. The 
present study examines the role of 
affect in STM for minimal numbers of 
PA items using the Peterson and 
Peterson (1962) technique. This tech- 
nique has the advantage of being more 
comparable to the traditional STM 
situation than the list-learning task 
traditionally used in tests of LTM. If
inhibition resulting from perseverative
consolidation is responsible for
differences in recall between high- and low-
arousal learning at 2 min., one would 
expect shorter retention intervals to 
magnify the differences between high- 
and low-affect units. Differences in 
item difficulty were also incorporated 
into the present experiment since 
Walker’s (1958) theory predicts that 
the duration of the “action decrement” 
varies with the relative ease or diffi- 
culty of the problems. Item difficulty 
was manipulated by pairing stimulus 
terms of high or low affective content 
with either trigrams or single letters. 
Thus, affect was varied on the stimulus 
side, and difficulty (manipulated in 
terms of response availability) was 
varied on the response side. Inclusion
of two response classes, trigrams and 
letters, also permits assessment of the 
relative contributions from response- 
learning and associative-learning com- 
ponents of the PA task on STM. 


METHOD 


Subjects.—The Ss were 84 students from
introductory psychology classes at Carnegie- 
Mellon University fulfilling a course re- 
quirement. Forty Ss (20 males and 20 
females) took part in Exp. I in which tri- 
grams were used as response terms. The 
remaining 44 Ss (22 males and 22 females) 
took part in Exp. II in which single letters 
were used as response terms. 

Apparatus.—Stimulus events were
controlled by recording high-frequency beeps,
timed by an electric metronome, on magnetic 
tape. During the experiment, the beeps on 
the tape activated projectors via a 
Carousel programmer (Model 1) to produce 
controlled exposure durations and slide 
changes. The recall intervals were filled 
with two-digit numbers recorded on the 
same tape at the rate of one per second. An 
announcement of each of the 12 trials and a 
“ready” signal were also recorded on the 
tape so that E simply turned on the tape 
recorder to run the entire experiment. Re- 
sponse latencies were obtained on test trials 
by having E start an electric clock at the 
onset of the test slide and stop the clock 
when S responded. 

Materials.—Each PA unit consisted of a
picture (face) stimulus term and either a
CCC trigram or single-letter response term.
Two classes of pictures were used: (a)
pictures of psychologists taken from issues of
the American Psychologist (e.g., E. G.
Boring, E. C. Tolman, H. F. Harlow, etc.),
and (b) pictures of individuals who had
suffered facial lacerations (e.g., gunshot
wounds, knife wounds, burns, etc.) taken
from a plastic surgery textbook (Kazanjian
& Converse, 1959).2 The affective content
of these stimuli was independently measured
by a group of 17 students who rated the
24 stimuli on a 9-point scale from 1
(extremely unpleasant, U) to 9 (extremely
pleasant, P). The faces were presented to Ss in the
form of slides projected on a screen in front 
of the group. The mean ratings for pictures 
of psychologists’ faces ranged from 6.4 to 
8.0 with an SD of 1.54. The mean ratings 
for pictures of distorted faces ranged from 
1.6 to 3.3 with an SD of .95. There was 
almost no overlap among ratings from P and 
U stimuli as indicated by a t of 35.71 (p <
.001). As a measure of rater reliability, an
intraclass correlation was computed resulting
in r = .85.

In addition, these 17 Ss were given a
questionnaire asking them to give their reactions
to the slides. Part of this questionnaire
consisted of a list of adjectives. The Ss were
asked to circle the adjective which most
accurately described the way they felt about
the slides which they rated as most U. The
following adjectives were most frequently
circled: revolted—4, afraid—3, disgusted—3,
sad—3, interested—2, nervous—2. Several
Ss mentioned that, although they were
disturbed by the U slides, they felt less strongly
about those which they had rated as P.

Response terms were either trigrams or
single letters. The trigram items were taken 
from Witmer’s list (Underwood & Schulz, 
1960) and had association values ranging 
from 29% to 50%. All letters of the alpha- 
bet were used as response terms in Exp. II. 
Table 1 shows each of the 24 PA units 
arranged in pairs for all 12 experimental
conditions for Exp. I and II. A trial consisted
of the presentation of a pair of PA units 
followed by the testing of one PA unit of 
the pair. In both experiments, there was no 
overlap of letters between the two response 
terms of each pair. Furthermore, differences 
between individual response terms of pairs of 
PA units on each trial in terms of associa- 


2 Complete references to the picture stimuli 
are available upon request from the authors. 
TABLE 1

RESPONSE TERMS OF PAIRS OF PA UNITS FOR STUDY-TEST ORDERS AT EACH RECALL INTERVAL
IN EXP. I AND II

                                    Recall Interval
Study Order      3 Sec.                9 Sec.                15 Sec.

Exp. Iᵇ
  U-Pᵃ
    U            wig(29)   pez(33)     mgx(33)   gwe(33)     tdg(45)   anp(42)
    P            jal(33)   ksn(33)     bgx(45)   rjp(45)     sxd(50)   cif(45)
  P-U
    U            jmw(33)   gks(38)     qds(38)   xfr(38)     rat(42)   gbh(38)
    P            mhk(42)   dpb(45)     ngj(38)   lwe(38)     fsz(33)   hjm(33)
Exp. IIᶜ
  U-P
    U            (11.7)    p(11.4)     o(11.2)   a(10.8)     u(9.7)    x(9.5)
    P            7(10.9)   n(10.8)     s(12.4)   b(13.1)     w(12.2)   h(12.2)
  P-U
    U            m(12.4)   412.2)      g(11.0)   q(9.6)      (11.7)    f(11.8)
    P            e(10.3)   (9.7)       d(12.0)   r(12.0)     (9.5)     4(10.7)

Note.—Half of the Ss were tested with underlined items, half with nonunderlined items.
ᵃ U refers to PA units containing high-affect (unpleasant) stimulus terms. P refers to PA units containing
low-affect (pleasant) stimulus terms.
ᵇ Witmer association values (Underwood & Schulz, 1960).
ᶜ Anderson (1965) association values.


tion value were minimal. Within each pair 
of PA units, S-R pairing was counter- 
balanced. 

Procedure—The Ss in both experiments 
were given instructions to the effect that this 
was an experiment to test their memory for 
faces. A tape recorder announced each trial 
which consisted of the presentation of two 
monochromatic pictures of faces, one U and 
one P, each paired with a different trigram 
(Exp. I) or letter (Exp. II), as indicated 
in Table 1. Each PA unit was projected on 
a screen in front of S. One of two study 
sequences—unpleasant first, pleasant second
(UP) or pleasant first, unpleasant second
(PU)—was presented, and each PA unit of
the pair remained on the screen for 2 sec. 
Immediately after offset of the second PA 
unit of the pair, S heard a random series of 
two-digit numbers presented by the tape 
recorder at the rate of one number per sec- 
ond. The S was instructed to repeat aloud 
each number as it was heard. This task was 
designed to minimize rehearsal opportunities 
by Ss and is similar to the number-tracking 
task previously described by Nodine, Nodine, 
and Thomas (1967). After a variable
retention interval of 3, 9, or 15 sec., the
numbers stopped. One of the two stimulus
pictures, U or P, was projected on the
screen alone for 2 sec., and S was given 15
sec. to recall the appropriate response term
and spell it aloud. The E recorded S's
response and latency. The tape recorder 
then announced a new trial, and the sequence 
of events was repeated until S had received 
study-test trials for all 12 pairs of PA units. 
The 12 trials were repeated after a brief rest 
period (15 sec.). Thus, S was tested twice 
(two repetitions, same order of stimuli) in 
all four combinations of study-test sequences 
at each of the three recall intervals making 
a total of 24 observations per S. All Ss
were given a practice trial and questioned
regarding the experimental procedure prior
to initiation of the experiment proper. At
the end of each experiment Ss were asked
for their impressions of the stimuli and their
recall strategies.

In summary, the basic procedure for each
trial consisted of four steps: (a) The Ss were
presented a stimulus term from one class
(e.g., P face) with its response term
(trigram or single letter) for 2 sec.; (b) the
S was then presented the stimulus term
from the other class (e.g., U face) with its
response term for 2 sec.; (c) the S repeated
two-digit numbers aloud during a variable
retention interval of 3, 9, or 15 sec.; (d)
finally, S was presented with one of the two 
stimulus terms as the signal to recall the 
appropriate response term. The “minimal” 
PA task was used in order to facilitate dis- 
crimination between P and U stimulus terms. 
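
The within-S design described above can be laid out as a short sketch. It only enumerates the factor combinations and the four steps of a trial; the names `trial_events` and `design` are illustrative and are not taken from the original report, and the 2-sec. test exposure is the value given in the Procedure.

```python
from itertools import product

STUDY_ORDERS = ("UP", "PU")    # unpleasant first, or pleasant first
TESTED_UNITS = ("U", "P")      # which stimulus face is shown at test
INTERVALS_SEC = (3, 9, 15)     # retention interval filled with number tracking
REPETITIONS = (1, 2)           # the 12 trials were run twice

def trial_events(study_order, tested, interval):
    """Return the four steps of one trial as (event, duration in sec.) pairs."""
    first, second = study_order
    return [
        (f"study {first} face with its response term", 2),
        (f"study {second} face with its response term", 2),
        ("repeat two-digit numbers aloud", interval),
        (f"show {tested} face alone as the recall signal", 2),
    ]

# Every combination of repetition, study order, tested unit, and interval.
design = list(product(REPETITIONS, STUDY_ORDERS, TESTED_UNITS, INTERVALS_SEC))
```

Counting the entries of `design` gives the 24 observations per S stated in the Procedure (2 repetitions, 4 study-test combinations, 3 intervals).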


RESULTS


The findings of Exp. I and II will 
be considered together. Data on both 
frequency of correct responses and 
speeds of correct responses during the 
PA tests are considered in this section. 
Analyses of variance were performed on 
the frequency measure and t tests on
the speed measure. In the analyses of 
variance, presentation order, test order, 
retention intervals, and repetitions were 
varied within Ss. Separate analyses 
were conducted on Exp. I (trigrams) 
and Exp. II (letters). The first
response which occurred during the
15-sec. recall period was counted as
correct or incorrect.

Effects of affect on recall—The Ss 
correctly recalled a greater proportion 
of P than U units in both experiments. 
However, the difference was significant 
only when trigrams were response 
terms (P = .42 vs. U = .29), F (1,
38) = 15.26, p < .001, and not when
letters were response terms (P = .85
vs. U = .82). Failure to obtain
significance with letters may be due to a
ceiling effect resulting from the high
overall level of performance.

Affect resulted in a significant
interaction with retention intervals in the
case of letters, F (2, 84) = 11.08, p <
.001, but not trigrams. According to
this interaction, recall of letters paired
with U was inferior to those paired with
P at 3 sec. (q = 6.42, p < .01; Tukey,
honestly significant difference).
However, after 9 and 15 sec., differences in
recall due to affect were small and
nonsignificant. Figure 1 shows
little change in recall of U items as
retention intervals increased. Recall of
P items decreased with retention
intervals, although most of the loss occurred
between 3 and 9 sec. (q = 5.82, p <
.01).

Fig. 1. Proportion of correct responses as
a function of retention intervals for pleasant
(P) and unpleasant (U) stimulus terms
paired with letters (L) or trigrams (T).

The Affect X Intervals interaction 
was absent in the case of trigrams. 
Recall of trigrams displays a typical 
STM forgetting curve, with recall of 
both P and U items decreasing as re- 
tention intervals increase. Speeds 
(1/latency) of correct responses are 
shown in Fig. 2. 

A pattern similar to that for correct 
responses, only more pronounced, was 
present for speeds of correct responses. 
With letters, correct responses were 
faster for P than U items at 3 sec., t
(42) = 3.23, p < .01. At 9 sec., the
pattern was reversed. Correct
responses were faster for U than P, t
(42) = 3.06, p < .01. Correct
responses to U remained slightly, but
nonsignificantly, faster than correct
responses to P at 15 sec. With
trigrams, speeds of correct responses were
slower (t = 5.19, p < .001) and showed
little change as retention intervals
increased.

Overall, proportions of correct recalls 
decreased as retention intervals in- 
creased for both trigrams, F (2, 76) 
= 19.49, p < .001, and letters, F (2, 
84) = 9.52, p < .001. Figure 3 indi- 
cates that for both response classes 
most of the STM loss occurred between 
3 and 9 sec. (q = 9.32, p < .01, for tri- 
grams; q = 3.59, p < .05, for letters). 

The probabilities of recalling an item 
correctly are higher for letters than 
trigrams, since S must integrate the 
three elements of the trigram unit in 
order to recall it correctly. Associative 
learning is immediately possible with 
letters since only a single element is 
involved. The probability of a correct 
trigram response was adjusted by
including partially correct responses (i.e.,
one or two letters in correct positions),
and the resulting curve is shown in Fig.
3 (broken curve). With the exception
of the 9-sec. recall interval, the adjusted
trigram curve has essentially the same
form as the letter curve. The
difference between the adjusted curve for
trigram responses and the original plot
suggests that response integration is a
major factor in recall of the trigrams.

Fig. 2. Speed of correct responses as a
function of retention intervals for pleasant
(P) and unpleasant (U) stimulus terms
paired with letters (L) or trigrams (T).

Fig. 3. Proportion of letter (L) and
trigram (T) responses recalled correctly as a
function of retention intervals. (Broken line
indicates adjusted proportions for trigram
curve when partially correct responses were
included.)

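
The adjusted trigram score can be illustrated with a minimal sketch. It assumes that a partially correct response (at least one letter in its correct position) is simply counted as correct in the adjusted measure; the report does not state whether partial responses were weighted, so this rule and the function names below are assumptions.

```python
def positions_correct(response, target):
    """Count letters of the response that match the target in the same position."""
    return sum(r == t for r, t in zip(response.lower(), target.lower()))

def is_correct(response, target, adjusted=False):
    """Strict scoring requires the whole trigram; adjusted scoring (assumed rule)
    also accepts a response with one or two letters in correct positions."""
    if response.lower() == target.lower():
        return True
    return adjusted and positions_correct(response, target) >= 1

def proportion_correct(pairs, adjusted=False):
    """pairs is a list of (response, target) tuples."""
    return sum(is_correct(r, t, adjusted) for r, t in pairs) / len(pairs)

# Hypothetical responses to the target "jal", for illustration only.
example = [("jal", "jal"), ("jxl", "jal"), ("xyz", "jal"), ("laj", "jal")]
strict = proportion_correct(example)
lenient = proportion_correct(example, adjusted=True)
```

The gap between the strict and the adjusted proportions is what the text attributes to incomplete response integration.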
Effect of repetitions—The second
repetition of the 12 trials resulted in
increases in proportion recalled
correctly from .23 to .47 for trigrams, F
(1, 38) = 111.92, p < .001, and from
.78 to .89 for letters, F (1, 42) = 32.73,
p < .001.

TABLE 2
MEAN PROPORTION OF SINGLE LETTERS AND
TRIGRAMS RECALLED CORRECTLY FOR
STUDY-TEST ORDERS

Study-Test Order      Single Letter      Trigram

Primacy
PU-P 30
UP-U
Recency
UP-P
PU-U

.88ᵃ

.41ᵇ .80

ᵃ q = 6.68, p < .01 (Tukey, honestly significant
difference).
ᵇ q = 5.48, p < .01.

Effects of study order.—Overall,
proportions of correct recalls were
practically identical for study orders in which
P preceded U (PU) and U preceded 
P (UP). However, there were signifi- 
cant Study Order X Affect interactions 
for both trigrams, F (1, 38) = 11.15, 
p < .01, and letters, F (1, 42) = 7.51,
p <.01. As is shown in Table 2, with 
trigrams recall was facilitated when the 
item to be tested occurred last in the 
study order, indicating a recency effect. 
With letters, recall was facilitated when
the item to be tested occurred first in 
the study order, indicating a primacy 
effect. 


DISCUSSION


Affect differentially influenced recall 
patterns for the two response classes, tri- 
grams and letters. The role of affect on 
STM for PAs will be considered sep- 
arately for each response class within a 
theoretical framework which examines 
task difficulty in relation to underlying 
response integration and associative proc- 
esses. 

Affect and recall of picture-letter
pairs.—The differential influence of affect 
on the recall pattern of PA units in which 
letters were response terms is similar to
that found in the Michigan studies, except
that the differential effects of arousal oc- 
curred over considerably longer retention 
intervals in the latter studies than in the 
present experiments. The fact that, with 
letters as responses, U was inferior to P 
at 3 sec. is consistent with Walker's 
(1958) theory which predicts lower im- 
mediate availability of high-affect units 
than low-affect units due to a greater 
temporary “negative bias against
repetition” for high-affect units during
consolidation.

If high-affect units are less available 
than low-affect units after brief retention 
intervals, differences in immediate recall 


499 


of minimal PAs should be accompanied 
by associative interference in the form of 
intrapair competition. Low-affect units, 
because of their greater temporary avail- 
ability, should intrude on high-affect units 
to a greater extent than high-affect on 
low-affect units. Consequently, more in- 
trapair intrusions from P than U units 
should occur in immediate recall. As the 
greater temporary inhibition of U units 
dissipates with time, U units should be- 
come more available and therefore less 
subject to interference from P units at
recall. Thus, the probability of recall of P
and U units should be equivalent. Figure 
4 shows the proportion of intrapair/total 
intrusions as retention intervals increase 
for P and U units with letters and tri- 
grams as response terms. As predicted 
for letters, at 3 sec. P intruded on U more 
frequently (.86) than U on P (.03), 
z = 4.69, p < .01 (sign test). At 9 and
15 sec., proportions of intrapair intrusions 
from U were slightly, but nonsignificantly, 
greater than those from P. Lower avail- 
ability of U units at 3 sec. was also re- 
flected in speeds of correct responses 
which were significantly slower for U 
than for P units. 

Walker’s theory implies that the action 
decrement associated with the consolida- 
tion process may be active for seconds or 
minutes after learning, depending on the 
difficulty of the task. In the PA task in
which letters were responses, no differ- 
ences between U and P units were found 
at 9 and 15 sec. Moreover, two measures
indicated that availability of U units was
higher than that of P units at 9 and 15 sec.: (a)
speeds of correct responses which were
faster for U than P at longer intervals,
and (b) proportions of intrapair/total
intrusions which were slightly greater for
U than P at longer intervals. These data
suggest that with a relatively simple task,
such as pairing a face with a letter, the
consolidation process is complete within
seconds rather than minutes after learning.

Fig. 4. Proportion of intrapair/total
intrusions as a function of retention intervals for
pleasant (P) and unpleasant (U) stimulus
terms paired with either trigrams or letters.

Affect and recall of picture-trigram 
pairs.—Although it has been shown with
picture-letter units that poorer immediate 
recall of U units is accompanied by in- 
creased associative interference in the 
form of intrapair intrusions, this was not 
the case for picture-trigram units. Intra- 
pair intrusions accounted for 59% of all 
intrusions when the responses were let- 
ters, but only 38% of all intrusions when 
the responses were trigrams. These data 
indicate that very little associative learn- 
ing took place during the 2-sec. study 
period when the response term of the PA 
unit was a trigram rather than a letter. 
Most of the trigram errors (62%) re- 
flected incomplete response learning. Be- 
cause of their low response availability 
which deterred associative learning, tri- 
grams were less susceptible to associative 
interference as can be seen in Fig. 4. 
The plot of intrapair/total intrusions for 
trigrams is of the same form as that for 
letters over intervals, but the magnitude 
of the effects is attenuated. This relative 
lack of associative interference was also 
reflected in speeds of correct responses, 
which were slower than those for letter 
pairs and approximately equal for P and 
U units for all retention intervals.

With trigrams as responses, recall of 
U units was inferior to P units at all 
three retention intervals. Walker’s the- 
ory predicts that the duration of the ac- 
tion decrement will vary with the relative 
difficulty of the task. Presumably, the 
relationship between task difficulty and 
duration of the action decrement is direct. 
The inferior recall of U over 3-15 sec. is 
therefore consistent with the theory. 
Moreover, this relationship obtained even 
when the trigram curves were adjusted 
(as in Fig. 3) by the inclusion of
partially correct responses.

Affect and paired-associate learning— 
Underwood and Schulz (1960) have pro- 
posed response learning (integration) 
and association as two overlapping stages 
in PA learning. Underwood (1964) has 
recently pointed out that differences in 
recall as a function of the number of ele- 
ments making up a response item pri- 
marily reflect differences in degree of re- 
sponse learning. From this point of view, 
differences in performance between the 
two response classes in the present study 
basically reflect differences in degree of 
response learning. The trigram data on
partially correct responses support the 
position that response learning was in- 
complete at the end of one or two repeti- 
tions, but that some of the elements of 
each trigram response had been connected 
to appropriate stimulus terms. 

Affect differentially influenced the re- 
call of single-letter response terms at 
short (3-sec.) retention intervals only. 
This interaction resembles the findings 
of several PA studies employing stimulus 
terms which consisted of high- vs. low- 
arousal words paired with single-digit 
response terms (Kleinsmith & Kaplan, 
1963, 1964; Walker & Tarte, 1963). The 
single-digit response terms in these stud- 
ies are similar to the single-letter re- 
sponse terms used in Exp. II. Both are 
relatively easy tasks that require a min- 
imum amount of response learning. When 
task difficulty is increased by requiring 
response learning in addition to associa- 
tive learning, as in Exp. I where trigrams 
were responses, affect influenced recall 
over longer (9-15 sec.) retention
intervals. These data suggest that the
extent of the influence of affect on STM
varies with task difficulty. When
response availability is high, as in Exp.
II (letters), the duration of the action
decrement is short, and is accompanied
by associative interference in the form
of increased competition from responses
associated with low-affect stimulus terms
which impaired recall of responses
associated with high-affect stimulus terms.
When task difficulty is increased by
decreasing response availability, as in Exp.
I (trigrams), the duration of the action
decrement is increased. Thus, recall of 
U units is consistently inferior to P units 
from 3 to 15 sec. Recall of PA units 
containing trigram responses primarily 
involved response learning as evidenced 
by the intrusion data which reflected in- 
terference with response learning rather 
than associative interference, as in the 
case of letters. 


The amount of rehearsal in 10 number sequences was manipulated by 
requiring 8 Ss to recall all previously presented numbers as each new 
number was introduced. The recall order was forward (F), back- 
ward (B), free recall (FR), or natural serial (NS), and the type 
of presentation involved either overt cumulative rehearsal or no overt 
rehearsal. Type of presentation, recall order, and number of trials 
all yielded significant effects, but no interactions were significant. 
Rehearsal generally enhanced performance, but only in B recall was 
the shape of the serial position curve changed. In the F condition, 
the later the introduction of an item, the faster the rate of loss of 
retention. Forgetting in the F and B conditions was more rapid than 
in the FR and NS conditions. 


The influence of rehearsal on re- 
tention has been a persistent problem 
in the study of human memory and 
learning. It is clear that uncontrolled 
rehearsal represents an undesirable in- 
fluence in attempts to isolate the major 
factors affecting immediate memory. 
But the extent of this potential influ- 
ence is not known. In particular, how 
does the influence of rehearsal interact 
with the order of recall? Does re- 
hearsal affect the “subjective organiza- 
tion” (Tulving, 1964) of memory-span 
sequences? That is, when items are 
presented repeatedly, how much of the 
gain in retention can be attributed to 
rehearsal per se and how much to the 
reorganization of the information to be 
retained? There has been little re- 
search directly concerned with investi- 
gating rehearsal since the usual pro- 
cedure in memory and learning ex- 


1 This investigation was supported by 
United States Public Health Service Re- 
search Grant MH-08556 and United States 
Public Health Service Research Career De- 
velopment Award K3-MH-23,796 to the first 
author from the National Institute of 
Mental Health. The authors are indebted 
to Jane Hoyt for assistance in data collection 
and analysis. 
periments has been to attempt to
minimize rehearsal as much as possible
(e.g., Peterson & Peterson, 1959;
Waugh & Norman, 1965). The study 
to be reported here represents a direct 
attack on the relationship between re- 
hearsal and retention. An attempt is 
made to manipulate rather than sup- 
press rehearsal and to observe the ef- 
fect of overt rehearsal on several kinds 
of recall order. 

In the method used here, which is 
referred to as the cumulative rehearsal 
(CR) method, S is required to recall 
all members of the stimulus sequence 
after each successive member is pre- 
sented. The required rehearsal is 
“overt,” i.e., voiced, rather than
silent (cf. Murray, 1965, 1966). For
example, if forward-order recall is
required, and the stimulus sequence is
7-3-8-5-9-6, a CR protocol with
perfect performance would be:


E’s Presentation    S’s Response

7                   7
3                   7-3
8                   7-3-8
5                   7-3-8-5
9                   7-3-8-5-9
6                   7-3-8-5-9-6

Therefore, in a six-item sequence, the
first item is rehearsed five times, the 
second, four, the third, three, etc. 
The amount of rehearsal is, of course, 
perfectly correlated with the presenta- 
tion position in the sequence. This 
confounding may be evaluated by com- 
parison with a nonrehearsal (NR) 
control condition which corresponds to 
the usual memory-span method and 
does not require any overt rehearsal 
during presentation. 
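
The CR protocol and the rehearsal counts just described can be sketched as follows. The function names are illustrative, items are assumed to be distinct, and the handling of backward (B) and natural serial (NS) orders at each intermediate step is an assumption; the text gives an explicit protocol only for forward recall.

```python
def cr_protocol(sequence, order="F"):
    """Return (presented item, expected response) for each step of a
    cumulative rehearsal trial with perfect performance."""
    steps = []
    for i, item in enumerate(sequence, start=1):
        seen = list(sequence[:i])      # every item presented so far
        if order == "B":
            seen.reverse()             # assumed: most recent item first
        elif order == "NS":
            seen.sort()                # assumed: ascending numerical order
        steps.append((item, seen))
    return steps

def rehearsals_before_final_recall(sequence):
    """Overt rehearsals of each item before the final recall: an item in
    position p of an n-item sequence is rehearsed n - p times."""
    n = len(sequence)
    return {item: n - pos for pos, item in enumerate(sequence, start=1)}

protocol = cr_protocol([7, 3, 8, 5, 9, 6])
counts = rehearsals_before_final_recall([7, 3, 8, 5, 9, 6])
```

For the six-item example, `counts` assigns five rehearsals to the first item and none to the last, which is the confounding of rehearsal with presentation position that the NR control is meant to evaluate.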

With this procedure, it is possible 
to trace the course of retention and 
forgetting of items as the memory load 
is increased. The major methodologi- 
cal difficulty in this paradigm is the 
timing problem. Because old items are 
rehearsed before new items are pre- 
sented, and Ss rehearse at different 
rates, the rate of presentation is not 
standardized. This problem may be 
reduced to some extent by using a 
within-Ss design. By varying the re- 
quired order of recall, the interaction 
of rehearsal with recall organization 
also may be studied. 


METHOD 


Materials.—Each recall sequence consists 
of 10 numbers from the set of numbers 1-20 
inclusive. Eight blocks of 20 sequences were 
derived from four 20 × 20 Latin squares
divided into right and left halves. Therefore, 
each block of 20 sequences contained each 
one of the stimulus numbers in each pre- 
sentation position exactly once. 
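
The property claimed for the blocks can be checked with a small sketch. The cyclic construction below is an assumption made only for illustration (the report does not say how its Latin squares were generated, and a cyclic square yields runs of consecutive numbers that an actual experiment would presumably avoid); the check itself applies to any Latin square.

```python
def cyclic_latin_square(n=20):
    """Build an n x n Latin square over the numbers 1..n (illustrative only)."""
    return [[(row + col) % n + 1 for col in range(n)] for row in range(n)]

def split_into_blocks(square):
    """Split a square into left and right halves; each row of a half is one
    10-number sequence, so each half is one block of 20 sequences."""
    half = len(square) // 2
    left = [row[:half] for row in square]
    right = [row[half:] for row in square]
    return left, right

def each_number_once_per_position(block, n=20):
    """True if every number 1..n occurs exactly once in each presentation position."""
    return all(sorted(col) == list(range(1, n + 1)) for col in zip(*block))

left_block, right_block = split_into_blocks(cyclic_latin_square(20))
```

Because every column of a Latin square contains each symbol once, either half retains that property for its 10 presentation positions, and four squares yield the eight blocks required.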

Subjects—The Ss were eight undergradu- 
ate students at Stanford University who were 
paid for their participation. 

Procedure.—Each S participated in four
experimental sessions, each session separated 
by at least 24 hr. During each of the four 
sessions, Ss were presented with two blocks 
of 20 sequences each and instructed to recall 
the presented sequences in one of four recall 
orders: forward (F), backward (B), free 
recall (FR), or natural serial (NS). In 
the case of F or B recall, Ss were instructed 
to repeat the numbers in the sequence in the 
same (F) or reverse (B) order as the 
presentation order. In FR, Ss were allowed
to recall the presented numbers in any order
they wished. In the NS condition, Ss were
required to recall the presented numbers in
their natural ascending serial order, i.e.,
from the lowest to the highest number.

During each session only one recall order 
was used for both rehearsal conditions, CR 
and NR, with one block of 20 sequences to 
each rehearsal condition. In the CR
condition, Ss were instructed to recall all
preceding numbers in the presentation sequence as
the numbers were spoken by E before the 
next number was presented. That is, E read 
the first number, S repeated it; E read the 
second number, S repeated the first two num- 
bers; E gave the third, and S repeated the 
first three; etc., through all 10 num- 
bers in the sequence. In the NR condition, E 
presented all 10 numbers before S recalled 
all 10. Therefore, in the NR condition, each 
number in the presentation sequence was 
presented once and in fairly rapid succession, 
whereas in the CR condition a number might 
be rehearsed many times before its final 
recall. The presentation rate for the NR 
condition was approximately 1/sec. The
presentation rate for the CR condition de- 
pended partly on the speed with which S 
responded after each successive presentation. 
The S was instructed to respond quickly, 
and E gave the next number as soon as S
finished his response to the previous presen- 
tation. 

In the course of four sessions, each S 
participated in four recall conditions, one 
session each of F, B, FR, and NS, with two 
rehearsal conditions, CR and NR, in each 
session, for a total of eight experimental 
conditions. One block of sequences was as- 
signed to each condition for each S and coun- 
terbalanced across Ss, so that each block was 
used in each condition exactly once. Order 
of presentation of CR and NR and of the 
four recall conditions was also counterbal- 
anced across Ss. 


RESULTS 


To equalize the scoring in the four 
recall conditions, recall order was dis- 
regarded in counting an item correct. 
That is, correct recall depends only on 
the item occurring in the response pro- 
tocol, regardless of position. This pro- 
cedure makes scoring F, B, FR, and 
NS equivalent since correct perform- 
ance does not depend on an item being 
recalled in its correct position. However,
this scoring procedure also means
that this experiment measures only retention
of information about the occurrence
of items from a known set
and not information about order of 
presentation, even though Ss were in- 
structed to attempt to recall items in 
specific orders. 
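
The order-free scoring rule can be illustrated with a minimal sketch. The function name and the example protocols are hypothetical and are not taken from the data.

```python
def score_order_free(presented, response):
    """Count presented items that occur anywhere in the response protocol."""
    return len(set(presented) & set(response))


presented = [7, 15, 2, 9, 14]

forward_protocol = [7, 15, 2, 9, 14]         # same order as presentation
natural_serial_protocol = [2, 7, 9, 14, 15]  # ascending order
partial_protocol = [14, 9, 3]                # 3 is an intrusion and earns no credit

scores = [
    score_order_free(presented, forward_protocol),
    score_order_free(presented, natural_serial_protocol),
    score_order_free(presented, partial_protocol),
]
```

Because position is ignored, the forward and natural serial protocols receive the same score, which is what makes the four recall conditions comparable.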

The number of correct responses in 
the NR presentation condition was 
compared with the number correct on 
the tenth recall trial of the CR con- 
dition in a four-way analysis of vari- 
ance. The three major variables (type 
of presentation, recall order, and trials) 
all had statistically significant effects, 
but none of the interactions approached 
significance. The significant trials ef- 
fect, F (19, 133) = 2.67, p < .01, sug- 
gests a practice effect, but no significant 
trend was apparent. The effect of re- 
call order was very highly significant, 
F (3, 21) = 21.82, p < .001, and with 
both types of presentation the effect of 
recall order was the same: NS = FR 
> F = B. The type of presentation
also was significant, with CR perform- 
ance superior to NR, F (1, 7) = 6.73, 
p < .05, but the effect was not as strong
as might have been expected. The 
failure to find a stronger difference due 
to amount of rehearsal might reflect 
a ceiling effect, since performance in 
all conditions was quite high, particu- 
larly in the FR and NS recall orders. 

Serial position curves.—Shown in
Fig. 1 are the serial position curves for 
the four recall conditions for the NR 
presentation condition and for the tenth 
recall attempt of the CR condition. 
The most important findings shown by 
Fig. 1 are the changes, or lack of 
changes, in the serial presentation re- 
call curves as a function of rehearsal. 
In three of the four conditions, the 
functions for the CR presentations es- 
sentially duplicate, at slightly higher 
performance levels, the functions for 
the NR presentations.


Fig. 1. Serial position curves for four
recall orders for the NR and the last recall
attempt of the overt CR conditions.


The most striking
change occurs in the B recall condition
where a strong recency effect
and no primacy effect is evident with 
single presentations of the recall items. 
When Ss are required to repeat all 
previous items after each item is pre- 
sented, a primacy effect is seen. This 
primacy effect develops largely at the 
expense of the recency effect: the re- 
call of the initially presented items im- 
proves, while the retention of the later 
items, except the last, declines. The 
typical bowed serial position curve is 
evident in the F recall condition with 
both CR and NR presentation, and a 
consistent enhancement in performance
occurs in the CR condition. In both 
FR and NS conditions, however, the 
serial position curves are best described 
as flat, especially with NR presentation. 
Some slight tendency toward a bowed 
serial position curve is found with re- 
peated presentations. The serial posi- 
tion curves for FR and NS in Fig. 1 
are very similar because, as is evident 
by inspection of the individual proto- 
cols, most of the FR sequences were 
recalled in NS order. This was true 
for Ss who received FR before NS, as 
well as for those who had NS first. 
Evidently the massed practice afforded 
by the block presentation of each experimental
condition led to rapid adoption
of the NS recall strategy, even
when it was not required. Examination
of the FR protocols for the four
Ss who received FR before NS reveals 
several different organizations of the 
response sequences (most commonly, 
an F order) before an NS-ordered pro- 
tocol appears. Thereafter, NS recall 
was used almost exclusively. 

Forgetting of individual items.—
With the rehearsal procedure, it is 
possible to examine the course of 
change in recall in finer detail than 
with a single recall procedure. Be- 
cause S is required to attempt to recall 
all previously presented items on each 
presentation, recall of each item is 
tested at every step. Depending on 
the recall condition, the most recently 
presented item either may be recalled 
immediately (the B condition) or may 
be forced to wait until all other pre- 
sented items have been recalled (the F 
condition). Under all four recall con- 
ditions, items show consistent gradual 
decline in recall performance from the 
initial attempt to all later attempts 
(Fig. 2). The most striking feature 
of the change in retention is the almost 
uniform decline as a function of pre- 
sentation order: the later the presenta- 
tion, the lower the probability of cor- 
rect recall. 


Fig. 2. Proportion correct on successive
recall attempts for four recall orders. (The
parameter is the stimulus-presentation position.)


An important result shown in Fig. 2 
is the fact that for the F and B recall 
orders the later the item’s presentation 
position, the faster the rate of loss of 
retention. The slopes of the recall 
functions exhibit a regular increase as 
the position in the presentation sequence
increases. Comparison of the
F and B results is of particular interest.
The B recall condition requires
immediate recall of the most recently 
presented item resulting in perfect per- 
formance on the first recall attempt. 
On the other hand, recall in the F con- 
dition is delayed until prior presenta- 
tions are recalled. As Fig. 2 shows, 
later recalls in the B condition are 
roughly equivalent or slightly inferior 
to the comparable performance in the 
F condition. Therefore, it seems clear 
that the immediate rehearsal of a pre- 
sented item provides no advantage for 
the later recall of that item. This re- 
sult is consistent with earlier findings 
that an immediate second presentation 
gives little increment in later recall 
performance over a single presentation 
(Greeno, 1964). 

Mean recall performance as a func- 
tion of the number of the recall at- 
tempt is shown in Fig. 3. The num- 
ber of the recall attempt is also 
equivalent to the maximum number of 
correct possible. In all four recall con- 
ditions performance is nearly perfect 
through the first five presentations. 
Thereafter the decline in performance 
is quite precipitous; the proportion correct
as a function of recall attempt
leads to a highly significant effect, F
(9, 63) = 30.72, p < .001. Performance
as a function of recall order is
easily separated into two categories:
the decline of the F and B performance 
is much more rapid than for the FR 
and NS conditions. In fact, the mean 
differences between the two categories 
show an almost perfectly linear increase
(see inset, Fig. 3).


Fig. 3. Proportion correct for four recall
orders as a function of the number of items
to be recalled (i.e., number of recall attempts).
(Inset is the mean difference
between Cond. NS and FR and Cond. F and
B as a function of the recall attempt.)


The difference between the two categories
appears to be related to whether information
about order of presentation
must be preserved. In the F and B
conditions, Ss are required to recall 
the numbers in order of presentation. 
In FR and NS, retention of order of 
presentation is not necessary, although 
NS recall does require Ss to produce 
the items in a particular order during 
the recall test. 

Furthermore, reference to Fig. 2 
shows that the difference between the 
F and B conditions on the one hand 
and the FR and NS conditions on the 
other is due to the lower forgetting rate 
for all items under FR and NS condi- 
tions. An alternative possibility would 
have been for superior retention of just 
part of the recall set. As Fig. 2 shows, 
however, the addition of each new item 
to be recalled seems to demand a proportional
decrease in the probability of
recall of each previously presented item. 
Since the serial position curves shown 
in Fig. 1 are simply the envelope or 
lower boundary formed by the lower 
points in Fig. 2 (and reversed right
to left), the fine structure of recall
shown in Fig. 2 may help to account
for the overall serial position curves.
In particular, the relatively flat serial 
position curves for NS and FR reflect 
both a lower rate in decrease of recall 
and a high probability of initial recall 
for all items. 

What happens to an item after it 
is forgotten (omitted from a response 
sequence)? A surprising result in this 
experiment is that after an item is left 
out of a response sequence, it reappears 
approximately 25% of the time in a 
later sequence. Summing over the last 
four recall attempts and all four recall 
orders, the likelihood that an item will 
be correctly recalled after it is for- 
gotten at least once is .246 (328 oc- 
currences of 1,332 possible). The pro- 
portion of reappearances is remarkably 
constant over the four recall orders and 
over successive recall attempts. For 
the F, B, FR, and NS recall orders 
the mean reappearance rate is .25, .23, 
.27, and .26, respectively. On recall 
attempts 8, 9, and 10, the observed 
proportions are .25, .23, and .26. 

Error analysis.—The strong difference
between the F-B conditions and
the FR-NS conditions is further em- 
phasized in Table 1, which contains an 
analysis of the various kinds of errors 
in the NR condition, in the CR con- 
dition, and on Trial 10 of the CR con- 
dition (CR-10). Examination of Table 
1 reveals that the difference between 
F or B and FR or NS is almost per- 
fectly accounted for by the reduction 
in the number of omissions for the FR 
and NS conditions. When recall suc- 
cess is calculated as the number of 
items remembered proportional to the 
maximum possible, the differences are 
found. When the proportion is based
on the number of responses made, this 
difference disappears. 
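
The two ways of computing proportion correct can be compared directly. The sketch below uses the counts for FR recall in the NR condition of Table 1 and assumes, as the row totals suggest, that the maximum possible equals correct plus omissions and that total responses equal correct plus repeated corrects plus intrusions. The function name is hypothetical.

```python
def proportions_correct(correct, repeated, intrusions, omissions):
    """Return (proportion of possible, proportion of responses made)."""
    possible = correct + omissions                # every presented item is either recalled or omitted
    responses = correct + repeated + intrusions   # everything S actually emitted
    return correct / possible, correct / responses


# FR recall, NR presentation (counts from Table 1)
p_possible, p_responses = proportions_correct(
    correct=1338, repeated=5, intrusions=154, omissions=262
)
```

Rounded to three places, these come to .836 and .894, matching the tabled values. Omissions lower only the first proportion, which is why the difference between recall orders shrinks when responses made are used as the base.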

TABLE 1

ERROR ANALYSIS: TYPE OF RESPONSE AS A FUNCTION OF RECALL ORDER
AND TYPE OF PRESENTATION

                                                Errors                  Proportion correct
                       Total no.            Repeated                          Of        Of
Condition              responses  Correct   corrects  Intrusions  Omissions  possible  responses

Cumulative Rehearsal
  Forward (F)            8315      7998        29        288         802
  Backward (B)           8105      7932        16        157         868      .901      .970
  Free Recall (FR)       8628      8368        17        243         432      .951      .970
  Natural Serial (NS)    8616      8403         0        213         397      .955      .975

Cumulative Rehearsal, Trial 10
  F                      1386      1274        10        102         326      .796      .919
  B                      1305      1243         8         54         357      .777      .953
  FR                     1530      1414         8        108         186      .884      .924
  NS                     1532      1444         0         88         156      .903      .943

Nonrehearsal
  F                      1292      1126         3        163         474      .704      .872
  B                      1314      1173         5        136         427      .733      .892
  FR                     1497      1338         5        154         262      .836      .894
  NS                     1577      1395         0        182         205      .872      .885


The overall difference due to rehearsal
may be seen by comparing
errors in CR-10 and NR shown in
Table 1. The decrease in omissions
with rehearsal was greatest for F and
least for NS. While for F the decrease
in omissions was greater than
the decrease in intrusions, for NS the
decrease in intrusions was greater.


DISCUSSION


An interesting result of this experiment
is the greater overall recall in Cond. FR 
or NS than in Cond. F or B (Fig. 3). 
As indicated above, FR and NS were 
similar in this study because Ss chose to 
use NS-order recall in the FR condition, 
even without previous exposure to the 
NS condition. Thus FR and NS were 
effectively the same kind of recall. The 
similarity between overall recall in F
and B conditions would appear to be re- 
lated to the need to preserve “order in- 
formation” as well as “item information.” 
An initial interpretation of this result 
might be that Cond. F and B are more 


difficult because they require ordering the 
items in a unique sequence. But the NS 
condition also requires a kind of ordered 
recall and might be expected to be more 
difficult since reordering might be re- 
quired as new items are presented, while 
new items could simply be added to the 
already existing sequence in Cond. F or 
B. For example, if the correct sequence 
is 7-15-2-9-14 and then 11 is presented, 
in the F (or B) condition, the new 
sequence can be formed by simply adding 
the new item to the appropriate end: 7- 
15-2-9-14-11. In the NS condition, how- 
ever, the last recall sequence (2-7-9- 
14-15) must be broken and a new one 
learned: 2-7-9-11-14-15. Thus, each new 
presentation may require major revision 
in organization rather than simple exten- 
sion of the sequence. By this interpreta- 
tion, which assumes that presentations are 
stored in order of arrival and then reor- 
ganized for NS-order recall, F and B 
recall should be greater than FR and NS 
recall. Since F and B recall was actually
less rather than greater, the assumptions
of this interpretation do not appear to be 
tenable. 
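
The contrast drawn in this example between simple extension and reorganization can be sketched as follows. The sketch only restates the two update rules with the sequence given in the text; it is not offered as a model of the underlying memory process, and the function names are hypothetical.

```python
import bisect


def extend_forward(sequence, new_item):
    """F-order rehearsal: the new item is simply added to the end."""
    return sequence + [new_item]


def extend_natural_serial(sequence, new_item):
    """NS-order rehearsal: the new item is inserted at its ascending position."""
    updated = list(sequence)
    bisect.insort(updated, new_item)  # keeps the list sorted
    return updated


forward = extend_forward([7, 15, 2, 9, 14], 11)
natural = extend_natural_serial([2, 7, 9, 14, 15], 11)
```

The forward sequence keeps its first five positions unchanged, whereas in the natural serial sequence the last two items shift to make room for the new one.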

Crossman (1960) has argued that “item 
information” and “order information” may 
be stored and retrieved separately, and 
that an increase in the demand for one 
type of information may be at the expense 
of a reduction in the other. Thus, in- 
creasing the order requirements in recall 
may reduce the total number of items 
which can be recalled, effectively keeping 
the total amount of information trans- 
mitted approximately constant. Although 
the output in the NS condition is ordered, 
NS-order recall does not demand order 
information in the usual sense since Ss 
may use order information about the NS 
order of No. 1-20 in long-term storage.
Another interpretation of the finding that 
recall in Cond. NS and FR is greater than 
F or B recall is that Ss might scan the 
serially ordered set of items for serial- 
order recall and use available item in- 
formation to either emit or withhold each 
successive item in the sequence. There- 
fore, NS recall might be greater because 
use of order information already in long- 
term storage requires retention only of 
item information. 
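
The scanning interpretation may be made concrete with a minimal sketch. It assumes that item information can be represented as an unordered set of the presented numbers and that the sequence 1-20 is available from long-term storage. Both are simplifying assumptions, and the function name is hypothetical.

```python
def scan_and_emit(item_information, set_size=20):
    """Scan 1..set_size in ascending order, emitting each number held in item information."""
    return [number for number in range(1, set_size + 1) if number in item_information]


presented = [7, 15, 2, 9, 14, 11]
item_information = set(presented)  # order of arrival is not stored
recalled = scan_and_emit(item_information)
```

Since each number in the set is considered exactly once during the scan, an output produced in this way cannot contain a repeated response.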

The possibility of sequential scanning 
suggests at least two other possible ways 
in which the NS condition could produce 
more efficient recall. The first is that the 
scanning method in NS recall would re- 
duce at least one kind of error, that of 
repeated responses. The process of scanning
the possible set would mean that
each item would be considered only once.
When one is trying to produce a series of 
items sequentially with no inherent order, 
however, there is no assurance that an 
item will not be produced more than once. 
This effect can be seen in Table 1 where 
“Repeated Corrects,” items which are 
correct responses, but which are emitted 
in the output sequence more than once, 
occur in every condition except the NS 
condition. The second possibility is that 
partial retention of information (Mur- 
dock, 1963; Slamecka, 1967) could more 
easily be used in the NS condition (and 
also in FR) by scanning the response set
and outputting the item if a suitable 


amount of information indicates that it 
was likely to have been presented. In the 
ordered Cond. F and B, however, a
low response criterion would disrupt the 
sequential recall of items. Hence, it is 
possible that different response criteria 
may be used for NS than for F and B 
conditions. The fact that the difference in 
the number of omissions almost totally ac- 
counts for the difference in the number of 
correct strongly suggests that use of partial
information in the NS and FR conditions
may contribute to the superior
performance under those conditions. 

A reasonable view of these kinds of 
recall may be derived from the assump- 
tion that item information in short-term 
memory might be organized either in 
terms of characteristics of the presenta- 
tion or in terms of characteristics of the 
events. Thus F and B recall may exem- 
plify the former, and FR and NS recall 
the latter. Other studies have shown sev- 
eral differences between same-order re- 
call (F) and serial-order recall (NS). 
Same-order recall of item information de- 
pends upon presentation order (Eben- 
holtz, 1963; Young, Patterson, & Benson, 
1963), while serial-order recall does not 
(Buschke, 1966). As confirmed by the 
present study, proportion correct by serial 
position is different for same-order and 
serial-order recall (Buschke, 1966, 1968). 
Same-order recall may be differentially 
impaired in some brain-damaged patients 
when serial-order recall is relatively intact
(Buschke, 1965).

Two recent studies have indicated that 
same-order recall and serial-order recall 
involve different kinds of short-term stor- 
age, in the sense that different kinds of 
item information about the same events 
may be stored. In the first (Buschke, 
1967), the same sequence was presented 
for two trials, requiring either same-order 
recall on both trials, serial-order recall on 
both trials, same-order followed by serial- 
order recall, or serial-order followed by 
same-order recall. Recall increased when 
either same-order or serial-order recall 
was used on both trials. However, recall 
did not increase when serial-order recall 
followed same-order recall and was less 
for serial-order recall when followed by
same-order recall than when followed by
serial-order recall. In the second study 
(Buschke, 1968), same-order and serial- 
order recall were compared when kind of 
recall was specified either before or after 
presentation and storage. Serial-order 
recall was greater than same-order recall 
when kind of recall was known before 
storage. Both serial-order and same-order
recall decreased when kind of recall
was not known until after storage, but 
serial-order recall decreased more, so that 
serial-order and same-order recall were 
then equal. The differential decrease in- 
dicates that the decrease in recall was due 
to differences in storage rather than to 
nonspecific interference by instructions 
intervening between presentation and recall.
Both studies indicate not only that
different kinds of item information may 
be retained in short-term storage for these 
kinds of recall, but also that when either 
or both same-order and serial-order recall 
may be required Ss may preferentially 
encode for same-order recall, presumably 
because same-order recall requires order 
information as well as item information. 

Greater recall in the NS and FR than 
in the F and B conditions might therefore 
reasonably be attributed to the storage of 
different kinds of item information and 
the storage of additional order informa- 
tion in the F and B conditions. The kind 
of model which seems most appropriate 
for NS-order recall may be one involving 
short-term storage of references to the 
NS sequence of No. 1-20 in long-term 
storage, in effect “marking” those items
in the sequence which were presented for
subsequent scanning of the items in NS
order. The evidence currently available 
certainly suggests that item information 
in short-term storage for NS-order recall 
involves reference to the well-learned se- 
quence of No. 1-20 in long-term storage 
and scanning that serial sequence for 
read-out of “marked” items, rather than 
scanning item information directly related
to presented items.


LIST DIFFERENTIATION, RECALL, AND
CATEGORY SIMILARITY 1


EUGENE WINOGRAD 2 


Emory University 


Following the presentation of 2 lists of words, Ss were asked to iden- 
tify the list membership of all words seen on both lists. List 1 and 
List 2 were related in that each consisted of 6 instances of each of 6 
categories, or 36 words, and 3 of the categories were common to both 
lists. List differentiation was significantly poorer by sign test for 
words from repeated (R) categories than from nonrepeated (NR) 
categories. In Exp. II, recall for both lists was tested following the 
same presentations of the lists as in Exp. I. While total words recalled 
did not differ for R and NR categories, category recall was signifi- 
cantly poorer for List 1 NR over R categories, and words recalled per 
category recalled was significantly poorer for List 1 R over NR 
categories. The results were discussed both in terms of a recency 
principle operating in conjunction with an overloaded memory and in 
terms of an unlearning mechanism. 


In seeking to elucidate possible cues 
for the identification of list membership, 
the writer has reported on the effects 
of presentation frequency, normative 
word frequency, and retention interval 
(Winograd, 1968, in press). If the 
list-differentiation (LD) paradigm 
followed in the above studies may 
be conceived as one basically con- 
cerned with the problem of memory for 
context, it may be argued for the sake 
of analysis that a given word on a list 
has both a general environmental con- 
text and an intralist context composed 
of the other words on that list. For in- 
stance, if all the words on List 1 appear 
on a red background and all List 2 
words appear on a blue background,
the environmental context (background
hue) would be a reliable cue
for a discrimination of list membership. 
Similarly, if all List 1 words are 
printed in Chinese characters and all 
List 2 words are English words, no
cue other than word class would be 
necessary. 

The research reported here is con- 
cerned with the effects of interlist simi- 
larity of category cues on both LD and 
recall. The discrimination of list mem- 
bership should be degraded by redun- 
dancy of the category cue, if the cate- 
gory name plays a functional role in 
free-recall learning. In brief, similar- 
ity should lead to greater confusion in 
this task as well as others. 

The category-clustering effect is well 
documented by now and was used as 
the basis for the present design. Con- 
sider two lists, each composed of words 
belonging to one of six conceptual cate- 
gories. The two lists are related in 
that half of each list is drawn from identical
categories; i.e., three of the
six categories making up each list are
the same, although the specific instances
of the identical categories are different.
The other three categories on each list, 
however, are unique to a list. LD 
should be poorer for items belonging to 
categories which appear on both lists, 
or repeated (R) categories, than for 
items from nonrepeated (NR) cate- 
gories. 

In view of the poorly understood re- 
lationship between LD and availability, 
it is of interest to examine LD and re- 
call separately following identical ma- 
nipulations. The second experiment 
reports data on recall for the same lists 
used in the first experiment. 


METHOD 


Experiment I 


Lists—A mixed-list design was used. 
Each S was exposed to two lists of 36 words 
each. Each list consisted of six words from 
each of six categories in a mixed order. Any 
pair of lists had three categories in common 
(but no words in common), while each list 
had three categories unique to that list.
Words were chosen from the norms of Cohen, 
Bousfield, and Whitmarsh (1957) so that 
categories were balanced for mean frequency 
of occurrence of words as responses to the 
category name. Nine categories were used:
human dwellings, cloths, furniture, trees, 
weapons, sports, natural earth formations, 
vegetables, and vehicles. Each category was 
an R category (common to both lists) for 
one-third of the Ss. Actually, three lists were 
composed so that the three possible pairings 
of the lists resulted in each category being 
common to the two lists one-third of the time. 
If the three lists are called a, b, and c, re- 
spectively, the three pairings used were ab, 
bc, and ca. Within each list, presentation
sequence was randomized with the restriction 
that each category be represented twice 
within each block of 12 words. 
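
One arrangement that satisfies the pairing scheme just described is sketched below. The assignment of particular categories to particular lists is an assumption made for illustration, since the text does not report it. Only the structure (six categories per list, three shared by any pair) comes from the description.

```python
categories = [
    "human dwellings", "cloths", "furniture",
    "trees", "weapons", "sports",
    "natural earth formations", "vegetables", "vehicles",
]

# Hypothetical grouping of the nine categories into three sets of three.
g1, g2, g3 = categories[0:3], categories[3:6], categories[6:9]

lists = {
    "a": g1 + g2,
    "b": g2 + g3,
    "c": g3 + g1,
}

pairings = [("a", "b"), ("b", "c"), ("c", "a")]

# Categories common to both lists of a pairing are that pairing's R categories.
repeated = {pair: set(lists[pair[0]]) & set(lists[pair[1]]) for pair in pairings}
```

With this arrangement every category is an R category in exactly one of the three pairings and an NR category in the other two in which it occurs.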

The test lists contained the 72 words from 
both lists mixed together with the restriction 
that no more than three successive words 
come from the same category or the same list. 

Procedure—The words were presented on 
a Lafayette memory drum at a 2-sec. rate. 
List 1 was presented three times with a 10-sec.
interval between successive presentations.
List 2 was presented three times similarly, 
after a 30-sec. interval between lists. Three 
different orders of each list were used. No
recall test was given at any time during Exp.
I. Instructions stated that the experiment 
was concerned with memory and that S was 
to “try to learn a list of 36 words.” Before 
List 2, S was told that a second list was to 
be presented and that he was to learn the 
words on that list as well. 

The test decks contained seventy-two 3 × 5 in.
cards, each showing one of the words
typed in caps. The Ss were instructed to designate,
next to the appropriate number on the
test sheet, on which of the two lists each 
word had appeared. 

The test sheet contained the confidence 
rating scale which Ss used in assigning words 
to lists. A rating of 1, 2, or 3 indicated that 
the word belonged to List 1, with 1 indi- 
cating certainty, while a rating of 4, 5, or 6 
indicated List 2 membership, with 6 indicating 
certainty. The test procedure is comparable 
to one described elsewhere (Winograd, 
1968). There was a 10-sec. time limit for 
each rating, but within that limit the pro- 
cedure was self-paced. 


Experiment II 

The only difference between the two studies 
was in the test. At the conclusion of presentation
of List 2, Ss in Exp. II were allowed
7 min. to write down as many of the 72 words 
from both lists as they could recall. Recall 
instructions stressed that the words should 
be written in the order that they occurred to 
S. At the end of the recall period, Ss were 
asked to identify the list membership of each 
word recalled by writing “1” or “2” next 
to each word. The confidence scale was not 
used in this test. Instructions for the list- 
identification test emphasized that identifica- 
tions be made in the order in which the 
words were recalled. 

Subjects.—There were 18 Ss of both sexes
in each experiment. They were paid 
volunteers taking courses during the summer
session at Columbia University. Experiment
I was completed before Exp. II began.


RESULTS


Experiment I 

List differentiation.—Ratings of 1,
2, and 3 were combined and scored as
identifications of List 1; ratings of 4, 
5, and 6 were counted as List 2 identi- 
fications. The mean proportion of 
words from NR categories correctly 
identified as to list membership was
.89; the mean proportion of words from
R categories correctly assigned to list 
was .71. Since each S served under
both R and NR conditions, the sign 
test is appropriate. Sixteen of 17 dif- 
ferences observed (there was one tie) 
were in the direction of more correct 
identifications for the NR half of the 
list, p < .01. The mean numbers of
correct identifications by list for each 
condition were, for R categories, 12.78 
and 13.00 for Lists 1 and 2; for NR 
categories, 16.06 and 16.17 for Lists 1 
and 2. R vs. NR differences are sig- 
nificant by the sign test for each list 
separately as well as for combined lists 
(p < .05). LD is clearly poorer for
words belonging to categories appear- 
ing on both lists. 
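
The sign test on the combined lists can be reproduced from the reported counts (16 of 17 nontied differences in one direction). The sketch assumes a two-tailed test, which the text does not specify, and the function name is hypothetical.

```python
from math import comb


def sign_test_two_sided(successes, n):
    """Exact two-sided sign test p value under a binomial(n, 0.5) null."""
    k = max(successes, n - successes)
    upper_tail = sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n
    return min(1.0, 2 * upper_tail)


p_value = sign_test_two_sided(16, 17)  # 16 of 17 differences favor NR
```

The upper tail is 18/131072, so the result falls well below the .01 level reported, under either a one- or a two-tailed reading.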

Use of the rating scale.—Analysis of
the distribution of rating assignments 
for each condition showed no evidence
of a bias toward first- or second-list
ratings. For the R categories, there 
were 321 assignments to List 1 plus 327 
assignments to List 2; for the NR cate- 
gories, there were 322 assignments to 
List 1 plus 326 List 2 assignments. 
However, there was evidence of 
greater confidence for the NR words. 
Maximal confidence ratings (1 and 6) 
constituted 87% of the ratings for NR 
words but only 67% of the ratings for 
R words. Moreover, given that a 
maximal confidence category was em- 
ployed, it was incorrectly employed 9% 
of the time for NR words but 20% 
of the time for R words. In other 
words, not only were Ss more confi- 
dent of their identifications for the NR 
words, but also this confidence was less 
likely to be mistaken. 


Experiment II 


List 1 recall—The mean numbers of 
List 1 words recalled were 8.56 and 
8.39 from the R and NR categories, 
respectively. The mean difference 
score for the 18 Ss was precisely zero. 


Recall on the category level was 
scored by counting a category omission 
if none of the words of a given category 
appeared on the recall sheet. For pur- 
poses of scoring, R categories were re- 
garded separately by list. Of a total 
of 16 List 1 categories omitted, 14 were 
NR categories. That such an imbalance
in category recall is to be found
in the face of no difference in total 
words recalled suggests a difference in 
the number of items recalled within 
each category. The results of such an 
analysis are interesting. Given cate- 
gory recall, for NR categories, the mean 
number of words recalled per category 
was 3.82. For R categories, the com- 
parable figure is 2.94 words. A 
paired-t test shows this difference to be 
significant, t(17) = 2.93, p < .01.

Thus, fewer List 1 NR categories 
are available, but more items within a 
recalled NR category are available. 
The overall equality in number of items 
recalled reflects the interplay of these 
separate processes. 

List 2 recall—The mean numbers of 
List 2 words recalled were 10.72 from 
the R categories and 10.11 from the 
NR categories. The difference is not 
significant, t(17) = 1.02.

There were three List 2 categories 
omitted in recall, and all three were NR 
categories. The mean numbers of 
words recalled per category, given re- 
call of the category, were 3.57 and 3.56 
for R and NR categories, respectively. 
There seems to be no difference in any 
of the recall measures for List 2 as a 
function of type of category. 

Order of recall—Postman and Kep- 
pel (1967) have reported a measure 
of dominance in the free recall of two 
lists which was applied to these data. 
The percentage of List 2 responses for 
each S occurring in the first half of the 
words recalled was determined and sub- 
tracted from the number expected on 
the basis of a chance recall order. The
mean value of this index, stated as a 
percentage of the possible difference, 
was 12.7. Following arc-sin transformation
of the percentages, a t test was
performed comparing the observed priority
effect with zero. The difference
was significant, t(17) = 3.39, p < .01.
Thus, there is evidence that List 2 
words are recalled earlier than List 1 
words. 

Another priority analysis was car- 
ried out with regard to R vs. NR cate- 
gory words, with list membership 
disregarded. For this analysis, the 
percentage of words from R categories 
occurring in the first half of recall was 
computed and compared with chance 
expectancy. The mean value of the 
priority index was 10.1%. This value 
differs significantly from zero, t(17) =
2.81, p < .02. Words from R cate- 
gories tend to be recalled earlier than 
words from NR categories. 

Alternation between lists within clus- 
ters of repeated words.—The question 
may be raised as to whether Ss dealt 
with the R categories separately by list 
or, instead, combined items from the 
same categories across lists. Another 
way of stating the question is to ask 
if it is more likely that Ss entered re- 
call with three larger R categories than 
with six smaller R categories. An ap- 
proach to the matter was made by 
borrowing the measure of alternation 
between two lists in free recall reported 
by Shuell and Keppel (1967). In the 
present case, the measure was applied 
only to clusters of three or more items 
from an R category containing at least 
one item from each list. Within such 
clusters the instances of alternation be- 
tween lists were counted and computed 
as a percentage of the maximum num- 
ber of possible alternations, given the 
number of items found from each list 
in that cluster. Before computing this 
percentage, which can vary from 0%
to 100%, 1 was subtracted from both
numerator and denominator since at 
least one such alternation must occur 
within a cluster representing both lists. 
Three of the 18 Ss were excluded from 
this analysis since they produced no 
appropriate clusters. For the remaining
15 Ss, the mean value of this measure
was 46.5%; if the 3 excluded Ss
are included with scores of 0%, the
mean value becomes 38.8%. In both
cases, the observed percentage is significantly
greater than zero by t test
at the .001 level of confidence. Thus, 
there is a significant tendency to alter- 
nate between lists when recalling words 
from R categories. 
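
The corrected alternation percentage can be sketched as follows. This is an illustration under stated assumptions rather than the Shuell and Keppel computation itself: the function name is hypothetical, list membership is coded 1 or 2, and the maximum number of alternations for a cluster with n1 and n2 items is taken to be 2n - 1 when n1 = n2 = n and 2 min(n1, n2) otherwise, which is one reading of "the maximum number of possible alternations." The last lines check that pooling the 15 scored Ss with 3 scores of 0% is consistent with the reported drop from 46.5% to 38.8%.

```python
def alternation_percentage(list_labels):
    """Corrected alternation score for one cluster.

    list_labels: list membership (1 or 2) of each word in a cluster,
    in order of recall. The cluster must have three or more items and
    contain at least one item from each list.
    """
    if len(list_labels) < 3 or set(list_labels) != {1, 2}:
        raise ValueError("cluster needs >= 3 items drawn from both lists")

    # An alternation is any adjacent pair of words from different lists.
    alternations = sum(a != b for a, b in zip(list_labels, list_labels[1:]))

    n1 = list_labels.count(1)
    n2 = list_labels.count(2)
    # Assumed maximum: perfect interleaving of the two lists.
    max_alternations = 2 * n1 - 1 if n1 == n2 else 2 * min(n1, n2)

    # One alternation is unavoidable, so 1 is removed from both terms.
    return 100.0 * (alternations - 1) / (max_alternations - 1)


print(alternation_percentage([1, 1, 2, 2]))  # blocked by list
print(alternation_percentage([1, 2, 1, 2]))  # fully interleaved

# Pooling 15 scored Ss (mean 46.5%) with 3 Ss scored as 0%.
pooled_mean = 46.5 * 15 / 18
print(round(pooled_mean, 2))
```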

List differentiation—For both lists
combined, the mean proportion of words 
correctly recalled that were correctly 
assigned to list was .79 for words from 
R categories and .93 for words from 
NR categories. Fifteen of the 18 Ss 
correctly identified a higher proportion 
of the NR items, p < .01 by the sign 
test. For List 1, the proportions of 
correct list identifications were .77 and 
.91 for R and NR conditions, respectively.
For List 2, the corresponding
proportions were .80 and .96. For each 
list, the superior performance for NR 
categories is significant by the sign test 
(p < .05). It should be noted that 
the list-identification data of Exp. II
represent discriminations only for items 
which were recalled, whereas in Exp. I 
all items were tested. On the other 
hand, the test of list identification in 
Exp. II followed 7 min. of recall and 
is thus subject to forgetting as well as 
possible interference from the recall
process itself.
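
The sign test on the 15 of 18 Ss can be checked with an exact binomial computation. The sketch below is illustrative only: the function name is hypothetical, a two-tailed test is assumed, and the 3 remaining Ss are treated as reversals rather than ties, since the text does not say which they were.

```python
from math import comb


def sign_test_p(successes, n):
    """Two-tailed exact sign test under the null hypothesis p = .5."""
    k = max(successes, n - successes)
    # Probability of k or more outcomes in the more frequent direction.
    upper_tail = sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n
    return min(1.0, 2 * upper_tail)


p_value = sign_test_p(15, 18)
print(p_value < 0.01)
```

Under these assumptions the probability falls below .01, in agreement with the level reported for both lists combined.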


DISCUSSION 


List differentiation—The results of
Exp. I show that confusion of list mem- 
bership is greater when category mem- 
bership is redundant across lists than 
when it is not. The list-identification 
data of Exp. II indicate that this effect
obtains as well when available items only 
are tested. The latter finding indicates 
that words which are available are not 
necessarily capable of being correctly dif- 
ferentiated. One can recall a vegetable 
without knowing on which list the vege- 
table occurred. An important methodo- 
logical point may be drawn from the im- 
perfect correlation between recall and 
LD. In a similar experiment involving 
the learning of two lists constructed from 
repeated categories, Shuell (1968) found 
significantly more retroactive inhibition 
when only List 1 recall was demanded 
than when recall of both lists was tested. 
Shuell suggested, and this writer agrees, 
that degraded LD leads to the underestimation
of availability with single-list recall
in this situation. With redundant
categories, Ss are faced with the dilemma 
of either making some interlist intrusions 
or withholding some available items of 
whose list membership they are in doubt. 
The decision process apparently acts to 
minimize intrusions at the cost of lower 
recall scores. 

Recall—Interpretation of the recall
data of Exp. II is complicated by the dif- 
ferent outcomes obtained at different 
levels of analysis. Thus, in terms 
of total words recalled, interlist related- 
ness had no effect; in terms of category 
recall, there was a marked loss of List 1 
NR categories; in terms of words per 
category recalled, there was a marked loss 
of List 1 words from R categories. One 
factor which seems implicated in account- 
ing for the loss of List 1 NR categories is 
output interference. On the basis of the 
priority analyses performed, List 1 NR 
categories have the lowest recall priority. 
Hence, in general, recall of these cate- 
gories would have been attempted after 
the most intervening time and interpolated 
recall. Whether the priority analysis reflects
output interference or, instead, registers
the outcome of a process completed
prior to recall remains an open question,
however.

From another direction, the loss of List 
1 NR categories is not surprising. It 
may be assumed that Ss had nine functional
categories, of which six were NR
and three were R categories. Recall of
the R categories should have been facil- 
itated by repetition and strong associa- 
tions between first- and second-list in- 
stances. Given that category recall of 
nine categories is less than perfect, earlier 
categories would be more likely to be lost 
than later ones. In other words, List 1 
NR categories are the most vulnerable in 
this situation. 

The data on recall per category, given 
category recall, may also be viewed as re- 
flecting the operation of a recency prin- 
ciple when overloading occurs, but in this 
case overloading within a category. As 
Tulving and Pearlstone (1967) and Ear- 
hard (1967) have shown, a given cate- 
gory cue cannot hold an unlimited num- 
ber of instances. In the present case, as 
the alternation measure seems to show, 
repeated categories can be considered to 
have had up to 12 items placed into them. 
The evidence is that the more recent items 
are more available at recall than the 
earlier items. 

The recall data of this study are not 
inconsistent with an unlearning inter- 
pretation of List 1 forgetting. Shuell 
(1968), using an unmixed-list design, 
found substantially the same pattern of 
loss of availability in a similar experiment 
and interpreted his results in terms of 
mechanisms commonly adduced in paired- 
associate studies of retroactive inhibition 
using the A-B, A-C paradigm. It should 
be pointed out that, unlike Shuell’s pro- 
cedure, the present study gave only pres- 
entations, but no recall tests, during the 
learning phase. Recent statements of the 
unlearning hypothesis stress the impor- 
tance of elicitation, overt or not, of the 
List 1 response during List 2 learning 
(Keppel, 1968). In free-recall learning, 
the most likely locus for such a process 
would be during the recall phases of List 
2 learning. Since the present procedure 
used no such tests, one is forced to assume 
that unreinforced List 1 elicitations oc- 
curred as implicit rehearsals during List 
2 presentations. At minimum, this repre- 
sents an extension of the hypothesis. On 
the other hand, the present data, together
with the studies of Postman and Keppel
(1967) and Shuell (1968), show that substantial
first-list retention losses occur as
a consequence of interference processes 
in free-recall learning, and it would be 
desirable to encompass the paired-asso- 
ciate and free-recall findings within a 
single conceptual framework. 

TASK DIFFICULTY AND THE FRUSTRATION EFFECT


RONALD R. SCHMECK AND JAMES L. BRUNING
Ohio University 


This within-S experiment concerns the relationship between motivation 
and performance on tasks of varying difficulty. The difficult task 
(a linear maze) was preceded by a double runway similar to the 
type employed by Amsel and Roussel (1952). The results indicated 
that running speed in the small segment of straight runway preceding 
the linear maze was faster following nonreward in the 1st goal box
than following reward (p<.001). Concerning the errors made in 
the linear maze, although the overall difference between rewarded 
and nonrewarded trials was not significant, the Trials X Reward interaction
was significant (p<.025), and t tests indicated that Ss made
fewer errors following reward. The results were interpreted in 
terms of Spence’s (1956) treatment of the motivation-performance 
relationship and frustration theory as elaborated by Amsel (1962). 


The literature contains many studies 
investigating the effects of nonreward 
on the performance of a previously re- 
warded simple running response (Am- 
sel & Roussel, 1952; Wagner, 1959). 
However, the literature contains no 
studies investigating this source of 
frustration as it affects the performance 
of a complex task. The latter prob- 
lem was the one of interest in the fol- 
lowing study. 

The apparatus used for this experi- 
ment was essentially a combination of 
the double runway used by Amsel and 
Roussel (1952) and the linear maze 
used by Hill (1939). Following reward 
or nonreward in the first goal box, 
S was released into the second “alley.” 
This second alley consisted of a short 
runway leading directly to a complex 
linear maze with the correct path ter- 
minating at a second goal cup. Thus, 
it was possible to obtain a within-S 
measure of the starting speeds follow- 
ing reward or nonreward in the first 
goal box and, then, the number of 
errors made in the linear maze as S 
moved toward the second goal cup. On 
the basis of Spence’s (1956) predictions
concerning the effects of increased
drive on the performance of simple and
complex tasks, it was predicted that
the heightened drive resulting from 
frustrative nonreward in the first goal 
box would produce increased running 
speed in the initial segment of the sec- 
ond alley, but would interfere with per- 
formance in the maze section and thus 
lead to an increased number of errors. 


METHOD 


Subjects—Thirty-six Sprague-Dawley al- 
bino rats from the colony maintained by the 
Psychology Department at Ohio University 
served as Ss. The animals were 110 days 
old at the beginning of the experiment. 

Pretraining procedure—All Ss were placed 
on a 22-hr. food-deprivation schedule 1
wk. prior to pretraining. In addition, each S
was handled for 1 hr. each day during this
period.

All Ss were given pretraining trials in 
the usual (Amsel & Roussel, 1952) manner 
with the exception that three swinging doors 
were placed in the second alley of the pre- 
training apparatus. These doors served to 
train Ss to push through a succession of
doors in order to obtain reinforcement. 

The first alley of the pretraining apparatus
consisted of a start box (12 in. long), alley
(24 in.), and a goal box (12 in.). The second
alley (59 in.) led directly into a
second goal box (18 in.). The first start
box, alley, and goal box, plus the first 23 in.
of the second alley, constituted a single unit
which could be detached and connected in
series with the linear maze during the experimental
phase. The remaining 36 in. of
the second alley contained the three swing- 
ing doors placed at 12-in. intervals. The 
apparatus was 4 in. wide, 5 in. high, and was 
painted flat black. 

Pretraining consisted of three trials per 
day in which Ss in the experimental group 
and in one of the control groups received 
100% reinforcement in both goal boxes. The 
Ss in a second control group did not receive 
any reinforcement in the first goal box, but 
were always reinforced in the second. A 
total of 54 pretraining trials were given. In 
addition to pretraining in the double alley, 
all Ss were given three exploratory trials 
in the linear maze to reduce random behavior 
during the experimental training trials. 

Experimental procedure—The experi- 
mental apparatus consisted of the detachable 
segment of the pretraining apparatus con- 
nected in series with the linear maze. The 
maze contained four rows of four identical 
swinging doors (23 in. wide). Only one 
door in a row was unlocked at any one of 
the choice points. Pressure against the 
locked doors activated microswitches which 
were wired to counters that recorded the 
frequency of incorrect choices (movements 
of the locked doors). Under these condi- 
tions it was possible for an animal to make 
multiple errors at the same locked door of 
the maze or to return to the same locked 
door after pressing several others; it was 
observed that this did frequently occur dur- 
ing the experimental phase. 

During each trial of the experimental 
phase, S was placed in the first start box, the 
first start door was opened, and S ran to 
the first goal box. Ten seconds after entry 
into the first goal box the second start door 
was opened. This activated a clock which 
was stopped when S broke a photoelectric 
beam 20 in. from the start door. The S encountered
the first choice of the linear maze
15 in. beyond the beam of light and, after
pushing through the series of four unlocked
doors, gained access to the second goal cup. 
The Ss were randomly assigned to one of
three groups. Those in the experimental 
condition (Group 100-50) received 100% 
reinforcement in the first goal box during the 
pretraining phase and were switched to 50% 
reinforcement during the experimental phase. 
The Ss in the first control condition (Group 
100-100) received 100% reinforcement in the 
first goal box during both the pretraining 
phase and the experimental phase. The Ss 
in the second control condition (Group 0-0) 
received no reinforcement in the first goal 
box during either the pretraining or the 
experimental phase of the experiment. All
Ss received 100% reinforcement in the second 
goal box during both phases of the experi- 
ment. Reinforcement consisted of 5 Noyes 
pellets. During the experimental phase, Ss 
were given four trials per day for 5 consecu- 
tive days. The measures recorded were the 
starting speeds for the first 20-in. segment of 
the second alley and the number of errors 
in the linear maze. 


RESULTS AND DISCUSSION


The mean starting speeds (recipro- 
cal latencies) for all groups were com- 
bined into five blocks of two trials 
each (see Table 1). The first statistical
analysis was computed to determine the
performance differences of the experimental
group following reward and
following nonreward over the five
blocks of trials. This was computed
as a Treatment X Treatment X Ss analysis
of variance and indicated that the
overall mean speed following nonreward
was faster than that following reward,
F (1, 10) = 98.81, p < .001.1 In


1The F ratios for the ABS analysis of 
variance used on the speeds of the experi- 


TABLE 1
MEAN SPEEDS (RECIPROCAL LATENCIES) AND ERRORS PER TRIAL BLOCK
(Columns: Speed, Error)
addition, the Trials X Reinforcement
interaction was also significant, F (4, 
40) = 5.28, p< .01. These data are 
entirely consistent with previous find- 
ings (Amsel & Roussel, 1952). The 
heightened running speed is theoreti- 
cally due to an additional source of 
drive provided by an unconditioned 
frustration reaction aroused by non- 
reward in the first goal box (Amsel, 
1962). Additionally, these data would 
provide support for the theoretical 
treatment by Spence (1956) regarding 
the motivation—performance relation- 
ship when the task is of a simple na- 
ture. Since Ss had been given 54 pre- 
training trials and were simply re- 
quired to run straight ahead for the 
first 20 in. of the second alley, the 
heightened drive would be expected to 
improve performance.

The control groups (Groups 100-100 
and 0-0) were included in order to in- 
vestigate the possibility that the frus- 
tration effect as measured in the first 
20 in. of the second alley, rather than 
being attributable to an additional 
source of drive, might be due to the 
removal of a response depression varia- 
ble (McHose & Ludvigson, 1965; 
Wagner, 1959). Analysis of variance 
comparing the performance of Group 
0-0 and Group 100-50 on nonrewarded 
trials showed that the speeds of Ss in 
Group 100-50 were significantly faster, 
F (1, 20) = 5.52, p < .05. This indi- 
cates that the increased running speed 
noted following nonreward in the 100- 
50 condition was not due to the simple 
removal of a depression factor. The 
performance of the second control 
group (Group 100-100) was compared 
with the performance of Ss in Group 
100-50 on rewarded trials. This was 
done simply to insure that the overall 


mental (100-50) group were computed as fol- 
lows: reward = 1907/19.30, trials = 9.50/8.53, 
Reward X Trials = 28.00/5.30. 
performance of the 100-50 group was
not elevated as a result of sampling 
error. The analysis showed the dif- 
ference between these two groups to be 
nonsignificant, F (1,20) = 2.20. 
Turning to the complex task, the er- 
rors were also combined into five blocks 
of two trials each. Again, a Treatment 
X Treatment X Ss analysis was com- 
puted to compare the performance of 
Ss in Group 100-50 following reward 
with their performance following nonreward
over the five trial blocks.2 This
analysis revealed that, while the over- 
all difference between rewarded and 
nonrewarded trials was not significant, 
the Trials x Reward interaction was 
significant, F (4, 40) = 3.65, p < .025. 
The failure to obtain a significant over- 
all difference was due to the fact that 
early in the experimental phase Ss 
made more errors following reward 
(X = 8.90) than following nonreward 
(X = 6.55). A t computed on the first
block of trials, however, indicated that
no significant difference existed, t (10)
= 1.79. Nevertheless, by the fifth block 
of trials the direction of the difference 
had reversed, and the comparison be- 
tween the number of errors following 
reward (X=2.09) and nonreward 
(X = 3.27) was significant, t (10) = 
2.36, p < .05. Also, a significant trials 
effect indicated that Ss were learning 
the correct path through the maze, F 
(4, 40) = 10.61, p < .001. 
Concerning the performance of the 
control groups in the linear maze, a 
between-S analysis of variance comparing
the mean errors of the 0-0 group
with those of the 100-50 group on non- 
rewarded trials showed that, although 
the difference was in the expected 
direction (i.e., the 0-0 group made


2 The F ratios for the ABS analysis of the 
errors made by the experimental (100-50)
group were computed as follows: reward =
7.00/2.80, trials = 83.25/7.85, Reward X Trials
= 17.50/4.80.
fewer errors), it did not reach significance,
F (1, 20) = 1.00. A second
analysis comparing the 100-100 group 
with the rewarded trials of the 100-50 
group indicated that this difference also 
was not significant. While the com- 
parison between the 0-0 group and the 
nonrewarded trials of the 100-50 group 
did not substantiate the results ob- 
tained with the speed data, it should be 
noted that, since Ss had not been pre- 
trained in the maze as they had in the 
straight alley, the error variance of the 
error scores was considerably larger 
than that of the speed scores.3
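
The F ratios given in the footnotes can be checked against the values reported in the text, since each is a ratio of an effect mean square to an error mean square. The sketch below is only a consistency check on the published figures; the dictionary keys and the function name are hypothetical labels, and the tolerance of .01 allows for rounding in the printed values.

```python
# (effect mean square, error mean square, F reported in the text)
REPORTED = {
    "speed, reward": (1907.0, 19.30, 98.81),
    "speed, Reward X Trials": (28.00, 5.30, 5.28),
    "errors, trials": (83.25, 7.85, 10.61),
    "errors, Reward X Trials": (17.50, 4.80, 3.65),
    "speed, 0-0 vs. 100-50 nonreward": (64.00, 11.60, 5.52),
    "speed, 100-100 vs. 100-50 reward": (25.00, 11.39, 2.20),
    "errors, 0-0 vs. 100-50 nonreward": (34.00, 33.85, 1.00),
}


def mismatches(reported, tolerance=0.01):
    """Return the labels whose MS ratio differs from the reported F."""
    bad = []
    for label, (ms_effect, ms_error, f_reported) in reported.items():
        if abs(ms_effect / ms_error - f_reported) >= tolerance:
            bad.append(label)
    return bad


for label, (ms_effect, ms_error, f_reported) in REPORTED.items():
    print(f"{label}: {ms_effect / ms_error:.2f} (reported {f_reported:.2f})")
print(mismatches(REPORTED))
```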

The above study seems to demon- 
strate clearly the facilitative effects of 
frustration on the performance of a 
simple task and the debilitative effects 
of the same internal state on the per- 
formance of a complex task. Within 
the framework of Hull-Spence behavior 
theory a simple task is defined as one 
involving few competing responses, 
with the habit strength of each of these 
responses being considerably less than 
that of the correct response. The 
straight alley portion of the apparatus 
used in the present study, as well as 
the second alley of the apparatus typi- 
cally employed by Amsel, clearly fits 
this definition. Very few competing 
responses should be present in the sec- 
ond alley, and the pretraining proce- 
dures insure that the habit strength of 
the running response is far greater than 
the habit strength of any competing re- 
sponses which do happen to be present. 
Under these conditions, a higher per- 
formance level should be obtained when 
Ss are performing under high drive. 

3 The F ratios for the between-S analyses 
of the control speed data were computed as 
follows: 0-0 vs. 100-50 nonreward = 64.00/ 
11.60, 100-100 vs. 100-50 reward =25.00/ 
11.39. The F ratios for the analyses of the 
control error data were computed as follows:
0-0 vs. 100-50 nonreward = 34.00/33.85, 100-100
vs. 100-50 reward = 86.00/33.72.


The greater speeds obtained in the 
above study following frustrative nonreward
in the first goal box lend support
to this analysis.

Also within the Hull-Spence frame- 
work, a complex task is defined as one 
which involves numerous competing re- 
sponses having habit strengths nearly 
equal to the habit strength of the cor- 
rect response. The linear maze used in 
the present study clearly fits this defini- 
tion. Due to the fact that each of the 
choice points and each of the doors 
within a single choice point were vir- 
tually identical, a considerable amount 
of habit strength would have general- 
ized from the correct stimulus to the 
other similar stimuli. Under these 
conditions, any increase in drive (e.g.,
following frustrative nonreward) would 
be likely to raise the excitatory poten- 
tials of these competing responses above 
threshold. The greater number of 
errors obtained in the later trials of 
the present study following frustrative 
nonreward lends support to this aspect 
of the analysis. On the other hand,
superior performance should be ob- 
tained when S is functioning under low 
drive since fewer competing responses 
would be raised above threshold under 
these conditions. The smaller number
of errors recorded on rewarded trials 
again lends support to the analysis. 
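
The argument above can be made concrete with the multiplicative rule of Hull-Spence theory, in which excitatory potential is the product of habit strength and drive (E = H X D). The sketch below is a toy illustration and not a model fitted to these data: the habit strengths, drive levels, threshold, and function name are all hypothetical values chosen only to show the direction of the predicted effects.

```python
def suprathreshold_competitors(competing_habits, drive, threshold):
    """Count competing responses whose potential E = H * D exceeds threshold."""
    return sum(1 for habit in competing_habits if habit * drive > threshold)


THRESHOLD = 0.5                 # hypothetical response threshold
LOW_DRIVE, HIGH_DRIVE = 1.0, 1.5  # hypothetical drive levels

# Simple task: few competitors, each far weaker than the running response.
simple_competitors = [0.10, 0.10]
# Complex task: many competitors nearly as strong as the correct response.
complex_competitors = [0.45, 0.40, 0.40]

for name, habits in [("simple", simple_competitors),
                     ("complex", complex_competitors)]:
    low = suprathreshold_competitors(habits, LOW_DRIVE, THRESHOLD)
    high = suprathreshold_competitors(habits, HIGH_DRIVE, THRESHOLD)
    print(f"{name}: {low} competitors under low drive, {high} under high drive")
```

With these illustrative numbers, raising drive leaves the simple task free of suprathreshold competitors but brings every competitor in the complex task above threshold, which is the pattern the analysis predicts for errors in the maze.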

One additional implication of the 
above analysis should be noted. While 
the debilitative effects of frustration are 
clearly predicted by Hull-Spence behav- 
ior theory, it might also be predicted 
that these effects would become less 
debilitative as training continues. Con- 
tinued training should raise the habit 
strength of the correct responses far- 
ther above that of the competing re- 
sponses. Consequently, it might be 
expected that the debilitative effects of 
frustration would become facilitative as 
training continues. It is possible that 
training was not continued far enough
in the present study to reveal such a 
trend. 

REWARD SCHEDULE EFFECTS FOLLOWING SEVERELY
LIMITED ACQUISITION TRAINING 1


E. J. CAPALDI, A. T. LANIER, AND R. C. GODBOUT 2


University of Texas 


The experimental design, a 2 X 2 factorial, contained 2 schedules of
reward (partial vs. continuous) and 2 levels of reward magnitude
(1.08- vs. .09-gm. food). The rats were extinguished in the straight
alley following only 5 acquisition trials. The extinction results were
quite similar to those previously reported by other investigators following
more extended acquisition training. The present results are
most relevant to those hypotheses which assume that a partial reinforcement
effect will occur only following a fairly substantial number
of acquisition trials.


McCain and his associates (e.g., McCain, 
1966) have obtained a partial reinforcement 
effect (PRE) following as few as five, four, 
three, and even two trials. The two-trial 
investigation indicates that a single transi- 
tion from a nonrewarded (N) to a re- 
warded (R) trial is sufficient to produce a 
PRE. Recently, however, Surridge, Rashotte,
and Amsel (1967) failed to obtain a
PRE following the four-trial sequence 
NRNR and criticized the investigations of 
McCain and associates suggesting that their 
results were due to specific experimental 
procedures. 

Small trial studies have typically compared 
only partial reward with consistent reward 
and have tended to report only one running 
measure in the alley. The present investiga- 
tion departed from this trend in being a 2 X 
2 factorial which employed two magnitudes 
of reward (1.08 vs. .09 gm.) and two schedules
of reward (partial vs. continuous).
Moreover, speeds across several alley sec- 
tions were taken. Wagner’s (1961) well- 
known investigation employed similar groups 
and 16 vs. 64 acquisition trials. Thus Wag- 
ner’s investigation provides several important 
base lines with which to compare the 
present results. 

1 This investigation was supported by National Institute of Child
Health and Human Development Research
Grant HD 00949-04 to the first author.
2 Public Health Service predoctoral fellow.

Method—The Ss were 48 naive male rats
approximately 90 days old when received
from the Holtzman Company, Madison, Wisconsin.

The apparatus consisted of an alley
painted a flat gray throughout, 80 in.
long, 4 in. wide, with 9-in. high sides, and
covered with 4-in. hardware cloth. An
8-in. long floor treadle suspended over a 
microswitch constituted the initial portion of 
the alley, and when the treadle was depressed
by the weight of the rat the first
.01-sec. clock was started. Clock 1 was
stopped (start time) and Clock 2 started
when S interrupted a photobeam 2 in. beyond
the treadle. Clock 2 was stopped (run
time) and Clock 3 started when S broke the
second photobeam located 52 in. from the
first. Interruption of the third photobeam
stopped Clock 3 (goal time). The third
photobeam was 15½ in. beyond the second
photobeam and 2½ in. in front of a brass
2½ × 4 × 1½ in. food cup. The times from the
three separate alley sections were summed
yielding the total time measure. A manually
lowered guillotine door, 14 in. from the
rear of the goal compartment, served to 
confine S to the goal area. Throughout all 
phases of training, S was allowed a maxi- 
mum of 60 sec. to traverse each section of 
the runway; when 60 sec. elapsed in a 
single section of the runway, S was guided 
down the runway until the next photobeam 
was broken. 
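
The timing measures just described can be sketched as follows. This is an illustration rather than the authors' data-reduction routine: the function name and the example times are hypothetical, each section time is assumed to be capped at the 60-sec. maximum, and speed is taken as the reciprocal of time, as in the analyses reported later.

```python
MAX_SECTION_SEC = 60.0  # maximum time allowed in any one alley section


def runway_measures(start_time, run_time, goal_time):
    """Return capped section times, their sum, and reciprocal speeds."""
    start, run, goal = (
        min(t, MAX_SECTION_SEC) for t in (start_time, run_time, goal_time)
    )
    total = start + run + goal  # total time is the sum of the three sections
    return {
        "total_time": total,
        "start_speed": 1.0 / start,
        "run_speed": 1.0 / run,
        "goal_speed": 1.0 / goal,
        "total_speed": 1.0 / total,
    }


# Hypothetical trial in which S exceeded 60 sec. in the goal section.
measures = runway_measures(2.0, 4.0, 70.0)
print(measures["total_time"], measures["goal_speed"])
```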

Preliminary training was given to Ss on 
arrival at the laboratory. The Ss were 
housed in individual cages and placed on an 
ad-lib feeding schedule. The Ss were randomly
divided into four equal replications of 12 Ss 
each. On the fourth day after arrival, Ss 
in the first replication were placed on a 23-hr.
food-deprivation cycle. Deprivation cycles 
for the other three replications were initiated
on the seventh, tenth, and thirteenth
days after arrival, respectively. The 3-day
interval between replications allowed
each replication to complete its experimental
training before the next replication began 
experimental training. The deprivation cycle 
was maintained by providing a constant daily 
ration of 12 gm. of Wayne Lab Blox. The 
deprivation cycle was continued for a total 
of 24 days, including 21 days preceding ex- 
perimental training and the 3 days of experi- 
mental training. On each of the 10 days 
immediately preceding experimental training, 
each replication was handled in a group for 
a period of 10-15 min. No food was available
during the handling session, i.e., S was
not fed outside the home cage prior to ex- 
perimental training. Upon return to the 
home cage after a handling session, each S 
was fed 8-10 .045-gm. Noyes pellets. The 
12-gm. daily ration was received 10 min. 
after the Noyes pellets were consumed. It 
should be noted that Ss were given neither 
alley exploration, rewarded goal-box place- 
ments, nor any other contact with the appa- 
ratus prior to experimental training. Also 
to be noted is that experimental training oc- 
curred in a small cubicle adjacent to the 
large colony room where Ss were housed, a 
cubicle unfamiliar to Ss. 

Experimental training was given to Ss 
which consisted of 1 acquisition trial on the 
first day, 4 acquisition trials on the second 
day, and 10 nonreward trials on the third 
day. On the first day of experimental train- 
ing, the 12 Ss in a replication were randomly 
divided into four equal groups corresponding
to the four experimental treatments.
The experimental groups were: (a) large 
partial (LP), given 3 large reward and 
2 nonreward trials in acquisition according 
to the following schedule—LNLNL; (b) 
large consistent (LC), given 5 large reward 
trials in acquisition; (c) small partial (SP), 
given 3 small reward and 2 nonreward trials 
in acquisition according to the following 
schedule—SNSNS; (d) small consistent 
(SC), given 5 small reward trials during 
acquisition. Large reward consisted of 24 
.045-gm. Noyes pellets; small reward consisted
of two .045-gm. Noyes pellets. On rewarded
trials S was removed from the goal
box immediately after consumption of the 
reward. On nonrewarded trials in both ac- 
quisition and extinction, Ss were confined in 
the unbaited goal box for a period of 30 
sec. Within each replication three squads 
were formed by selecting 1 S from each of
the four experimental groups. The 4 Ss in 
a squad were carried as a group into the 
cubicle housing the experimental apparatus. 
The squad members were then run in rotation
according to a predetermined schedule,
and the resulting ITI for both acquisition 
and extinction was approximately 3-5 min. 
Each squad received the daily ration a mini- 
mum of 10 min. following the termination of 
its daily experimental session. The 12-gm. 
daily ration of Wayne Lab Blox for each 
S was decreased an amount corresponding to 
the amount of food which that particular ani- 
mal had consumed during its experimental 
trials on that day. 
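
The adjustment of the daily ration can be illustrated with a short calculation. The sketch assumes that S consumed every pellet offered and that the four second-day trials were Trials 2-5 of the acquisition schedule (e.g., NLNL for Group LP); the function name and the single-letter trial codes are illustrative labels, not part of the original procedure.

```python
PELLET_GM = 0.045        # weight of one Noyes pellet
DAILY_RATION_GM = 12.0   # constant daily ration of Wayne Lab Blox
PELLETS_PER_TRIAL = {"L": 24, "S": 2, "N": 0}  # large, small, nonreward


def adjusted_ration(trial_codes):
    """Daily ration minus the food consumed on that day's trials."""
    pellets = sum(PELLETS_PER_TRIAL[code] for code in trial_codes)
    return DAILY_RATION_GM - pellets * PELLET_GM


# Day 2 (Trials 2-5) for each group under the stated assumption.
for group, trials in [("LP", "NLNL"), ("LC", "LLLL"),
                      ("SP", "NSNS"), ("SC", "SSSS")]:
    print(group, round(adjusted_ration(trials), 2))
```

The same constants reproduce the reward magnitudes given in the design: 24 pellets weigh 1.08 gm. and 2 pellets weigh .09 gm.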

The Ss in a given replication were sacri- 
ficed soon after experimental training had 
been completed. Due to an unfortunate con- 
fusion the laboratory custodian killed not 
only the third replication which had com- 
pleted training but also 6 Ss in the fourth 
replication which had completed only their 
first day of experimental training. Two of 
these Ss were in LP, 2 in LC, 1 in SP,
and 1 in SC. One S in SP was dis- 
carded for refusal to run. In order to 
achieve equal numbers of Ss per group, the 
S in group SC whose mean Anderson rate 
score (Anderson, 1963) over the 10 extinc- 
tion trials for the total speed measure (re- 
ciprocated total time) fell closest to the 
corresponding mean for that group was dis- 
carded. All reported analyses are based 
upon the resulting 10 Ss per group. 

Results—Running speeds (1/time) in each
alley section for each group on each of 
the five trials of acquisition are shown in 
Fig. 1. Differences increased over trials 
with, generally speaking, consistent reward 
producing faster running than partial reward 
and large reward producing faster running 
than small reward. These, of course, are the 
expected results. All analyses to be re- 
ported have df=1, 36. The effects of re- 
ward magnitude (M) were not significant in 
start, F=2.07, p>.10, or run, F=3.21,
p<.10, but were significant in goal, F= 
4.55, p<.05. The effects of reward sched- 
ule (RS) were not significant in start, F< 
1, or run, F<1, but were significant in 
goal, F=8.07, p<.01. In all sections, 
the RS X M interaction produced F<1. In 
view of the acquisition differences, the ex- 
tinction speeds were converted to rate mea- 
sures as recommended by Anderson (1963). 
Each S’s speed in the appropriate alley sec- 
tion on the initial nonrewarded trial of Day 3 
of experimental training (which, of course, 
was not an extinction trial) served as the 
estimate of the acquisition asymptote. The 
extinction asymptote was taken to be the 
reciprocal of 60 sec. or the maximum time 
recorded in any alley section. The extinc- 
tion asymptote employed here produced a 
rate transformation which is actually quite 
similar to the one employed by Wagner 
(1961). The resulting rate measures in the 
run and goal are shown in Fig. 2. The rates 
for start are not shown because in all start 
comparisons F<1. The ordering of the
groups in Fig. 2 is exactly the same as re- 
ported by Wagner: rate of decline in extinc- 
tion was least for LP, followed in order by 
SP, SC, and finally by LC. An analysis 
of the rate measures shown in Fig. 2 indi- 
cated that the effects of RS were significant 
in run, F=10.69, p<.01, and goal, F= 
12.72, p < .01, while the effects of M were 
not significant in either run, F <1, or goal, 
F<1. The RS X M interaction was not
quite significant in run, F=3.45, p<.10, 
or goal, F= 3.74, p<.10. It is interesting 
to note that an analysis of rate measures for 
total speed, i.e., along the total length of
the alley, produced an RS X M interaction 
which was significant, F=5.92, p<.025.
Subsequent Duncan's range tests indicated
that LP was significantly different from SC,
p<.05, and LC, p<.01, in run and goal.
The difference between LP and SP was not 
significant in any section. Moreover, neither 
differences between SP and SC nor differences
between LC and SC were significant.
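
The conversion of extinction speeds to rate measures described above can be illustrated with a short sketch. The formula of Anderson (1963) is not reproduced in this report, so the normalization used here (the distance of each extinction speed from the extinction asymptote, divided by the span between the acquisition and extinction asymptotes) is an assumption, as are the function name and the sample running times.

```python
def rate_measure(speed, acq_asymptote, ext_asymptote=1 / 60):
    """Rescale a running speed (1/time) to the span between two asymptotes.

    Assumed form: 1.0 means the S is still at its acquisition asymptote,
    0.0 means it has reached the extinction asymptote (reciprocal of 60 sec.).
    """
    span = acq_asymptote - ext_asymptote
    if span <= 0:
        raise ValueError("acquisition asymptote must exceed extinction asymptote")
    return (speed - ext_asymptote) / span


# Hypothetical running times (sec.) for one S over successive extinction trials.
extinction_times = [2.0, 4.0, 10.0, 60.0]
acq_asymptote = 1 / 2.0  # speed on the initial nonrewarded trial of Day 3

rates = [rate_measure(1 / t, acq_asymptote) for t in extinction_times]
```

Because each S is scaled against its own acquisition asymptote, groups that ended acquisition at different speeds can be compared on rate of decline rather than on absolute speed.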
Discussion—Some of the differences obtained
by Wagner (1961) in the start measure
were significant, e.g., LP vs. LC, but
none approached significance here. The 
groups in this investigation and in Wagner's 
were ordered alike in the run and goal 
measures. However, Wagner’s differences 
in run and goal were statistically larger. 
For example, the RS X M interaction, sig- 
nificant in Wagner’s investigation, only ap- 
proached significance in run and goal in the 
present experiment, p<.10. Too, in 
Wagner’s investigation SP and SC differed 
reliably in the goal measure (but not start 
or run), but this difference was not signifi- 
cant in the goal here. The present results 
suggest that in the run and goal measures 
reliable differences between LP and LC 
and between LP and SC will occur follow- 
ing very few acquisition trials. The present 
results also suggest that all other reward 
schedule effects obtained by Wagner fol- 
lowing 16 and 64 trials are in the process 
of development from the very outset of 
training and will appear, in some form at 
least, following as few as 5 acquisition trials. 
The substantial correspondence between 
the present results and those of Wagner 
(1961) would seem to render unlikely that 
the small trial PRE obtained here and thus 
by McCain (1966) and associates is due to 
experimental error. Accordingly, the pres- 
ent results would seem to have several im- 
plications for hypotheses which assume that 
the rat's reaction to nonreward changes over 
trials (nonreward becomes frustrating, etc.) 
and that it is this changed reaction which is 
ultimately responsible for the PRE (Amsel, 
1958). The present results suggest either 
that a PRE may occur in the absence of 
the changed reaction or that the changed 
reaction to nonreward occurs in many fewer
trials than previously assumed.


ACTIVATION, MANIFEST ANXIETY, AND VERBAL LEARNING


ROBERT E. THAYER AND SHEILA J. COX
California State College at Long Beach 


The Activation-Deactivation Adjective Check List (AD-ACL), a self- 
report test of transitory levels of activation, was hypothesized to have 
advantages over the Manifest Anxiety (MA) scale as an indicant of 
drive in tests of predictions drawn from Hull-Spence theory. 2 groups 
of Ss tested in a low-threat group procedure learned different lists of 
paired associates designed to elicit 2 levels of response competition. 
Groups differentiated by MA scale scores did not confirm predictions 
drawn from Hull-Spence theory, while predictions were confirmed by 
groups differentiated by AD-ACL scores. 


Within the framework of Hull-Spence 
theory, the relationship between performance 
and level of emotionality as assessed by the 
Manifest Anxiety (MA) scale has been ex- 
tensively investigated (Spence & Spence, 
1966). Scores on the MA scale, which pre- 
sumably denote the tendency to exhibit overt 
anxiety symptoms and thus emotionality, have 
been used as indicants of drive (D). Ob- 
vious alternatives to MA scale scores are 
physiological measures of emotionality which 
could be obtained within the experimental 
situation and would not have the limited 
generality of anxiety responses. The Acti- 
vation-Deactivation Adjective Check List 
(AD-ACL) correlates substantially with 
composites of physiological variables and 
has advantages over individual physiological 
variables that are often poorly correlated 
(Thayer, 1967). Thus, this test can be used 
in situations in which physiological measures 
are difficult to obtain. The AD-ACL con- 
sists of a set of self-descriptive adjectives 
with which S reports his momentary percep- 
tion of feelings of activation by rating each 
adjective on a 4-point scale. Four orthogonal 
activation dimensions have been factor ana- 
lytically defined, and one dimension, General 
Activation (e.g., lively, active, full-of-pep,
energetic), has been most sensitive in a wide 
variety of activation-inducing conditions. 
The present research was designed to test 
predictions drawn from Hull-Spence theory 
and to compare the AD-ACL and MA scale 
as indicants of D. 


Experiment I—Spence, Farber, and 
McFann (1956) constructed a paired-asso- 
ciates list to test a prediction drawn from 
Hull-Spence theory that high-D Ss would 
perform better than low-D Ss on tasks with 
dominant correct responses and few compet- 
ing responses. Using the MA scale as an 
indicant of D, they found reliably superior 
performance by the high-D group. In the 
present research, which included the previously
constructed paired-associates task,
superior performance by the high-D group 
was also predicted. In the original study the 
Iowa group employed an individual testing
procedure and short stimulus presenta- 
tion times; the present research used an 
anonymous group testing procedure and long 
stimulus presentation times in an attempt to 
reduce the arousal of emotionality among 
Ss. It was assumed that stress is a com- 
ponent of most psychology experiments and 
that the present low-threat group procedure, 
although somewhat stressful, would require 
maximum sensitivity of the two indicants of 
D because of the permissive conditions em- 
ployed. 

Method—The Ss in Exp. I were 74 male 
introductory psychology college students. 
For analyses using the MA scale (Taylor, 
1953), divisions were made into upper and 
lower 20% of all 160 Ss in Exp. I and II.
Cutoff points were 23 and above and 8 and
below. Experiment I analysis involved 15
high and 20 low MA scale Ss. For analysis 
using the General Activation subscale of the
AD-ACL (Thayer, 1967), all Ss in Exp. I
and II were divided into upper and lower 
thirds. The rating scale for each self- 
descriptive adjective included: definitely do 
not feel, cannot decide, feel slightly, and 
definitely feel. The four alternatives were 
scored 1 through 4, respectively, and 
seven adjectives were used in the General 
Activation subscale. The cutoff points for 
upper and lower thirds for both groups were 
19 and above and 12 and below (mean cutoff 
points, 2.7 and 1.7). Analysis for Exp. I
involved 21 high and 33 low AD-ACL Ss. 
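
The scoring rule just described can be sketched in a few lines. The four rating labels, the 1 through 4 scoring, the seven-adjective subscale, and the cutoffs of 19 and 12 come from the text; the function names and the "middle" label for unclassified Ss are illustrative assumptions.

```python
# Rating alternatives in the order given in the text, scored 1 through 4.
RATING_SCORES = {
    "definitely do not feel": 1,
    "cannot decide": 2,
    "feel slightly": 3,
    "definitely feel": 4,
}
N_ADJECTIVES = 7      # adjectives in the General Activation subscale
HIGH_CUTOFF = 19      # upper third: 19 and above
LOW_CUTOFF = 12       # lower third: 12 and below


def general_activation_score(ratings):
    """Sum the scored ratings for the seven General Activation adjectives."""
    if len(ratings) != N_ADJECTIVES:
        raise ValueError("expected one rating per adjective")
    return sum(RATING_SCORES[r] for r in ratings)


def classify(score):
    """Assign an S to the high, low, or (unanalyzed) middle activation group."""
    if score >= HIGH_CUTOFF:
        return "high"
    if score <= LOW_CUTOFF:
        return "low"
    return "middle"


# The mean cutoff points quoted in the text are the totals divided by 7.
mean_cutoffs = (round(HIGH_CUTOFF / N_ADJECTIVES, 1), round(LOW_CUTOFF / N_ADJECTIVES, 1))
```

Dividing the total cutoffs by the number of adjectives gives 2.7 and 1.7, which matches the mean cutoff points reported above.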

A paired-associates list including 15 high 
intrapair association, low intralist association 
adjectives was used (Spence et al., 1956). 
The words were presented with a Kodak 
Carousel slide projector and a manual slide 
change control on a viewing screen in front 
of an average size classroom. The slide 
presentation time—10 sec. per stimulus, 5 
sec. per pair, and 5 sec. between trials—was 
determined with a stopwatch. In the learn- 
ing task, Ss recorded their responses in a 
small stapled booklet made of plain mimeo- 
graph paper cut into 45 numbered sheets. 

The Ss were run in several groups ranging
in number from 20 to 35. They were told 
their group was part of a large number of 
introductory students participating in this 
study. An E explained that a list of paired 
associates would be presented four times in 
different orders. Starting on the second 
presentation, as the stimulus word appeared, 
Ss would write the response word on one 
of the numbered sheets and fold the sheet 
back before the confirming pair appeared. 
The students were twice reminded of the 
anonymity of their recorded responses and 
asked to cooperate by stopping and folding 
the sheet back immediately upon the signal. 
During the experiment E observed Ss and
noted that cooperation was excellent. After 
the instructions Ss filled out an AD-ACL, 
participated in the paired-associates task, and 
last filled out an MA scale. 

Results—The total number of correct 
words per trial was scored. Performance 
means based on MA scale and AD-ACL 
divisions are presented in Table 1. Per- 
formance means based on MA scale divisions 
were in the opposite direction to predictions 
drawn from Hull-Spence theory, and a be- 
tween-groups analysis of variance indicated 
nonsignificant differences. The means based 
on AD-ACL divisions were in the expected 
direction, and an analysis of variance showed 
that the high-activation group performed reliably
better than the low-activation group,
F (1, 52) = 4.08, p<.05. The within-groups
interaction was not significant.


TABLE 1

MEAN PERCENTAGE OF CORRECT RESPONSES OF HIGH-
ASSOCIATION PAIRED ASSOCIATES FOR ANXIETY
AND ACTIVATION GROUPS

                          Percentage of Correct Responses
Groups                      T1       T2       T3
High Anxiety (n = 15)      61.3     77.8     87.5
Low Anxiety (n = 20)       69.7     87.3     94.3
High G-Act (n = 21)        72.4     89.2     94.3
Low G-Act (n = 33)         61.4     81.0     89.7

Experiment II—Using MA scale scores as 
indicants of D and a second paired-associates 
list, Spence et al. (1956) tested a prediction 
drawn from Hull-Spence theory that low-D 
Ss would perform better than high-D Ss 
on tasks in which competing responses are 
initially stronger than correct responses.
Results indicated that low-D Ss learned significantly
faster than high-D Ss. The
paired-associates list that was used included 
four high-association pairs from the list 
described in Exp. I and eight low-association 
pairs constructed in such a way that response
tendencies were higher for the high-association
response words than for the low-association
response words. The Iowa
group predicted superior initial performance 
by the high-D Ss on the high-association 
pairs; because of increasing interference 
among high- and low-association pairs they 
also predicted gradual superiority by low-D 
Ss as learning progressed. Some support 
was found for these predictions. In the 
present experiment the authors predicted 
superiority by low-D Ss on high- and low- 
association paired associates as well as the 
total list. It was assumed that using the 
low-threat group procedure described in Exp. 
I any initial superiority by the high-D Ss 
would be eliminated because of the probable 
rapid learning.

Method—The Ss were 86 introductory
psychology college students. The
cutoff points of the MA scale and AD-ACL
were described in Exp. I. In the present
experiment 16 and 15 Ss were in the upper
and lower 20%, respectively, of the MA
scale. And 31 and 22 Ss were in the respective
upper and lower thirds of the AD-ACL.
The 12 paired associates were taken from 
the study by Spence et al. (1956). The 
other apparatus and procedure were described
in Exp. I.


TABLE 2

MEAN PERCENTAGE OF CORRECT RESPONSES ON HIGH- AND LOW-ASSOCIATION PAIRED ASSOCIATES
FOR ANXIETY AND ACTIVATION GROUPS

                     High Anxiety (n=16)   Low Anxiety (n=15)    High G-Act (n=31)     Low G-Act (n=22)
Paired Associates     T1    T2    T3        T1    T2    T3        T1    T2    T3        T1    T2    T3
High association     68.8  67.2  82.8      58.2  65.0  78.2      48.5  58.0  73.5      61.2  76.2  83.0
Low association      25.0  37.5  62.5      23.4  34.1  55.9      21.0  39.1  55.2      31.9  46.0  68.8
Total                39.6  47.4  69.2      35.0   ...  63.3      30.2  45.4  61.3      41.7  56.1  73.5


Results—The total number of correct
words per trial was scored. The means for
performance trials of the individual sets of 
high- and low-association paired associates 
and the total set of words based on the MA 
scale divisions are presented in Table 2. 
The high MA scale Ss showed superior 
performance to lows on both sets and the 
total set. These differences when tested 
with analysis of variance were not statis- 
tically significant, however. The mean dif- 
ferences for the four pairs of high-associa- 
tion words are inconsistent with our predic- 
tions and the Iowa group’s predictions of 
gradual superiority of low-D Ss. And differences
for the eight low-association pairs
are also inconsistent with predictions.
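
The total-set means in Table 2 appear to be item-weighted averages of the high-association (four pairs) and low-association (eight pairs) means. That reading is an inference from the list composition described in the text, not a statement made by the authors, and the function name below is illustrative. A minimal check against the Low G-Act columns:

```python
def total_percent(high_pct, low_pct, n_high=4, n_low=8):
    """Combine subset percentages into a whole-list percentage, weighting by item count."""
    return (n_high * high_pct + n_low * low_pct) / (n_high + n_low)


# Low G-Act means for trials T1-T3, as read from Table 2.
low_gact_high_assoc = [61.2, 76.2, 83.0]
low_gact_low_assoc = [31.9, 46.0, 68.8]

low_gact_total = [
    round(total_percent(h, l), 1)
    for h, l in zip(low_gact_high_assoc, low_gact_low_assoc)
]
# low_gact_total -> [41.7, 56.1, 73.5]
```

The computed values agree with the tabled total-set entries for that group, which supports reading the three rows as high-association, low-association, and total list.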

The performance means of Ss divided on 
the basis of AD-ACL scores are presented 
in Table 2 and are all consistent with predic- 
tions made in the present experiment. Dif- 
ferences for the eight pairs of low-associa- 
tion words were not statistically significant, 
though the means are in the expected direction,
between-groups F (1, 51) = 3.05, p<.10.
On the four pairs of high-association
words the low-activation group performed 
reliably better than the high, F (1, 51)= 
5.35, p <.05. The interaction was not statis- 
tically significant in either analysis. Con- 
sidering both sets of words together, the low- 
activation group performed reliably better 
than the high-activation group, F (1, 51)
= 4.71, p<.05. The interaction was nonsignificant.

Previous experience with the AD-ACL 
had indicated one-third splits result in the 
most powerful discrimination. In order
to more adequately compare the perform- 
ance results based on the two indicants of D, 
several analyses were performed using com- 
parable AD-ACL and MA scale cutoff 
points. The Ss in Exp. I and II were 
divided on the basis of one-third splits on the 
MA scale. For the noncompetitive list in 
Exp. I and the total competitive list in 
Exp. II, performance means were still in the 
opposite direction to our experimental predictions,
and differences were not statistically
significant. The Ss in Exp. I and II
were also divided into upper and lower 20%
groups on the basis of AD-ACL scores.
With these divisions, performance means for
the noncompetitive list in Exp. I were in the
expected direction but not significantly different.
For the total competitive list in Exp.
II, low AD-ACL Ss showed reliably superior
performance to high Ss, F (1, 31) = 4.83,
p<.05.

Discussion—All predictions drawn from 
Hull-Spence theory were supported with 
AD-ACL scores used as indicants of D, but 
predictions were not supported with MA 
scale discriminations. The low-threat group 
procedure employed in this research re- 
quired subtle discriminations of whatever re- 
duced differences in emotionality existed. 
The lack of discrimination by the MA scale 
is not surprising in view of the mounting 
evidence that moderate psychological stress 
must be present for adequate MA scale discrimination
(Spence & Spence, 1966).

The AD-ACL has at least two advantages 
over the MA scale as an indicant of D. 
Measures of emotionality can be obtained 
within the experimental situation instead of 
outside of it as is the case with the MA 
scale. This AD-ACL measurement property 
allows more direct monitoring of the effects 
of independent variables relevant to D. As 
an indicant of D the AD-ACL also has ad- 
vantage over the MA scale since the former 
test is not limited to measurement of overt 
anxiety symptoms. 


AGE, JUDGMENT TIME, AND AMOUNT OF KINESTHETIC
AFTEREFFECT 1


R. OVER 2 AND S. GRIEW 3
University of Otago 


In this study 24 Ss of mean age 18.2 yr. and 24 Ss of mean age 75.9 yr. 
set a rod so that it felt horizontal following kinesthetic inspection of a 
tilted rod. Many old Ss took longer to complete postinspection judgments
than young Ss, and these old Ss showed a smaller aftereffect
than young Ss. Age differences in amount of aftereffect no longer
occurred when judgment time was the same for young and old Ss.


Singer and Day (1965) have shown that
spatial aftereffects decay exponentially with
time between cessation of inspection and
completion of judgment. They have sug- 
gested that age differences in amount of 
aftereffect may occur for no reason other 
than that young and old Ss differ in the time 
taken to complete postinspection settings. 
The present experiment examines the rela- 
tionship between age, judgment time, and 
amount of kinesthetic aftereffect. 
Method.—There were 24 old Ss of mean
age 75.9 yr. from an old people’s home and
club and 24 young Ss of mean age 18.2
yr. from a youth club. In each group there
were 12 males and 12 females. The apparatus,
already described in detail (Over,
1967), consisted of a wooden pivoted rod
which could be adjusted by S through manually
operated rotary controls.

The Ss were blindfolded. Each S made
12 pretest judgments of the apparent horizontal
with the bar initially tilted either 15°
clockwise or 15° counterclockwise from the
horizontal. Four posttest judgments were
made, each preceded by an inspection period
during which S moved his right hand along
the rod for 1 min. in time with a metronome
at 84 beats/min. The same starting
position was used for all pretest and posttest
judgments and the rod was also at this
tilt during inspection. For half of the Ss
the tilt was clockwise, and for half counterclockwise.
A rest period of 5 min. was given
after each posttest judgment. An electronic
timer (Hunter) operated by E through a
silent switch was used to measure the time
taken to complete each pretest and posttest
judgment. Old Ss with a mean posttest
setting time of greater than 18 sec. (a time
exceeded by only one young S) were retested
and required to complete settings in
13 sec. (the mean judgment time for young
Ss) while being given a count of time by E.

1 This study was supported by ... Health,
United States Public Health Service. The authors are
grateful to Marie Mossman for help in data collection
and analysis.
2 Now at Dalhousie University.
3 Now at University of Dundee.

Results and discussion—Mean aftereffect
and judgment-time values are given for
young and old Ss in Table 1. Aftereffect
was calculated for each S by subtracting the
mean of the last six pretest settings from the
mean posttest setting; + was scored if
the difference was in the direction of the tilt
of the inspection figure. Tests for
homogeneity of variance indicate significantly
more variation in both amount of
aftereffect, F (23, 23) = 2.65, p<.05, and
judgment time, F (23, 23) = 2.21, p<.05,
among old Ss than among young Ss. For
old Ss there was a rank order correlation of
-.50, t (22) = 2.72, p<.02, between amount
of aftereffect and judgment time. The eight
old Ss who made judgments most quickly
(mean time = 12.2 sec.) showed significantly
more aftereffect, M = +4.87, SD = 3.34 compared
with M = +1.31, SD = 3.16, t (14) =
1.96, p<.05,* than the eight old Ss who were
slowest in making judgments (mean time =
25.2 sec.). “Fast” old Ss did not differ in
mean aftereffect from young Ss, t (30) =
.85, p>.05, but “slow” old Ss showed less
aftereffect, t (30) = 2.69, p<.02. When
slow old Ss were required to complete each
judgment within 13 sec., aftereffect (M =
+4.03, SD = 1.71) no longer differed from
the value for young Ss, t (30) = .36, p>
.05.

* One-tail test.


TABLE 1
AFTEREFFECT AND JUDGMENT TIME VALUES

           Aftereffect          Judgment Time
         Young      Old        Young      Old
M         ...      +2.84       12.91     18.43
SD        ...       3.32        3.96      5.89
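
The aftereffect score defined above, and two of the reported statistics, can be reproduced with a short sketch. The sign convention (settings in degrees with clockwise positive, inspection tilt coded +1 or -1) and all function names are assumptions made for illustration; the conversion of a correlation to t is the standard one with n - 2 degrees of freedom.

```python
from math import sqrt
from statistics import mean


def aftereffect(pretest, posttest, inspection_tilt):
    """Mean posttest setting minus the mean of the last six pretest settings.

    Settings are in degrees, clockwise positive (assumed convention).
    inspection_tilt is +1 for a clockwise inspection tilt and -1 for
    counterclockwise, so a shift toward the inspection tilt scores as +.
    """
    baseline = mean(pretest[-6:])
    return (mean(posttest) - baseline) * inspection_tilt


def t_from_r(r, n):
    """t statistic for a correlation r based on n Ss (n - 2 df)."""
    return r * sqrt(n - 2) / sqrt(1 - r * r)


def variance_ratio(sd_larger, sd_smaller):
    """F ratio for a homogeneity-of-variance test, from two SDs."""
    return (sd_larger / sd_smaller) ** 2


# Hypothetical settings for one S inspecting a clockwise tilt.
example = aftereffect([0.0] * 6 + [1.0] * 6, [3.0, 4.0, 5.0, 4.0], +1)

t_for_reported_r = t_from_r(0.50, 24)            # compare with the reported t (22) = 2.72
f_for_judgment_time = variance_ratio(5.89, 3.96)  # compare with the reported F = 2.21
```

With the magnitude of the reported correlation (.50) and 24 old Ss, the conversion gives a t of about 2.71, and the judgment-time SDs in Table 1 give a variance ratio of about 2.21, both in line with the values in the text.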

These results support the claim (Singer & 
Day, 1965) that the time taken to complete 
postinspection judgments must be considered 
when amount of aftereffect is compared 
across different groups of Ss. Attention to 
this variable may show that it is unnecessary 
to appeal to hypothetical neural processes to 
explain differences found in amount of aftereffect
between brain-damaged and normal Ss
(Klein & Krech, 1952) and between introverted
and extraverted Ss (Eysenck, 1955).


SIGNAL DETECTION APPROACH TO THE STUDY OF
RETINAL LOCUS IN TACHISTOSCOPIC RECOGNITION 1


WILMA A. WINNICK AND GERARD E. BRUDER
Queens College of the City University of New York 


Using signal detection techniques, 2 experiments studied the variable 
of retinal locus in the recognition of arrays of letters. In the 1st,
horizontal rows of 8 letters spanning the right and left visual fields 
were presented at 100-msec. duration. Ss reported whether particular 
probe letters were or were not the ones seen in each of the 8 positions. 
Ratings were made of the degree of confidence in that judgment. In 
the 2nd experiment, the same judgments were required about vertical 
rows of letters exposed at 200 msec. Both experiments found no
difference in accuracy in either the right and left or the north and
south visual fields. The results corroborate Ayres’ (1966) conclusion
that dominance of one field over the other is an artifact of the order
of reporting.


Studies of the role of retinal locus in 
tachistoscopic recognition (e.g., Ayres &
Harcum, 1962; Bryden, 1960; Harcum & 
Jones, 1962; Heron, 1957) have found left 
visual field dominance or primacy when 
materials were presented simultaneously to 
the left and right visual fields and north- 
field dominance with materials simultane- 
ously exposed in the north and south visual 
fields. Crovitz and Schiffman (1965), finding
bow-shaped error functions closely resembling
the serial position effects of serial
learning tasks, invoked an immediate memory
factor involving both primacy and recency.
More recently, Ayres (1966) suggested that
the left-field and north-field dominance found
in previous retinal locus studies could have 
resulted from left-to-right and up-to-down 
report orders and preexposure sets favoring 
accuracy in the left and north visual fields. 
By controlling or equalizing these factors, 
he was able to eliminate left-field dominance 
for horizontal and north-field dominance for 
vertical materials.

The present experiment has eliminated report
order by the use of signal detection
techniques, requiring only a partial report 
of the items seen. Signal detection methods 
were found by Winnick, Luria, and Zukor
(1967) to be appropriate for the study of
tachistoscopic recognition, and these methods
seemed potentially useful for the study of the
retinal locus variable. Two experiments are 
reported; the first used a horizontal row of
letters to study left-right field differences, 
while the second studied north-south dif- 
ferences with a vertical array of letters. 

Method.—Fifteen introductory psychology 
students served as Ss in Exp. I; an inde- 
pendent sample of 10 Ss served in Exp. II. 
Each S participated in two 1-hr. sessions
from 1 to 3 days apart. 

For Exp. I there were 80 stimulus cards 
each containing a horizontal row of eight 
randomly selected letters with the restric- 
tion that no letter appear on a given line 
more than once. The English capital letters 
were typed and centered on the cards for 
presentation in a mirror tachistoscope (Ger- 
brands). The letters were spaced ł in. 
apart, with 4 in. between the most centrally 
placed letter and the center of the card where 
the fixation cross was located. In addi- 
tion there were 160 response cards each of 
which contained eight typewritten dashes 
or lines corresponding to the positions of the 
eight letters on the stimulus cards. The 
spacing of the positions on the response cards 
was the same as the spacing of letters on the 
stimulus cards. Also, a letter was typed in 
one of the eight positions on each response 
card. For signal presentations (S), the 
letter in a particular position on the response 
card was in that same position on the stimu- 
lus card used for that trial. For noise pres- 
entations (N), the letter in a particular 
position on the response card was not in
that same position on the stimulus card used 
for that trial. 

The sequence of events followed during 
each trial was as follows: (a) a stimulus 
card containing a row of eight letters was 
exposed in the tachistoscope at a duration 
of 100 msec.; (b) a response card contain- 
ing eight corresponding lines or positions, 
and a letter in one of the positions was 
then presented to S; (c) S was required to 
make a “yes” or “no” decision as to 
whether the letter in a position on the re- 
sponse card was or was not the letter in that 
position in the series of letters just exposed
to him; and (d) along with his “yes” or
“no” decision S also was required to rate the
“certainty” of his decision by using a prepared
5-point rating scale. The points on the
scale were described to S as follows: 1 to 
indicate a pure guess, 5 to indicate abso- 
lute certainty, 3 for average certainty, and 
2 and 4 for intermediate points on the scale. 
Each S was given 16 practice trials, 8
being signal presentations and 8 noise
presentations, which sampled each of the
letter-span positions equally and in a random
order. Each S then received 160 test
trials, 80 in Session 1 and 80 in Session 2. 
Half of the trials were signal and half noise 
presentations, and the positions in the eight- 
letter span were sampled equally. Two 
different random orders of presentations 
were used, one for Session 1 and one for 
Session 2. Rest intervals of 5 min. were
given after 40 trials, or whenever so de- 
manded by S. 
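
The trial structure just described (equal numbers of signal and noise presentations, equal sampling of the eight letter positions, a different random order per session) can be sketched as follows. Whether position and presentation type were fully crossed within each session is not stated, so the crossing used here is an assumption, as are the function name and the seeds.

```python
import random
from collections import Counter


def make_schedule(n_trials, n_positions=8, seed=None):
    """Return a shuffled list of (position, kind) trials, kind being "S" or "N".

    Every position/kind combination occurs equally often, so n_trials must
    be a multiple of 2 * n_positions.
    """
    cells = [(pos, kind) for pos in range(1, n_positions + 1) for kind in ("S", "N")]
    repeats, remainder = divmod(n_trials, len(cells))
    if remainder:
        raise ValueError("n_trials must be a multiple of the number of cells")
    trials = cells * repeats
    random.Random(seed).shuffle(trials)
    return trials


# Exp. I: 16 practice trials, then 80 test trials in each of two sessions.
practice = make_schedule(16, seed=0)
session_1 = make_schedule(80, seed=1)
session_2 = make_schedule(80, seed=2)

kind_counts = Counter(kind for _, kind in session_1)
position_counts = Counter(pos for pos, _ in session_1)
```

With 80 trials per session, each of the 16 position-by-type cells occurs five times, giving 40 signal and 40 noise presentations and 10 probes per letter position.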

The apparatus and procedure in Exp. II
were identical to those used in Exp. I, with 
the following exceptions: (a) each S re- 
ceived 256 test trials, 128 in each of the two 
sessions; (b) vertical exposure of letter 
spans was used, and response cards were 
also vertically arranged; and (c) an ex- 
posure duration of 200 msec. was used. 
Preliminary experimentation using the ver- 
tical mode of presentation indicated that 
this increase in exposure duration was neces- 
sary to maintain performance above the 
chance level. 

Results—Figure 1 shows the probability 
of an incorrect recognition for letter posi- 
tions 1-8 in Exp. I. Analysis of variance 
indicated the effects of letter position (LP) 
to be significant, F (7, 98) =4.85, p<.01. 
In the left-field error function (LP 1-4), 
the greatest accuracy was found in LP 1, 
with a sharp decline in accuracy for the re- 
maining LPs. The right-field error func- 
tion (LP 5-8) is more bow-shaped, with 
the greatest accuracy in LP 5 and 8 and 
somewhat less accuracy in LP 6 and 7. Inspection
of the curves and statistical analysis
failed to find any evidence of left-field
dominance. Across-field comparisons using
the Newman-Keuls q statistic indicated that
there was no significant difference in the
number of errors (number of incorrect
recognitions) in the LP Pairs 1 and 5, 2
and 6, and 3 and 7 (p>.05). Favoring
of the right field is evident in the
comparison of LP 4 and 8, which found a
significantly larger number of errors at LP 4
(p<.05).

FIG. 1. Probability of incorrect recognition as
a function of letter position in Exp. I.

Figure 2 shows for Exp. II the probability
of an incorrect recognition for LP
1-8 in the north and south fields. Analysis
of variance indicated that the effects of
LP were significant, F (7, 63) = 11.02, p<
.01. As can be seen, the error functions for
the north field (LP 1-4) and the south field
(LP 5-8) are highly similar in mirror image.
For both fields, accuracy was greatest in
the LP closest to the centrally located fixation
point (LP 4 and 5). The probability
of an incorrect recognition increased as the
distance from the fixation point increased
with a slight decrease in the last LP.
Across visual fields, there was no evidence
of north-field dominance. Comparison of
the numbers of errors (incorrect recognitions)
in the LP Pairs 4 and 5, 3 and 6,
2 and 7, and 1 and 8, again using the Newman-Keuls
q statistic, revealed no significant
field differences in accuracy (p>
.05).

TABLE 1

VALUES OF d’ AND β AS A FUNCTION OF LP FOR LEFT
AND RIGHT FIELDS IN EXP. I AND NORTH
AND SOUTH FIELDS IN EXP. II

The detectability (d’) and criterion (β)
values for each LP in the left and right and
the north and south fields are shown in
Table 1. The β values correspond to the
slope of the Receiver Operating Characteristic
(ROC) for each LP at a point
which was obtained by collapsing each
2 X 10 matrix (i.e., signal-noise vs. response
category) into a 2 X 2 matrix with the cutting
point between the response categories
No-1 and Yes-1 (Pollack & Decker, 1958).
Values of d’ and β were computed using the
assumptions of normality of distributions
and equal variances (Green & Swets, 1966,
pp. 36-37, 405).

FIG. 2. Probability of incorrect recognition as
a function of letter position in Exp. II.
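
The computation described above can be sketched with the equal-variance normal model. The ordering of the ten response categories, the function names, and the rating counts are illustrative assumptions; the formulas are the standard ones, with d’ the difference between the z-transformed hit and false-alarm rates and β the ratio of the normal ordinates at those two points.

```python
from statistics import NormalDist


def collapse_to_2x2(signal_counts, noise_counts):
    """Collapse two rows of 10 rating counts into hit and false-alarm rates.

    Each row is assumed to be ordered Yes-5, ..., Yes-1, No-1, ..., No-5,
    so the cut between Yes-1 and No-1 falls after the fifth entry.
    """
    hit_rate = sum(signal_counts[:5]) / sum(signal_counts)
    false_alarm_rate = sum(noise_counts[:5]) / sum(noise_counts)
    return hit_rate, false_alarm_rate


def d_prime_and_beta(hit_rate, false_alarm_rate):
    """Equal-variance normal estimates of sensitivity (d') and criterion (beta).

    Rates of exactly 0 or 1 have no finite z score and raise an error.
    """
    unit_normal = NormalDist()
    z_hit = unit_normal.inv_cdf(hit_rate)
    z_fa = unit_normal.inv_cdf(false_alarm_rate)
    d_prime = z_hit - z_fa
    beta = unit_normal.pdf(z_hit) / unit_normal.pdf(z_fa)
    return d_prime, beta


# Hypothetical rating counts for one letter position.
signal_row = [30, 10, 8, 6, 6, 5, 5, 4, 3, 3]
noise_row = [3, 3, 4, 5, 5, 6, 6, 8, 10, 30]

hit_rate, false_alarm_rate = collapse_to_2x2(signal_row, noise_row)
d_prime, beta = d_prime_and_beta(hit_rate, false_alarm_rate)
```

In this symmetric example the hit rate is .75 and the false-alarm rate .25, so β is 1 (no bias toward "yes" or "no") and d’ is about 1.35.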

The d’ values in Table 1 reveal a similar 
relationship between accuracy and LP as seen 
for the probability measures. Across-field
comparison of d’ values also yielded no evidence
of left-field or north-field dominance.
The variation of β over LP, although small,
shows a functional relation similar to that 
seen for the probability and d’ measures. 

Discussion—The results of the two ex- 
periments reported support Ayres’ (1966) 
suggestion that left and north visual field 
dominance is an artifact of sequential report 
orders. Eliminating order of report by 
requiring only partial report negated domi- 
nance in these fields. 

The finding of bow-shaped error functions 
for letter spans points to the similarities 
between tachistoscopic recognition and serial 
learning (Crovitz & Schiffman, 1965; Har-
cum, 1965). Although the bow-shaped
functions suggest that letter spans are 
scanned or processed in serial order, it also 
is possible that these functions stem from 
the sequential ordering of responses. The 
results of Exp. I favor the former interpre-
tation. The error functions seen in Exp. I
resemble the previously found bow-shaped
curves even though the sequential ordering
of responses was avoided by using the partial
report method.

It is also of interest that this bow-shaped 
relationship between LP and the probability 
measure was reflected in the signal detection 
measures of sensitivity (d') and criterion 
(β). In a recent study of short-term memory,
Murdock (1966) found that while the prob-
ability of a correct recall increased
across serial positions, d’ remained invari- 
ant. While differences in tasks and materials 
may account for this apparent contradiction, 
further study should be given to the relation- 
ship between serial position and d’. 
MEDIATED GENERALIZATION VIA STIMULUS LABELING:
A REPLICATION AND EXTENSION 1


DAVID R. THOMAS 
University of Colorado 


ALBERTA DeCAPITO CARONITE, GARY L. LaMONICA, AND KENNETH L. HOVING
Kent State University 


In Exp. I, Ss (n = 20) trained to finger lift to a light of 490 mμ
(labeled “green” by E) responded more in generalization testing to
longer (“greener”) and less to shorter (“bluer”) wavelengths than
Ss (n = 20) told that it was “blue.” In Exp. II, 77 Ss were trained
to finger lift to 490 mμ, as in Exp. I, except that Ss were first told that
“a group of college students in a previous experiment” had judged the
CS “mostly green,” and they were required to state their level of
agreement with the group judgment. The greater their stated level of
agreement with the mostly green label, the greater was their subsequent
tendency to respond to longer (“greener”) wavelengths during gen-
eralization testing.


In a series of experiments, Thomas and 
his associates (Doll & Thomas, 1967; 
Thomas & Bistey, 1964; Thomas & De- 
Capito, 1966; Thomas & Mitchell, 1962) 
have demonstrated that in the wavelength 
generalization task the verbal label (i.e.,
color name) S gives to the CS influences the
pattern of subsequent responding. In one
study, Thomas and DeCapito (1966) in-
structed Ss to make a voluntary finger-lift
response to an ambiguous (490 mμ) stimulus,
but prior to the generalization test they
were required to label the stimulus. Those
who called the CS “green” responded more
to longer (“greener”) and less to shorter
(“bluer”) wavelengths than those labeling
it “blue.” The purpose of Exp. I was to
determine whether a similar mediational ef- 
fect could be observed if the label for the 
CS were supplied by E. Having demon- 
strated this, the purpose of Exp. II was to 
extend the procedure of the first experiment, 
such that the label given the CS was attrib- 
uted to “a group of college students in the 
previous experiment” rather than arbitrarily 
employed by E. An additional innovation in 
this experiment was the requirement that Ss 
state their level of agreement with the 
“group judgment” prior to the administra- 
tion of the generalization test. This allowed 
us to relate Ss’ verbal behavior to their 
subsequent generalization test performance 
and also provided a degree of commitment of 
the Ss once they had publicly stated their 
position. It would seem reasonable to ex- 
pect that the more strongly an S claimed 
agreement with the group judgment, the 
more her generalization test performance 
would be influenced. 

Method.—The Ss in Exp. I were 40 under- 
graduate women enrolled in introductory psy- 
chology courses at Kent State University. 
In Exp. II, 77 women from the same popula- 
tion were employed. The apparatus used in 
both experiments was the same as that de- 
scribed by Thomas and DeCapito (1966). 

The procedure used in Exp. I was as 
follows: After S was seated in the experi- 
mental room in front of the stimulus source, 
instructions were read indicating that a 
“green” (or “blue”) color would be ex- 
posed to S and that in subsequent generali- 
zation testing, when she recognized this color, 
she was to respond by lifting her finger from 
the telegraph key. For 20 of the Ss the 
490-mμ CS was called green; for the other
20 it was called blue. Following a 60-sec.
presentation of the 490-mμ CS, eight random-
ized series of seven test stimuli, including
the CS, were presented; these stimuli, which
had earlier been matched for brightness, were
475, 480, 485, 490, 495, 500, and 505 mμ.
Each test stimulus was presented for a period
of 3 sec. and was followed by a blackout of
5-10 sec., allowing sufficient time for E to 
record the presence or absence of a re- 
sponse and to change the monochromator 
setting. 

The procedure used in Exp. II was as 
follows: Each S was seated in the experi-
mental room in front of the stimulus source
and standard instructions were read to her.
However, at the time the 490-mμ CS was
presented, E said: 


In a previous experiment in which the 
subjects were asked to label this color, a 
majority of them called it mostly green. 
We would like to know to what extent 
you agree with this judgment. Please 
score this rating scale accordingly. 


The Ss then scored their agreement on a 
6-point scale, which included the designa- 
tions: (1) strongly disagree, (2) disagree, 
(3) slightly disagree, (4) slightly agree, (5) 
agree, (6) strongly agree. After the rating 
was finished, the 490-mμ stimulus was turned
off and the generalization test was begun. 

The generalization test used in Exp. II 
consisted of six randomized series of five 
equally bright test stimuli, including the 490-
mμ CS; these were 470, 480, 490, 500, and
510 mμ. Each test stimulus was presented
for a period of 3 sec. and was followed by 
a blackout of 5-10 sec. 

Results and discussion—For each S in 
Exp. I a generalization gradient was ob- 
tained based on the number of responses to 
each of the generalization test stimuli. Fig- 
ure 1 presents the mean generalization gradi- 
ents of the green-label and the blue-label 
groups. It is clear from this figure that Ss 
on whom the label green was imposed re- 
sponded more to the longer (“greener”) and 
less to the shorter (“bluer”) wavelengths 
than did Ss on whom the blue label was 
imposed. To analyze the degree to which a 
given S tended to respond to longer (or 
shorter) wavelengths, the gradient was 
treated as a grouped frequency distribution;
and the mean of each S’s gradient (i.e.,
frequency distribution) indicated the size and
the extent of the distortion in recognition of
the CS; e.g., a mean of 490 indicated no
distortion, a mean higher than this indicated
a tendency to respond to “greener” values,
and the higher the mean the greater the 
strength of this tendency. The mean of the 
gradient means was 492.2 for Ss in the 
green-label group and 489.1 for Ss in the 
blue-label group, t (38) = 4.07, p < .01. In
the Thomas and DeCapito study (1966), 
where the labels were selected by Ss them- 
selves, the corresponding values were 492.0 
for the green label and 488.4 for the blue. 
These values do not differ significantly from 
the corresponding ones in the present study 
(t < 1 in both cases); thus it may be con-
cluded that a label imposed by E on S has
essentially the same mediational effect on S’s 
generalization behavior as does a label which 
S has selected volitionally. 
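
The gradient mean used as the dependent measure above can be sketched as a frequency-weighted mean of the test wavelengths. The function name and the response counts below are hypothetical illustrations, not data from the study; the only assumption carried over from the text is that each wavelength is weighted by the number of responses S made to it.

```python
def gradient_mean(responses):
    """Mean of a generalization gradient treated as a grouped frequency
    distribution.

    responses maps each test wavelength (in millimicrons) to the number
    of responses S made to it across the test series.
    """
    total = sum(responses.values())
    if total == 0:
        raise ValueError("S made no responses; the gradient mean is undefined")
    return sum(wavelength * n for wavelength, n in responses.items()) / total


# Hypothetical S in Exp. I (eight series, so at most 8 responses per stimulus)
example = {475: 0, 480: 1, 485: 4, 490: 8, 495: 6, 500: 3, 505: 1}
print(round(gradient_mean(example), 1))
```

A value above 490 for such an S would indicate a shift toward the longer ("greener") wavelengths, and a value below 490 a shift toward the shorter ("bluer") ones.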

Table 1 indicates the number of Ss in 
Exp. II selecting each of the six agreement- 
disagreement categories. It is clear from 
this table that Ss tend to prefer the more 
central categories. In addition, there is a 
tendency to select agreement categories more 
than disagreement ones, whereas more of 
Thomas and DeCapito’s (1966) Ss called 
490 mμ blue than green. This tendency is
itself evidence of social influence. 

A generalization gradient was obtained 
from each S in Exp. II, and Fig. 2 presents 
the mean generalization gradient of Ss in 
each of the six agreement-disagreement cate- 
gories. The figure clearly demonstrates the 
predicted positive relationship between level 
of agreement with the mostly green label and 
tendency to respond to longer (“greener”) 
wavelengths. Also, as in Exp. I, the mean
of each S’s generalization gradient was ob- 
tained, and the mean of these gradient means 
for each of the six categories appears in 
Table 1. These means reflect the same posi- 
tive relationship shown in the figure. The 
extent of this relationship is indicated by 
a correlation (r) of +.46 (p < .01) between
Ss’ agreement-disagreement categories and 
the means of their generalization gradients. 

It should be made clear that the correla-
tion between category chosen and generaliza-
tion performance does not simply reflect a
relation between verbal compliance and judg-
mental change. The agreement-disagreement
category selected by an S doubtlessly re-
flected at least two factors: (a) S’s personal
preinfluence judgment of the value of the
CS, and (b) her susceptibility to group in-
fluence. With the present experimental
design these two factors could not be dif-
ferentiated.


TABLE 1

GENERALIZATION GRADIENT MEANS OF THE
SIX GROUPS OF Ss

Agreement Category     Gradient Mean     n

Strongly Disagree         488.57         2
Disagree                  489.00         8
Slightly Disagree         491.47        22
Slightly Agree            491.55        17
Agree                     494.17        21
Strongly Agree            494.72         7


Fig. 2. Mean generalization gradients of Ss
in the six different groups. (Ordinate: mean number of responses. Curves, with n in parentheses: Strongly Disagree (2), Disagree (8), Slightly Disagree (22), Slightly Agree (17), Agree (21), Strongly Agree (7).)

Experiment II demonstrated that the more 
strongly Ss stated that they agree with a 
particular label the more that label exerted 
its mediational effect on generalization. This 
suggests that any experimental manipulation 
influencing the degree to which Ss agree 
with a particular label also should be re- 
flected in corresponding changes in gener- 
alization. For example, if the label is attrib- 
uted to a particularly prestigious group, its 
effectiveness should be enhanced. With the 
procedure used here, since the color name is 
attributed to a (hypothetical) reference 
group, the characteristics of this group (e.g.,
its status and relationship to S, the size and 
unanimity of the group, etc.) can readily be 
manipulated. 
AVOIDANCE CONDITIONING OF THE GSR:
A REPLICATION OF KIMMEL AND BAXTER *


RICHARD V. THYSELL AND CHEN-YEH HUANG
University of Waterloo 


Avoidance conditioning of the GSR was obtained by omitting a 
shock on each of 16 trials on which the experimental S produced a
GSR to the tone CS. The mean CR of the experimental Ss was
greater than that of the yoked control Ss. These results replicate
those reported by Kimmel and Baxter (1964), and provide further 
evidence that operant conditioning of an ANS response is possible. 


Kimmel and Baxter (1964) have reported 
evidence of operant GSR conditioning. They
administered a series of trials in which, for
an experimental S, the UCS (shock) was
delivered following the CS (tone) if S
failed to give a CR, and was omitted if
S gave a CR. A yoked control S re-
ceived an identical number and distribution
of shocks. Classical conditioning of the
GSR can occur for both groups on shock 
trials. Avoidance conditioning can occur for 
the experimental Ss because of the contin- 
gent omission of shock. Such conditioning 
presumably could not take place for control 
Ss, who sometimes receive shock following 
a CR and sometimes receive no shock fol- 
lowing no CR. Kimmel and Baxter found 
that while both experimental and control 
groups showed conditioning, the mean CR 
was significantly greater for the experimental 
group. The difference was interpreted as 
being due to avoidance conditioning in the 
experimental group. 

Two attempts to replicate this finding and 
extend it to extinction (Kimmel, Sternthal, 
& Strub, 1966) were only partially successful. 
The mean CR for the experimental group 
was greater than that of the control group 
in each study, but the acquisition differences 
were not statistically significant. The ex- 
tinction data combined across the two experi- 
ments showed the mean CR of the ex- 
perimental groups to be significantly greater 
than that of the control groups. In a third 
replication study (Kimmel & Sternthal, 
1967), Ss were matched for GSR reactivity 
and conditionability to minimize the bias 
(Church, 1964) inherent in the yoked-S de- 
sign. Again the mean CR was slightly, but 
not significantly, greater for the experi-
mental group during acquisition. Because of
the somewhat equivocal nature of these 
results the present experiment was under- 
taken to provide another replication of the 
original Kimmel and Baxter study. 

Method.—Thirty-two students from the 
introductory psychology class at the Uni- 
versity of Waterloo participated in the ex- 
periment to fulfill a course requirement. 
The Ss were assigned randomly to the ex- 
perimental and control groups. 

The CS was a 1-sec., 1,000-cps tone of
approximately 40 db., delivered via earphones
by a Hewlett-Packard Model 201C audio-
oscillator. The UCS was a .1-sec. shock
of 1.5 ma. (110 v. ac) for a nominal S of
50K ohms. The UCS was delivered to the
index and ring fingers through Lafayette 
finger electrodes (Model E-2). The in- 
terval from onset of the CS to the onset of 
the UCS was 5 sec. The duration of CS and 
UCS and the CS-UCS interval were con- 
trolled electronically. A switch permitted E 
to interrupt the timing circuit following CS 
onset to prevent the delivery of the shock. 
The GSRs were recorded from the palm and 
back of the hand on a Beckman dynograph. 

Procedure—The Ss were run in pairs, 
each pair including an experimental S and 
his yoked control. The Ss were seated in 
easy chairs facing away from each other and 
the recording and shock electrodes were at- 
tached. They were told that they were to 
relax and breathe normally and that they 
would receive a series of tones and shocks.
No mention was made of a relationship
among tone, shock, and the GSR. After
reading the instructions, E retired to an ad-
joining room containing the apparatus and
linked to Ss’ room by a one-way mirror and
intercom.

Fig. 1. Mean conditioned GSR magnitude during acquisition for experimental and control groups.


After an adaptation period of approxi- 
mately 5 min. Ss were given three shocks of 
increasing intensity, the last being 1.5 ma. 
The shocks were followed by presentations 
of the tone alone until both Ss of a pair 
reached a criterion of two consecutive fail- 
ures to respond. The first conditioning trial 
was given 30 sec. after the last tone, and all 
Ss received the shock on this trial whether 
they responded to the tone or not. The in- 
tertrial interval during acquisition was varied 
randomly among 20, 30, and 40 sec. with a 
mean of 30 sec. 

During Acquisition Trials 2-16 the shock 
was omitted for each experimental S on any 
trial on which he produced a GSR of at least 
1,000 ohms within 5 sec. of CS onset. His 
yoked control received shock omission on the 
same trials. 
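
The yoked contingency described above can be sketched as follows. This is only an illustration under stated assumptions: the function name and the input format (one resistance change per trial for the experimental S) are hypothetical, while the 1,000-ohm criterion and the unconditional shock on Trial 1 come from the text.

```python
CRITERION_OHMS = 1000  # smallest GSR that counts as a CR (from the text)


def shock_schedule(experimental_gsr_ohms):
    """Return one boolean per trial: True if shock is delivered.

    The same schedule applies to the experimental S and to the yoked
    control, whose own responses have no effect on shock delivery.
    experimental_gsr_ohms holds the experimental S's GSR (in ohms)
    within 5 sec. of CS onset on each trial (hypothetical input format).
    """
    schedule = []
    for trial, gsr in enumerate(experimental_gsr_ohms, start=1):
        if trial == 1:
            schedule.append(True)                  # Trial 1: always shocked
        else:
            schedule.append(gsr < CRITERION_OHMS)  # a CR avoids the shock
    return schedule


# Hypothetical four-trial record for one experimental S
print(shock_schedule([1500, 1200, 400, 1000]))
```

Because the control S's schedule is copied from his partner, shock omission is contingent on a CR only for the experimental S, which is the feature the design relies on.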

Results and discussion—The mean CR, in 
terms of the square root of the maximum 
change in conductance during the CS-UCS 
interval, is shown in Fig. 1, as a function of 
trials, for both groups. The overall trends 
and the higher level of performance of the 
experimental group after Trial 8 are similar 
to those reported in the earlier studies by 
Kimmel et al. To evaluate this acquisition
difference each S’s scores were summed over 
the 16 trials and a binomial test was per- 
formed on the number of pairs in which the 
summed score of an experimental S exceeded
that of his yoked control (n=12). This 
test yielded p=.08 (two-tailed). 
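
The reported probability can be reproduced with an exact binomial (sign) test. The sketch below assumes that the 32 Ss formed 16 pairs, that n = 12 is the number of pairs in which the experimental S exceeded his yoked control, and that there were no ties; the function name is hypothetical.

```python
from math import comb


def sign_test_two_tailed(successes, n_pairs):
    """Exact two-tailed binomial test against a chance probability of .5."""
    k = max(successes, n_pairs - successes)
    # Probability of a split at least this extreme in one direction
    upper_tail = sum(comb(n_pairs, i) for i in range(k, n_pairs + 1)) / 2 ** n_pairs
    return min(1.0, 2 * upper_tail)


p = sign_test_two_tailed(successes=12, n_pairs=16)
print(round(p, 2))  # 0.08
```

Under these assumptions the one-tailed probability is 2517/65536, so the two-tailed value rounds to the reported .08.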


As in the Kimmel, Sternthal, and Strub 
study (1966), the present results confirm the 
superior performance of the experimental 
group reported by Kimmel and Baxter 
(1964) but the difference was of marginal 
statistical significance. The relatively smaller 
difference between groups found in the 
present study could be due to several pro- 
cedural variations which included: (a) a dif- 
ferent criterion of adaptation to the tone 
prior to the first acquisition trial, (b) a dif- 
ference in the magnitude of the CR required 
to avoid shock, and (c) the running of two 
Ss at one time in the same room, which, 
furthermore, was not soundproof as in the 
Kimmel and Baxter study. That the same 
general results have now been found in five 
separate studies, each of which incorporated 
some procedural changes, indicates the reli- 
ability of the treatment effect. 
ORDINAL POSITION IN SERIAL LEARNING


WILMA A. WINNICK
Queens College of the City University of New York


AND RHEA L. DORNBUSH 1
Douglass College, Rutgers, The State University


A transfer experiment examined the role that knowledge of item
position plays in serial learning (SL). PA learning of numbers (as
stimulus items) paired with nonsense syllables provided knowledge of
the absolute positions of the syllables in subsequent SL to 1 group
and knowledge of relative position to a 2nd, while for a 3rd group
there was no correspondence at all between the numerals of PA and
item position in SL. A 4th group for whom syllables (as response
items) were paired with words served as a control. Prior knowledge
of absolute positions was found to be of significant aid in SL, showing
the greatest effect for the middlemost items. Those Ss given knowl-
edge of relative positions also learned more rapidly than did the control.


There is recent evidence to suggest that 
the absolute position that items hold in the 
list is one of the cues in serial learning. 
This has been demonstrated in experiments 
studying transfer from one serial list to an- 
other (Young, 1962) or from serial learning 
to a spatial discrimination task (Ebenholtz, 
1963a; Jensen & Rohrer, 1965) and in studies 
using a variable starting position (Ebenholtz, 
1963b; Keppel, 1964; Winnick & Dornbush, 
1963). Recently, Young, Hakes, and Hicks 
(1967) used a design in which Ss were 
transferred from a serial list to a paired- 
associate list whose responses were the 
items learned in the serial list and stimulus 
items were the ordinal position numbers. 
Their data suggest that the absolute posi- 
tion cue serves most effectively at the begin- 
ning and end of the list. 

The present study follows directly from 
the study of Young et al. (1967), examining 
the effect of prior knowledge of positions 
provided by paired-associate (PA) learning 
upon subsequent serial learning (SL). 
Four groups were tested; in the first, the 
numbers of the positions of items in the 
serial list served as the stimuli in prior 
PA learning with the items from the serial 
list as responses. In a second group, PA 
learning provided knowledge of relative posi- 
tions, while for a third group there was no 
correspondence between the numerals that 
were the stimulus items of PA and posi- 
tions of items in SL; a fourth group, serving 
as a control, had syllables paired with words 
for PA. 

Method.—The sequence of events in the 
experiment consisted of PA learning fol-
lowed by SL. There were four transfer ar-
rangements: numbers as stimuli in PA
(1 through 12) corresponded to the posi- 
tions of syllables in SL (Group A); num- 
bers as stimuli in PA (9 through 20) had 
only sequential correspondence with positions 
in SL (Group B); numbers as stimuli in 
PA (1 through 12) had no correspondence 
with positions in SL (Group C); and, in 
the control group, words, rather than num- 
bers, were the stimulus items paired with 
nonsense syllables (Group D). 

A total of 56 Ss participated, with 14 in
each group. All were introductory psychol- 
ogy students who served in the experiment 
as volunteers; none had had previous verbal 
learning experience. The Ss were randomly 
assigned (in order of appearance) to each 
of the four experimental groups.

A Lafayette memory drum was used to 
present learning materials, with the long-arm 
attachment employed for PA learning. The 
12 nonsense syllables used, identical for PA 
and SL, were chosen from Glaze (1928) 
and averaged 73% associative value; in 
addition, they were arranged for SL to avoid 
repetition of letters and any obvious sequen- 
tial links. The numbers used in PA were 
in the form of words; five different arrange- 
ments of pairs were used in PA to prevent 
serial learning. The words used for Group 
D were matched as closely as possible in 
terms of initial letters and word length to the 
numerals used in Group A. The words 
were: off, tow, there, fail, fort, set, sieve, 
easy, neat, tin, elevate, twine. Two differ- 
ent list arrangements and word pairs were 
employed in subsets of each of the experi- 
mental groups. 

SL and PA were carried out identically 
with learning of both types by the anticipa-
tion method to the criterion of one errorless
trial. The Ss were instructed in both learn- 
ing situations to pronounce the nonsense syl- 
lables, and first and second tasks were 
separated by a 90-sec. interval. Instructions 
for the second task made no mention that 
the same syllables were involved. In SL, 
syllables were exposed in the memory drum 
at a 2-sec. rate with a 4-sec. intertrial inter- 
val; for PA, stimulus items were exposed 
every 4 sec., with a 2.2-sec. anticipation in-
terval, and a 4-sec. interval between succes-
sive presentations.

Results.—Trials to reach 100% criterion in
the first PA learning task are shown in 
Table 1 for the four groups. Analysis of 
variance of these data found no significance 
in the small differences, permitting the as- 
sumption of group equivalence. 

The data for the SL transfer task are also
shown in Table 1. On this task, group dif-
ferences are obvious, and one-way analysis of
variance found significance in the group
comparison, F (3, 52) = 7.54, p < .01. Dun-
can’s new multiple range test (Edwards,
1960, pp. 136-139) found all interpair dif-
ferences, except the difference between C
and D, to be significant at p < .05. Thus,
both Groups A and B were found to have
learned more rapidly than either Group D
(control) or Group C. In addition, Group A
had learned significantly more rapidly than
did Group B.


TABLE 1

MEAN NUMBERS OF TRIALS TO CRITERION FOR
THE PA AND SERIAL LISTS

               PA               Serial
Group       M       SD       M       SD

A         16.43    4.19     3.43    2.87
B         17.61    5.27    11.57    6.60
C         16.02    3.27    17.21    5.91
D         18.36    6.86    16.50    2.99


Fig. 1. Mean correct responses per S per trial as a function of item position in SL for the four groups. (Ordinate: mean correct responses per S per trial. Abscissa: serial position.)

The learning of individual items in the 
serial list was analyzed as a function of serial 
position. These data are shown in Fig. 1. 
A two-way (groups, position) analysis of 
variance indicated significance in the main 
effect of groups, F (3, 52) = 23.30, p<.01. 
The main effect of serial position was also 
significant, F (11, 572) =25.57, p<.01. 
Significance was further found in the Groups
× Positions interaction, F (33, 572) = 2.46,
p < .01.
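
The degrees of freedom in these F ratios follow from the design (4 groups of 14 Ss, 12 serial positions). The short check below is a sketch that assumes a standard groups-by-positions analysis with repeated measures on positions; the function name is hypothetical.

```python
def mixed_anova_df(n_groups, n_per_group, n_positions):
    """Degrees of freedom for a groups x positions design with
    repeated measures on the positions factor."""
    n_subjects = n_groups * n_per_group
    df_groups = n_groups - 1
    df_between_error = n_subjects - n_groups          # Ss within groups
    df_positions = n_positions - 1
    df_interaction = df_groups * df_positions
    df_within_error = df_between_error * df_positions  # positions x Ss within groups
    return {
        "groups": (df_groups, df_between_error),
        "positions": (df_positions, df_within_error),
        "groups_x_positions": (df_interaction, df_within_error),
    }


print(mixed_anova_df(n_groups=4, n_per_group=14, n_positions=12))
```

For this design the function returns (3, 52), (11, 572), and (33, 572), which agree with the values reported in the text.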

From Fig. 1 it is apparent that the greatest 
contrast between the data of the control 
group and of Group A is found in the 
center of the list. This cannot be asserted to 
be due to the transfer variable which pro- 
vided prior position learning, however. At 
the beginning and end positions, the three 
other groups did almost as well as Group A; 
in fact, the control group (Group D) per- 
formed nearly perfectly at Position 1 and 
did almost as well at the terminal position. 
In other words, given the shape of the con- 
trol curve, which is a typical serial position 
curve, it is simply not possible for as much 
improvement to accrue to first and last posi- 
tions as in the central positions. 

Discussion.—Where, for Group A, posi-
tional knowledge from prior PA learning
was specific and explicit, there was a large, 
significant advantage over the control group. 
Positive transfer was also evident in the 
data of Group B, for whom the PA task 
provided information about the relative posi- 
tions of syllables in SL, and small negative 
transfer was found where inconsistent in- 
formation about positions was given. Hence, 
prior information about both absolute and
relative positions of items in SL appears to 
benefit such learning. 
RELATIVE IMPORTANCE OF PROBABILITIES AND
PAYOFFS IN RISK TAKING 1


PAUL SLOVIC AND SARAH LICHTENSTEIN 2


Oregon Research Institute, Eugene 


This paper is concerned with understanding how people make decisions 
about gambles when the relevant probabilities and payoffs are ex- 
plicitly stated. It is proposed that decisions may be determined by 
a person’s beliefs about the relative importance of probabilities and 
payoffs and by limitations on his ability to act on the basis of these 
beliefs when processing the information that describes a gamble. An 
explanation based on this information-processing orientation is offered 
as an alternative to present notions about probability and variance pref-
erences. Finally, 2 experiments are reported to indicate the useful-
ness of considering gambling decisions within the context of information
processing.


This paper is concerned with con- 
ceptualizing the process by which a 
person makes decisions about gambles 
whose probabilities and payoffs are 
explicitly stated. 

States Public Health Service. Portions of
this work were presented at the meetings of
the Western Psychological Association, San
Francisco, May 1967, and the American Psy-
chological Association, Washington, D. C.,
September 1967.

2 The authors are indebted to Amos
Tversky for his criticisms of an earlier
version of this paper.

Gambles commonly are characterized
as multidimensional stimuli. For ex-
ample, a two-outcome gamble in which
one outcome is a gain of some amount
of money and the other is a loss can be
described by its location on four basic
risk dimensions: probability of win-
ning (P_W), amount to win ($_W),
probability of losing (P_L), and amount
to lose ($_L). The essence of an ade-
quate descriptive theory of risk taking
is a thorough understanding of the way
in which people go about integrating
information from these basic risk di-
mensions when evaluating gambles.
The influence of these basic risk di- 
mensions is examined here with par- 
ticular emphasis on two ideas. The 
first is “importance beliefs.” By this 
is meant that when a person evaluates 
a bet, he pays more attention to some 
risk dimensions than to others because 
he believes that these particular dimen- 
sions are most important for his pres- 
ent decision. For example, a person 
with very little money and great fear 
of losing it may focus his attention on 
the amount to lose ($_L) and base his
decisions almost exclusively on this
dimension, largely disregarding the
other information provided in the bet.
Such beliefs about the relative im-
portance of the risk dimensions may
derive from previous experience, from 
logical analysis of the decision task, or 
even from quite irrational fears or 
prejudices. Importance beliefs, as de- 
fined above, have been useful for de- 
scribing and understanding how people 
combine information in other multi- 
dimensional contexts such as those 
which occur in clinical judgment or di-
agnosis (see, e.g., Hammond, Hursch,
& Todd, 1964; Hoffman, 1960; Holz- 
berg, 1957; Shepard, 1964). How- 
ever, this notion has received little or 
no attention in the area of decision 
making under risk. 
The second concern is with informa- 
tion-processing considerations. As- 
suming that a person holds strong be- 
liefs about the relative importance of 
the various risk dimensions, his capac-
ity to employ these beliefs successfully
when making decisions may be quite 
limited. According to Miller (1961), 
a decision maker under stress may com- 
pletely neglect some items of informa- 
tion and employ other items according 
to a priority scheme in order to reduce 
information overload. Similarly, the
work of Bruner, Goodnow, and Austin 
(1956) showed that attempts to reduce 
“cognitive strain” have a very strong 
influence on information use in decision 
tasks. Experimental studies demon- 
strating the influence of information- 
processing considerations in risk tak- 
ing include the work of Miller and 
Meyer (1966) on the structure and 
number of alternatives available to the 
decision maker; Herman and Bahrick 
(1966) on the amount of information a 
person must encode in order to eval- 
uate a bet; and Slovic, Lichtenstein, 
and Edwards (1965) on the effects of 
boredom on choices among bets. Con- 
cern with the decision maker’s infor- 
mation-processing capabilities also un- 
derlies Simon’s (1957) notion of 


“satisficing” and Toda’s (1962) dis- 
cussion of the decision processes of a 
“fungus eater.” 

In summary, the above considera- 
tions lead us to propose that the de- 
cision maker may give some dimensions 
of a gamble considerably more weight 
than others because he believes that 
these dimensions are more important 
for the particular judgment he is 
making and because his limited infor- 
mation-processing capability forces 
him to focus on these dimensions and 
neglect others. The remainder of this 
paper will attempt to marshal support
for these arguments. 


DESCRIPTIVE THEORIES AND
CONSTRUCTS


A review of the most influential 
theories and constructs used to de- 
scribe risk taking will help place the 
present ideas in perspective. 

The most popular class of models in 
this area has been that of the expecta- 
tion models. The prototype of this 
class is the expected value (EV) 
model, which states that the attractive- 
ness of a gamble corresponds to the 
gamble’s mathematical expectation. 
For the two-outcome gamble offering 
a chance of gain or loss, 


\[ EV = P_W \cdot \$_W + P_L \cdot \$_L . \]


This model asserts that when a person 
chooses among two or more alternative 
gambles, he maximizes EV. 
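
As a small worked example of the EV model, the sketch below computes the expectation of two-outcome gambles and picks the one with the larger EV. The gambles are hypothetical, and the sign convention (the amount to lose entered as a negative number, so that the two products are simply added) is an assumption made for the illustration.

```python
def expected_value(p_win, amount_win, p_lose, amount_lose):
    """EV of a two-outcome gamble; amount_lose is entered as a negative
    number (an assumed sign convention)."""
    return p_win * amount_win + p_lose * amount_lose


# Hypothetical gambles: (P_W, $_W, P_L, $_L)
gambles = {
    "A": (0.5, 4.00, 0.5, -2.00),
    "B": (0.2, 10.00, 0.8, -1.00),
}

evs = {name: expected_value(*dims) for name, dims in gambles.items()}
best = max(evs, key=evs.get)   # the choice the EV model predicts
print(evs, best)
```

Here gamble A has an EV of 1.00 and gamble B an EV of about 1.20, so a strict EV maximizer would choose B even though B loses four times out of five.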

Since Bernoulli (1738), it has been 
evident that the EV of a risky course 
of action is not the sole determiner 
of its attractiveness for the decision 
maker. Bernoulli and others proposed 
that people maximize expected utility 
(EU) where utility is the subjective 
value of money. More recently, EU 
theory has been amended to incorporate 
the notion that people make decisions 
on the basis of subjective rather than
stated probabilities. According to this
amended version, behavior is governed 
by the subjectively expected utility 
(SEU) of a gamble (Edwards, 1955). 
For our exemplary two-outcome gam- 
ble, 


SEU = S(Pw) · U($w) + S(PL) · U($L), 


where S(P) and U($) represent sub- 
jective functions corresponding to 
stated probabilities and payoffs. 
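
A minimal sketch of the SEU computation follows. The particular forms chosen for S(P) and U($) are hypothetical and serve only to show where the subjective functions enter; the model itself does not prescribe them. With identity functions the expression reduces to EV.

```python
def seu(p_win, win, p_lose, loss, s, u):
    """SEU = S(Pw) * U($w) + S(PL) * U($L); the loss enters as -loss."""
    return s(p_win) * u(win) + s(p_lose) * u(-loss)


def identity(x):
    return x


def s_example(p):
    """Hypothetical subjective-probability function (an assumption)."""
    return p ** 0.8


def u_example(x):
    """Hypothetical utility function: concave, losses weighted double."""
    magnitude = abs(x) ** 0.9
    return magnitude if x >= 0 else -2.0 * magnitude


# The same gamble evaluated objectively and subjectively.
ev_value = seu(0.5, 2.0, 0.5, 1.0, identity, identity)
seu_value = seu(0.5, 2.0, 0.5, 1.0, s_example, u_example)
```

With these assumed functions a gamble whose EV is positive receives a negative SEU, which is the kind of divergence the amended theory is meant to capture.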

The formal axiomatized SEU model 
(e.g., Savage, 1954) does not require 
that the subjective probability and util- 
ity functions be interpreted in any par- 
ticular way. They can be viewed sim- 
ply as transformations of the scale of 
stated probabilities and payoffs— 
transformations that are predictive of 
risk-taking decisions. However, sub- 
jective probability is quite commonly 
interpreted as a measure of the decision 
maker’s opinion about the likelihood of 
an event, and utility is usually thought 
of as the subjective worth of an out- 
come. Future references to subjective 
probability and utility will be with these 
interpretations in mind. 

Another influential approach to the 
understanding of choices among bets 
has been the study of probability pref- 
erences. Probability preferences were 
first examined by Edwards (1953, 
1954a, 1954b) whose Ss made choices 
among pairs of gambles. Edwards ob- 
served consistent tendencies to prefer 
or avoid long shots (gambles offering a 
small probability of a large payoff). He 
also observed consistent choices of bets 
having specific intermediate probabil- 
ities. 

At first, attempts were made to ex- 
plain probability preferences by means 
of subjective probabilities (Edwards, 
1955). Later work by Coombs and 
Pruitt (1960), however, was based on 
the idea that a person has an ideal 


Pw and that, when other variables 
such as EV are controlled, choices 
among bets are determined by the simi- 
larity of each bet’s Pw to the person’s 
ideal. 

Yet another factor thought to be 
an important determiner of gambling 
decisions is preference for variance. 
Theorists such as Fisher (1906) and 
Allais (1953) have argued that people 
base their risk-taking decisions not only 
on expectation but also on the disper- 
sion of a gamble’s possible outcomes. 
Any gamble can be viewed as a prob- 
ability distribution over the possible 
outcomes. The mean of this distribu- 
tion is the gamble’s EV. The variance 
of this distribution has been the most 
commonly studied index of dispersion. 
For the two-outcome gamble discussed 
earlier, 


variance = Pw · PL · ($w − $L)². 
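
The formula can be checked against the definition of variance for a two-point distribution. The sketch below assumes PL = 1 − Pw and treats the loss as a negative outcome; the function names are illustrative.

```python
def variance_formula(p_win, win, loss):
    """variance = Pw * PL * ($w - $L)^2, with PL = 1 - Pw and $L = -loss."""
    p_lose = 1.0 - p_win
    return p_win * p_lose * (win - (-loss)) ** 2


def variance_direct(p_win, win, loss):
    """Variance computed from the definition, for comparison."""
    p_lose = 1.0 - p_win
    mean = p_win * win + p_lose * (-loss)
    return p_win * (win - mean) ** 2 + p_lose * (-loss - mean) ** 2


# A 1/10 chance to win 90 cents against a 9/10 chance to lose 10 cents.
v_formula = variance_formula(0.1, 0.90, 0.10)
v_direct = variance_direct(0.1, 0.90, 0.10)
```

Both routes give a variance of .09 for this gamble.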


A person’s “utility for risk” often 
has been equated with his preference 
for variance (Allais, 1953; Coombs & 
Pruitt, 1960; Kogan & Wallach, 1967; 
Royden, Suppes, & Walsh, 1959; Slo- 
vic, 1962; Van der Meer, 1963). 

Although the first experimental 
study of variance preferences (Ed- 
wards, 1954c) concluded that they 
were less important than probability 
preferences, studies done since have 
been interpreted as indicating that vari- 
ance has rather strong effects (Coombs 
& Pruitt, 1960; Davidson & Marschak, 
1959; Lichtenstein, 1965; Littig, 1962; 
Royden et al., 1959; Van der Meer, 
1963). 

Coombs and Pruitt (1960) found 
that Ss had stable probability prefer- 
ences and, in addition, variance prefer- 
ences that interacted in a systematic 
fashion with these probability prefer- 
ences. Specifically, Ss seemed to pre- 
fer higher levels of variance for gambles 
that contained their preferred prob- 
abilities. 

To date, probability and variance 
preferences have been examined in 
more than 30 studies. Without ex- 
ception, studies claiming to find such 
preferences have used gambles whose 
probabilities and payoffs were highly, 
if not completely, confounded. This 
confounding means that patterns of Ss’ 
preferences may be explained not only 
by recourse to notions of probability 
and variance preferences but, alter- 
natively, by strategies that involve dif- 
ferential weighting of the basic dimen- 
sions of the gambles. This point is 
elaborated below. 

Table 1 presents gambles similar to 
those used by Coombs and Pruitt to 
determine probability and variance 
preferences. All of these gambles have 
an expected value of approximately 
zero. The gambles are organized into 
10 sets defined by the five rows and 
five columns. All gambles within a 
single column have the same prob- 


abilities but differ in variance. Choices 
among bets within each of these five 
column sets are used to study variance 
preferences. All gambles within a 
single row have equal variance but 
differ in probabilities. Choices within 
the row sets are used to study prob- 
ability preferences. 

Look at the bets within any row set. 
As you follow along from left to right, 
note that the probability of winning 
increases as the amount to win de- 
creases. Simultaneously, the probabil- 
ity of losing decreases and the amount 
to lose increases. Because of this con- 
founding, choices among these gambles 
cannot be interpreted unambiguously as 
preferences for specific probabilities. 
For example, consider the first and 
last bets of Set A: 1/10 to win 90¢ and 
9/10 to lose 10¢ vs. 9/10 to win 10¢ and 
1/10 to lose 90¢. If S places greater 
importance or weight on $w than on 
the other three attributes of the bets, 
he might consider that the first bet’s 
ninefold advantage in $w would 
outweigh its ninefold disadvantage in Pw, 
causing him to prefer the first bet over 
the last. His preference, heretofore 
interpreted as a “probability prefer- 
ence,” could be attributable instead to 
an overfocusing on $w. This same 
preference ordering could result from 
placing greater importance on mini- 
mizing $L. If Pw and PL were the 
dimensions carrying the most weight, 
then the ninefold differences in prob- 
abilities between the two bets would 
offset the differences in outcomes, and 
the last bet would be preferred over 
the first. Thus, Ss who focus pri- 
marily on the money aspects of the 
bets would appear to prefer low prob- 
abilities (long shots), while those who 
believe that probability is more im- 
portant than money would appear to 
prefer “almost sure things.” 


TABLE 1 
BETS USED TO DETERMINE PROBABILITY AND VARIANCE PREFERENCES 

                                             Set 
                      1            2            3            4            5 
                  Pw = 1/10    Pw = 3/10    Pw = 5/10    Pw = 7/10    Pw = 9/10 
Set   Variance    PL = 9/10    PL = 7/10    PL = 5/10    PL = 3/10    PL = 1/10 

A       .09       +$  .90      +$  .45      +$  .30      +$  .20      +$  .10 
                  −   .10      −   .20      −   .30      −   .45      −   .90 
B      1.00       +  2.97      +  1.50      +  1.00      +   .65      +   .33 
                  −   .33      −   .65      −  1.00      −  1.50      −  2.97 
C      5.00       +  6.75      +  3.40      +  2.25      +  1.45      +   .75 
                  −   .75      −  1.45      −  2.25      −  3.40      −  6.75 
D     25.00       + 14.85      +  7.60      +  5.00      +  3.25      +  1.65 
                  −  1.65      −  3.25      −  5.00      −  7.60      − 14.85 
E     90.00       + 28.40      + 14.45      +  9.50      +  6.20      +  3.15 
                  −  3.15      −  6.20      −  9.50      − 14.45      − 28.40 

Note.—The upper value in each cell is the winning payoff; the lower entry is the possible loss. 
Pw = probability of winning. 
PL = probability of losing. 

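
The argument about the first and last bets of Set A can be made concrete with a small sketch. The comparison rule used here (a weighted sum of log-ratios of the four attributes) and both weight profiles are assumptions introduced for illustration; the text claims only that unequal weighting of the dimensions can reproduce an apparent probability preference.

```python
from math import log

# First and last bets of Set A (amounts in dollars).
FIRST = {"pw": 0.1, "win": 0.90, "pl": 0.9, "loss": 0.10}
LAST = {"pw": 0.9, "win": 0.10, "pl": 0.1, "loss": 0.90}

# Larger Pw and $w favor a bet; larger PL and $L count against it.
DIRECTION = {"pw": +1, "win": +1, "pl": -1, "loss": -1}


def advantage(bet_a, bet_b, weights):
    """Weighted advantage of bet_a over bet_b (hypothetical rule)."""
    return sum(
        weights[k] * DIRECTION[k] * log(bet_a[k] / bet_b[k])
        for k in DIRECTION
    )


# Two hypothetical Ss: one attends mainly to money, one to probability.
payoff_focused = {"pw": 0.1, "win": 0.6, "pl": 0.1, "loss": 0.2}
probability_focused = {"pw": 0.6, "win": 0.1, "pl": 0.2, "loss": 0.1}

long_shot_preferred = advantage(FIRST, LAST, payoff_focused) > 0
sure_thing_preferred = advantage(FIRST, LAST, probability_focused) < 0
```

Because every attribute differs by the same ninefold ratio, the sign of the comparison depends only on which weights are larger, so the payoff-focused profile appears to prefer the long shot and the probability-focused profile the almost sure thing.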
Consideration of any of the column 
sets in Table 1 shows that variance 
preferences also are equivocal. Note 
that the amounts to win and lose are 
correlated highly with one another and 
with variance. Both payoffs increase 
as variance increases. An S who be- 
lieves that minimizing $L is more im- 
portant than maximizing $w would be 
observed to prefer bets having the low- 
est variance. Conversely, an S who 
places more importance on gains than 
on losses would appear to prefer bets 
with high variance. 

A large proportion of the systematic 
preferences found by Coombs and 
Pruitt (1960), Van der Meer (1963), 
and Slovic et al. (1965) were for bets 
having extreme levels of probability or 
variance. They were, therefore, con- 
sistent with the preferences that would 
have occurred had Ss simply been at- 
taching unequal importance to the 
probability and payoff dimensions. 

The more complicated interactions 
between variance preferences and prob- 
ability preferences found by Coombs 


and Pruitt cannot be explained simply 
by assuming that S always overempha- 
sizes one of the four risk dimensions. 
However, the slightly more complex 
strategies described below can account 
for these interactions and are consistent 
with the present orientation. 

In one common interaction pattern, 
S prefers the bet with smallest Pw 
within each of the Sets A through E 
(Table 1). In Sets 1 and 2 he prefers 
high variance, while in Sets 4 and 5 
he prefers low variance. From his 
probability preferences we infer that 
S believes $w and/or $L to be more im- 
portant than Pw and PL. One possible 
interpretation of S’s variance shift is 
that he gives most weight to whichever 
dollar dimension involves the largest 
amounts of money—a kind of saliency 
effect. Thus in Sets 1 and 2 the wins 
are larger than the losses, so S maxi- 
mizes the amount to win, choosing the 
larger-variance bets, while in Sets 4 
and 5, where losses are larger than 
wins, S minimizes his losses, choosing 
low-variance bets. 

Another common pattern (Pollatsek, 
1966) is one in which S most prefers 
bets with Pw = y (but also prefers 
bets with Pw = +; over those with Pw 
= 75) and prefers maximum variance 
when Pw > .5 and minimum variance 
when Pw < .5. This pattern could re- 
sult from the following strategy: If the 
Pw’s of two bets being compared differ 
greatly (e.g., Pw1 < .5 and Pw2 > 
.5), then choose the bet with the high- 
est Pw. But if a pair of bets have Pw’s 
that are similar or equal, then maximize 
the amount of the most probable out- 
come. This strategy seems to reflect 
the belief that probabilities are more 
important than payoffs; the latter are 
considered only when probabilities are 
similar. 
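
The two-step strategy just described can be written as a pairwise choice rule. In this sketch "differ greatly" is taken to mean that the two Pw values fall on opposite sides of .5, following the example in the text; that reading, and the handling of a tie at exactly .5, are assumptions.

```python
def most_probable_amount(bet):
    """Signed amount of the more probable outcome of (p_win, win, loss)."""
    p_win, win, loss = bet
    # A tie at p_win == 0.5 is resolved toward the win (an assumption).
    return win if p_win >= 0.5 else -loss


def choose(bet_a, bet_b):
    """Pick a bet: probabilities first, payoffs only when they are similar."""
    pa, pb = bet_a[0], bet_b[0]
    opposite_sides = (pa < 0.5 < pb) or (pb < 0.5 < pa)
    if opposite_sides:
        return bet_a if pa > pb else bet_b
    return max(bet_a, bet_b, key=most_probable_amount)


# Bets from Table 1, written as (p_win, win, loss).
low_var_70, high_var_70 = (0.7, 0.20, 0.45), (0.7, 6.20, 14.45)
low_var_30, high_var_30 = (0.3, 0.45, 0.20), (0.3, 14.45, 6.20)

pick_70 = choose(low_var_70, high_var_70)  # equal Pw above .5
pick_30 = choose(low_var_30, high_var_30)  # equal Pw below .5
pick_mixed = choose(low_var_30, low_var_70)  # Pw's on opposite sides of .5
```

Under these assumptions the rule selects the high-variance bet when Pw = .7, the low-variance bet when Pw = .3, and the higher-Pw bet when the probabilities straddle .5.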

Recent empirical evidence has been 
found to buttress these logical criti- 
cisms of the probability and variance 
preference data. Lichtenstein (1965) 
failed to find probability preferences 
among three-outcome gambles in which 
the usual confoundings between prob- 
abilities and payoffs were not present. 
Slovic and Lichtenstein (1968), by 
presenting bets in different formats, 
manipulated variance without changing 
the probabilities and payoffs that were 
explicitly displayed to Ss. Variance, 
when manipulated in this way, had little 
or no influence on the decisions of 
most Ss. 

The logical shortcomings in the de- 
sign of experiments that purport to 
provide evidence for probability and 
variance preferences, combined with the 
recent negative evidence from more 
carefully controlled studies, enhance the 
plausibility and attractiveness of the 
notion that decisions are determined by 
simple rules for processing information 
about probabilities and payoffs. In the 
hope of learning more about these 
rules, the following experiments were 
designed to permit precise quantitative 
study of the manner in which responses 
to gambles change as individual com- 
ponents of a gamble are manipulated. 


EXPERIMENT I 
Method 


In order to study the relative importance 
of each component of a gamble, a special 
type of gamble was constructed in which 
probabilities of winning and losing, as well 
as their respective payoffs, could be varied 
independently. This type of stimulus, called 
a duplex gamble, is illustrated in Fig. 1. A 
duplex gamble is represented by two large 
discs, each with a pointer on it. The left- 
hand disc is used for determining winnings. 
The right-hand disc determines losses. To 
play, S must spin the pointers on both the 
winning and losing discs. This means that 
he can win and not lose; lose and not win; 
both win and lose; or neither win nor lose. 
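
The four possible results of playing a duplex gamble can be enumerated directly. The sketch assumes that the two pointers are spun independently, which the use of separate discs suggests but the text does not state in so many words; the example values are those of the gamble in Fig. 1.

```python
from itertools import product


def duplex_outcomes(p_win, win, p_lose, loss):
    """List (won, lost, probability, net payoff) for a duplex gamble."""
    outcomes = []
    for won, lost in product((True, False), repeat=2):
        prob = (p_win if won else 1 - p_win) * (p_lose if lost else 1 - p_lose)
        net = (win if won else 0) - (loss if lost else 0)
        outcomes.append((won, lost, prob, net))
    return outcomes


# Pw = .4, $w = $1, PL = .2, $L = $4.
outcomes = duplex_outcomes(0.4, 1, 0.2, 4)
total_probability = sum(prob for _, _, prob, _ in outcomes)
expected_value = sum(prob * net for _, _, prob, net in outcomes)
```

The expectation over the four joint outcomes equals Pw · $w − PL · $L, here −$.40, so the duplex format leaves EV unchanged while letting the four dimensions vary independently.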

A list of the 27 bets used in this study 
appears in Table 2. For these bets, Pw and 
PL each occurred three times at each of three 
levels: .2, .4, and .8; $w and $L also occurred 
three times at each of three levels: $1, $2, 
and $4. The four risk dimensions had zero 
intercorrelation across bets. The average 
EV across the set was $0. 


FIG. 1. Example of a duplex gamble where 
Pw = .4, $w = $1, PL = .2, and $L = $4. 
(Winning disc: WIN $1 or WIN 0; losing disc: LOSE $4 or LOSE 0.) 
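
The design properties claimed for the 27 bets can be verified numerically. The rows below are transcribed from Table 2; where the scan is unclear, entries were chosen so that the printed EV column and the balanced design are both reproduced, which is an assumption of this sketch.

```python
from itertools import combinations

# (Pw, $w, PL, $L) for bets 1-27, transcribed from Table 2.
BETS = [
    (0.2, 2, 0.8, 2), (0.8, 1, 0.2, 4), (0.2, 1, 0.8, 4),
    (0.8, 2, 0.8, 1), (0.8, 2, 0.4, 4), (0.4, 4, 0.4, 1),
    (0.8, 1, 0.8, 2), (0.2, 2, 0.4, 1), (0.4, 1, 0.4, 4),
    (0.2, 1, 0.4, 2), (0.4, 2, 0.4, 2), (0.8, 2, 0.2, 2),
    (0.8, 4, 0.4, 2), (0.8, 4, 0.8, 4), (0.4, 1, 0.8, 1),
    (0.4, 4, 0.8, 2), (0.4, 2, 0.2, 1), (0.2, 4, 0.4, 4),
    (0.2, 1, 0.2, 1), (0.2, 4, 0.8, 1), (0.4, 4, 0.2, 4),
    (0.4, 1, 0.2, 2), (0.2, 4, 0.2, 2), (0.8, 1, 0.4, 1),
    (0.2, 2, 0.2, 4), (0.4, 2, 0.8, 4), (0.8, 4, 0.2, 1),
]


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


columns = list(zip(*BETS))
intercorrelations = [pearson(a, b) for a, b in combinations(columns, 2)]
evs = [pw * w - pl * l for pw, w, pl, l in BETS]
mean_ev = sum(evs) / len(evs)
```

With these entries all six intercorrelations vanish and the mean EV is zero, as stated.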


Subjects—The Ss were 213 undergrad- 
uates, paid for their participation and 
assigned to four groups. Each group judged 
the attractiveness of the same bets but did so 
under different instructions. The Ss did not 
actually play any of the gambles. 

Rating instructions—The 88 Ss in the 
rating group were asked to indicate their 
strength of preference for playing each bet 
on a bipolar rating scale. Attractive bets 
were rated between +1 (slight preference 
for playing) and +5 (strong preference for 
playing), inclusive. Unattractive bets were 
rated between —1 (slight preference for not 
playing) and —5 (strong preference for not 
playing), inclusive. 

Bidding instructions—Investigators study- 
ing gambling decisions have often asked Ss 
to describe their opinion about a gamble’s 
attractiveness by equating it with an amount 
of money such that they would be indifferent 
to playing the gamble or receiving the stated 
amount. These types of responses will be 
referred to here as “bids.” Three different 
bidding techniques were used in this study. 
Persons in the first bidding group were 
asked to state the largest amount of money 
they would be willing to pay E in order to 
play each bet. For an undesirable bet, they 
stated the smallest amount E had to pay 
them before they would play. This procedure 
has been used previously by Slovic et al. 
(1965), Lichtenstein (1965), and Coombs, 
Bezembinder, and Goode (1967). The Ss 
in the second bidding group were given 
ownership of a ticket to play each gamble. 
They were asked to state the least amount 
of money for which they would sell this 
ticket. For unattractive gambles, they stated 
the highest amount of money they would pay 
E in order not to play the gamble. This 
procedure has been used by Becker, DeGroot, 
and Marschak (1964), Lindman (1965), 
Coombs et al. (1967), and Tversky 
(1967a, 1967b). Following a procedure em- 
ployed by Coombs et al. (1967), Ss in the 
third bidding group had to state a fair price 
for a gamble not knowing whether they or E 
owned the right to play it. The maximum 
allowable bid for an attractive bet, under any 
instructional set, was $w, the most money the 
bet offered to win. The maximum allowable 
negative bid for an unattractive bet was $L. 


TABLE 2 
LIST OF THE 27 BETS EVALUATED BY EACH S 

Bet     Pw     $w     PL     $L        EV 
 1      .2     $2     .8     $2     −$1.20 
 2      .8      1     .2      4        .00 
 3      .2      1     .8      4     − 3.00 
 4      .8      2     .8      1     +  .80 
 5      .8      2     .4      4        .00 
 6      .4      4     .4      1     + 1.20 
 7      .8      1     .8      2     −  .80 
 8      .2      2     .4      1        .00 
 9      .4      1     .4      4     − 1.20 
10      .2      1     .4      2     −  .60 
11      .4      2     .4      2        .00 
12      .8      2     .2      2     + 1.20 
13      .8      4     .4      2     + 2.40 
14      .8      4     .8      4        .00 
15      .4      1     .8      1     −  .40 
16      .4      4     .8      2        .00 
17      .4      2     .2      1     +  .60 
18      .2      4     .4      4     −  .80 
19      .2      1     .2      1        .00 
20      .2      4     .8      1        .00 
21      .4      4     .2      4     +  .80 
22      .4      1     .2      2        .00 
23      .2      4     .2      2     +  .40 
24      .8      1     .4      1     +  .40 
25      .2      2     .2      4     −  .40 
26      .4      2     .8      4     − 2.40 
27      .8      4     .2      1     + 3.00 

The mean bid across the 27 gambles for 
the first, second, and third bidding groups 
was −59¢, +19¢, and −5¢, respectively. The 
relative sizes of these means match the re- 
sults obtained by Coombs et al. and reflect 
a tendency for Ss to want to be paid money 
rather than pay E. In other respects the 
behavior of the 125 Ss in the three bidding 
groups was similar, and their data were com- 
bined and designated as that of the bidding 
groups. 


Results 


Data analysis—The primary data 
analysis consisted of correlating each 
S’s responses with the levels of each 
risk dimension across the 27 gambles. 
Since the risk dimensions were them- 
selves uncorrelated, the correlations be- 
tween risk dimensions and responses 
are directly proportional to the weights 
in a linear regression equation char- 
acteristic of each S. The absolute mag- 
nitudes of these correlations thus pro- 
vide information about the relative im- 
portance of each dimension in deter- 
mining the responses. 
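
The proportionality between correlations and regression weights under an orthogonal design can be illustrated as follows. The sketch uses the complete 3 × 3 × 3 × 3 factorial of the four dimensions and a hypothetical, noise-free S whose weights and intercept are invented for the purpose.

```python
from itertools import product

P_LEVELS = (0.2, 0.4, 0.8)
D_LEVELS = (1.0, 2.0, 4.0)
bets = list(product(P_LEVELS, D_LEVELS, P_LEVELS, D_LEVELS))  # 81 bets

WEIGHTS = (3.0, 0.5, -1.0, -0.8)  # hypothetical weights for Pw, $w, PL, $L
INTERCEPT = -0.5                  # hypothetical constant
responses = [INTERCEPT + sum(w * x for w, x in zip(WEIGHTS, b)) for b in bets]


def cov(x, y):
    """Population covariance of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / n


columns = list(zip(*bets))
sd_y = cov(responses, responses) ** 0.5

# Simple slope of the response on each dimension taken alone.
slopes = [cov(col, responses) / cov(col, col) for col in columns]
# Correlation of the response with each dimension.
correlations = [cov(col, responses) / (cov(col, col) ** 0.5 * sd_y) for col in columns]
# Standardized regression weights implied by WEIGHTS.
standardized = [w * cov(col, col) ** 0.5 / sd_y for w, col in zip(WEIGHTS, columns)]
```

Because the dimensions are mutually uncorrelated, each simple slope recovers the corresponding weight and each correlation equals the standardized weight, so the pattern of correlations can be read as a pattern of relative importance.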

A rather unusual model of risky 
decision making underlies this data- 
analysis technique; namely, 


A(G) = μ + w1·Pw + w2·$w + w3·PL + w4·$L, 


where A(G) is the attractiveness of a 
gamble and the w’s are weights reflect- 
ing the relative importance of each di- 
mension. This model combines, in 
additive fashion, variables such as Pw 
and $w or PL and $L, which are usually 
thought to combine multiplicatively. In 
addition, it assumes that the import of 
probabilities and payoffs is a linear 
function of their objective values. 

The extent to which these assump- 
tions of additivity and linearity are 
violated by Ss’ behavior was tested in 
prior work with duplex bets. The Ss 
made either bidding or rating responses 
to a complete factorial set of 81 bets 
(three levels of each of the four risk 
dimensions) with one replication. 
Analysis of variance techniques, aug- 
mented by Hays’ (1963, pp. 406-407) 
method for partitioning variance, indi- 
cated that an average of about 75% 
of the total response variation was ac- 
counted for by a simple weighted com- 
bination of the four risk dimensions 
and less than 1% was due to non- 
linear use of these single dimensions. 


FIG. 2. Regression analysis of one S’s bidding responses. 
Prediction equation: BID = .64(Pw) + .05($w) − .20(PL) − .82($L) − .77. 


For Ss who rated bets, an additional 
1% of the response variance was due 
to interactions indicating multiplicative 
combining of Pw and $w or PL and $L. 
Under bidding instructions these inter- 
actions accounted for about 7% of the 
variance. Almost all of the remaining 
variance was error. 

The special advantage of the regres- 
sion technique and the model under- 
lying it is the fact that it permits inde- 
pendent assessment of the relative in- 
fluence of all four risk dimensions. It 
is fortunate, therefore, that the model 
produces a reasonably good fit to this 
type of data. 

Figure 2 illustrates the regression 
analysis for a single S in the bidding 
group. The scatterplots and correla- 
tions indicate that this S’s bids were 
influenced considerably more by vari- 
ation in Pw and $L than by variation 
in $w or PL. The bids correlated .74 
with the EV of the gambles and .92 
with combinations of the four risk 
dimensions weighted according to the 
prediction equation in Fig. 2. Had S 
been bidding the EV for each gamble, 
his responses would have correlated 
+.47 with Pw and $w and −.47 with 
PL and $L. 
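
The benchmark correlations for an S who simply bids EV can be reproduced in a short sketch. It uses the complete factorial of the three probability levels and three payoff levels rather than the 27-bet subset, which is an assumption of convenience.

```python
from itertools import product


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


bets = list(product((0.2, 0.4, 0.8), (1, 2, 4), (0.2, 0.4, 0.8), (1, 2, 4)))
ev = [pw * w - pl * l for pw, w, pl, l in bets]

names = ("Pw", "$w", "PL", "$L")
r_ev = {name: pearson(col, ev) for name, col in zip(names, zip(*bets))}
```

For this design the four correlations are equal in magnitude, positive for the winning dimensions and negative for the losing ones, and agree with the ±.47 cited above to two decimal places. An EV bidder therefore weights all four dimensions equally, which is the baseline against which the unequal weights of actual Ss should be read.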


Tables 3 and 4 present the correla- 
tions between risk dimensions and re- 
sponses for selected Ss from both 
groups as well as the averages over all 
Ss in each group. The group data in- 
dicated that a weighted combination of 
the risk dimensions correlated, on the 
average, .86 with both bids and ratings. 
Remarkably little shrinkage of these 
multiple correlations was subsequently 
found—about .016 on the average—on 
cross validation of an S’s weights with 
a second replication of bets. As one 
would expect, the weighted prediction, 
having four fitted parameters, corre- 
lated more highly with the responses 
than did the EV model. 

Of particular interest are the enor- 
mous differences in weights, both 
within and between Ss, shown in 
Tables 3 and 4. These Ss were chosen 
to emphasize these differences, but 
even for the entire group, an S’s high- 
est correlation with a dimension was, 
on the average, twice the size of his 
lowest correlation. These inequalities 
of weighting across dimensions have 
been replicated several times using real 
play conditions and as many as 81 
gambles instead of 27. These data 
indicate that the responses of many Ss 


TABLE 3 

UTILIZATION OF THE RISK DIMENSIONS BY SELECTED Ss IN THE RATING GROUP 

                            Correlations between Risk Dimensions 
                               and Ratings of Attractiveness 
Subject No.                  Pw         $w         PL         $L        R(a)    r with EV 

1                           .76        .33       −.19       −.28        .90        .77 
2                           .82        .14       −.33       −.14        .91        .66 
3                           .56        .29    [illeg.]      −.33        .90        .85 
4                           .42        .34    [illeg.]   [illeg.]       .92        .85 
5                           .32        .38       −.29       −.76        .95        .86 
6                           .26     [illeg.]     −.45       −.62        .86     [illeg.] 
Group Average (N = 88)      .48        .36       −.39       −.40        .86        .79 

Note.—The most highly weighted dimension for each S is underlined. 
(a) R is the correlation between the weighted sum of the risk dimensions and S’s ratings. 


were overwhelmingly determined by 
one or two of the risk dimensions and 
were remarkably unresponsive to large 
changes in the values of the less im- 
portant factors. 

The upper half of Table 5 indicates 
that the responses of approximately 
two-thirds of the Ss were influenced 
more by Pw than PL. This result pro- 
vides strong evidence that something 
other than the usual conception of sub- 
jective likelihood is required to explain 
the influence of stated probability on 
behavior. Since the events that defined 
Pw and PL were represented by identi- 
cal segments of identical discs, there is 
no reason to expect that the subjective 
probabilities of these events would dif- 
fer. This finding even violates the 
axiomatic (uninterpreted) SEU model 
because it indicates an interaction be- 
tween S(P) and U($): Fitting this 
data would require a different S(P) 
function when U($) is positive than 
when U($) is negative. 

The lower half of Table 5 illustrates 
that the regression technique repro- 
duces one of the most common findings 
in studies of risk taking—namely, that 
potential losses exert more influence 


TABLE 4 
UTILIZATION OF THE RISK DIMENSIONS BY SELECTED Ss IN THE BIDDING GROUP 

                            Correlations between Risk 
                               Dimensions and Bids 
Subject No.                  Pw         $w         PL         $L        R(a)    r with EV 

1                           .81        .13       −.27       −.30        .91     [illeg.] 
2                           .73        .01       −.47       −.07        .88        .70 
3                           .21     [illeg.]     −.27       −.49        .93        .82 
4                           .14        .67       −.22       −.49        .85        .65 
5                           .37        .22       −.63       −.33        .83        .81 
6                           .38        .31       −.11       −.80        .95        .80 
7                           .19        .10    [illeg.]   [illeg.]       .96        .69 
8                           .00        .00        .00      −1.00       1.00        .47 
Group Average (N = 125)     .40        .38       −.33       −.50        .86        .80 

Note.—The most highly weighted dimension for each S is underlined. 
(a) R is the correlation between the weighted sum of the risk dimensions and S’s bids. 


TABLE 5 

DISTRIBUTIONS SHOWING THE NUMBER OF Ss 
WHO WEIGHTED Pw MORE THAN PL, $w 
MORE THAN $L, AND VICE VERSA 

Relative Weighting               Rating Group   Bidding Group   Total 

Pw more important than PL             60              80          140 
PL more important than Pw             26              40           66 
$w more important than $L             27              37           64 
$L more important than $w             51              87          138 

Note.—Within each group, differences between the 
frequencies with which Pw exceeded PL and $L ex- 
ceeded $w were significant at p < .01 (sign test). 


on a gamble’s attractiveness than po- 
tential gains of the same magnitude 
(Atthowe, 1960; Lichtenstein, 1965; 
Mosteller & Nogee, 1951; Myers & 
Katz, 1962; Myers & Sadler, 1960; 
Rettig & Rawson, 1963; Suydam & 
Myers, 1962). 
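
The sign tests reported with Table 5 can be approximated with an exact binomial computation. The text does not say whether the tests were one- or two-tailed or whether a normal approximation was used, so the two-sided exact version below is an assumption.

```python
from math import comb


def sign_test_p(count_a, count_b):
    """Two-sided exact sign test for two counts (ties already excluded)."""
    n = count_a + count_b
    k = min(count_a, count_b)
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2.0 * tail)


# Frequencies from Table 5.
comparisons = {
    "rating, Pw vs PL": (60, 26),
    "bidding, Pw vs PL": (80, 40),
    "rating, $L vs $w": (51, 27),
    "bidding, $L vs $w": (87, 37),
}
p_values = {label: sign_test_p(a, b) for label, (a, b) in comparisons.items()}
```

Each comparison tests the null hypothesis that an S is equally likely to weight either member of the pair more heavily.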

Another finding of interest is the 
fact that Ss’ mean ratings or bids in- 
creased monotonically as Pw increased 
from .2 to .4 to .8 and decreased 
monotonically as PL increased. This 
indicates that if attractions or aver- 
sions to specific probability levels did 
exist, they were not strong enough to 
overcome the basic tendency to like a 
bet more as Pw increases and as PL 
decreases. 

Introspective comments from Ss.— 
After they had completed their re- 
sponses, Ss were asked to write a para- 
graph describing how they went about 
evaluating the attractiveness of the 
duplex bets. Their comments were 
quite informative and help support our 
ideas about importance beliefs. A 
large number clearly indicated that 
they believed certain dimensions to be 
far more important than others in de- 
termining their responses. For the 
most part, the regression weights de- 
rived from their responses reflected 


their stated strategies. A selection of 
comments from them is given below 
with their correlational weights for di- 
mensions Pw, $w, PL, and $L in 
brackets. 


Rating group: 

I found playing the bets most desirable 
only when I had an excellent chance of 
winning, i.e., 80% to win and 20% to lose. 
I found I looked at the percentages only 
and really didn’t pay much attention to the 
amount I would win or lose [weights = .64, 
.00, —.52, .00]. 

The desirability of playing balanced mainly 
between the probability of winning and the 
amount possibly to be lost. If these factors 
were fairly even, the amount to win and 
probability of losing came into play [weights 
= .53, .42, −.20, −.48]. 


Bidding group: 


I decided wholly on the basis of amount 
to lose. I’m a very conservative gambler 
[.00, .00, .00, —1.00]. 

I was unwilling to lose very much—no 
matter how much I might win. I was more 
concerned with the probability of losing and 
the amount to lose and would require that 
the operator pay me enough for playing to 
aesae for most losses [.01, .24, —.39, 
—.62]. 

I don’t play unless the odds are in my 
favor [.73, .00, .00, —.30]. 


Comparison of bidding and rating 
groups.—One of the most important 
results of this experiment is the find- 
ing that, for the groups as a whole, 
ratings correlated more highly with Pw 
than with any other dimension, while 
bids correlated most highly with $r 
(see the group means in Tables 3 and 
4). A multivariate analysis of vari- 
ance (Vanderplas, 1960) showed the 
differences between the average weight- 
ing profiles of each group to be sig- 
nificant, p < .001, χ²(4) = 27. The 
generality of the differences between 
the rating and bidding groups is further 
illustrated by Table 6, which indicates 
the percentage of Ss within each group 
for whom a particular risk dimension 


TABLE 6 

NUMBER AND PERCENTAGE OF Ss FOR WHOM 
A GIVEN RISK DIMENSION WAS 
MOST IMPORTANT 

                         Risk Dimension 
                     Pw      $w      PL      $L 

Rating Group 
  Number             44       8      13      23 
  Percentage        (50)    (09)    (15)    (26) 
Bidding Group 
  Number             23      24      12      66 
  Percentage        (18)    (19)    (10)    (53) 

Note.—A χ² test indicated that distributions within 
each group differed from uniformity at p < .01. The 
distribution for the rating group differed significantly 
from that of the bidding group (p < .01). 


was most important. Whereas 50% 
of the Ss in the rating group relied 
predominantly on Pw, only 18% of the 
Ss in the bidding group did. Similarly, 
the percentage of Ss for whom $L was 
most important was 26% in the rating 
group but 53% in the bidding group. 
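
The χ² tests summarized in the note to Table 6 can be recomputed from the frequencies. The sketch compares each statistic with 11.345, the tabled .01 critical value of χ² with 3 degrees of freedom; treating the between-group comparison as a 2 × 4 test of homogeneity is an assumption about the analysis actually run.

```python
RATING = (44, 8, 13, 23)    # Pw, $w, PL, $L
BIDDING = (23, 24, 12, 66)
CRITICAL_01_DF3 = 11.345    # chi-square critical value, df = 3, alpha = .01


def chi2_uniform(counts):
    """Goodness-of-fit statistic against equal expected frequencies."""
    expected = sum(counts) / len(counts)
    return sum((c - expected) ** 2 / expected for c in counts)


def chi2_homogeneity(row_a, row_b):
    """Statistic for a 2 x k table of two independent groups."""
    total = sum(row_a) + sum(row_b)
    stat = 0.0
    for row in (row_a, row_b):
        for observed, a, b in zip(row, row_a, row_b):
            expected = sum(row) * (a + b) / total
            stat += (observed - expected) ** 2 / expected
    return stat


stat_rating = chi2_uniform(RATING)
stat_bidding = chi2_uniform(BIDDING)
stat_between = chi2_homogeneity(RATING, BIDDING)
```

All three statistics exceed the critical value, in agreement with the significance levels given in the note.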

The differences between rating and 
bidding seem to illustrate the influence 
of information-processing considera- 
tions on the method by which a gamble 
is evaluated. Apparently the require- 
ment that Ss evaluate a gamble in 
monetary units when bidding forces 
them to attend more to the payoff di- 
mensions—also expressed in terms of 
dollars—than they do when rating a 
bet. 

The bidding and rating tasks can be 
conceptualized as involving a two-stage 
process. In Stage 1 S decides whether 
the bet is one he would or would not 
like to play. In Stage 2 he quantifies 
his first bipolar judgment by indicating 
the degree to which he likes or dis- 
likes the bet. In an attempt to under- 
stand the differences between rating 
and bidding groups, the importance of 
the risk dimensions was examined 
separately within each of these stages. 

First, every S’s response to a given 
bet was dichotomized into a 0 (if he 


found the bet unattractive) or a 1 (if 
he found the bet attractive). Next, 
these dichotomous responses were cor- 
related with the levels of each cue as 
was done in the earlier analyses. This 
provided an index of the importance 
of each dimension in Stage 1 of the 
response process. 

Two additional regression analyses 
were conducted on each S’s responses 
—one for the subset of bets he found 
attractive, the other for unattractive 
bets. The purpose of each analysis 
was to determine the importance of 
each risk dimension in determining re- 
sponses in Stage 2. Since there was 
often a low but nonzero correlation 
between pairs of risk dimensions within 
each of these subsets of bets, the cor- 
relations between responses and the 
levels of the dimensions could not, by 
themselves, be used to represent the 
independent importance of the dimen- 
sions. Therefore, Hoffman’s (1960) 
technique was used to determine the 
percentage of variance in an S’s re- 
sponses that could be attributed to his 
use of each risk dimension after cor- 
recting for biases due to nonorthog- 
onality among the dimensions. The 
counterpart of this percentage of vari- 
ance index in the original analysis and 
in the Stage 1 analysis where orthog- 
onality holds is simply the squared 
values of the correlations between 
levels of the risk dimensions and the 
responses. 

Table 7 presents the results of the 
stage-by-stage analyses. All values 
have been transformed into percentage 
of variance estimates to facilitate com- 
parison. Part a of the table repeats 
the comparison based on the total set 
of responses and represents the com- 
bined effect of both Stage 1 and Stage 
2. Part b presents the results of the 
Stage 1 analysis. The data indicate 
that the mean weights for the two 


TABLE 7 

PERCENTAGE OF VARIANCE ACCOUNTED FOR 
BY USE OF THE RISK DIMENSIONS DURING 
EACH STAGE OF THE RESPONSE PROCESS 

                                 Risk Dimensions 
                           Pw      $w      PL      $L 

Total Response (Part a) 
  Rating Group            24.9    14.1    16.3    18.1 
  Bidding Group           18.0    16.7    12.9    27.2 
Stage 1 (Part b) 
  Rating Group            18.6    10.6    12.9    11.7 
  Bidding Group           18.3    10.1    13.5    12.2 
Stage 2 (Part c) 
  Attractive Bets 
    Rating Group          30.2    11.8    12.8    16.7 
    Bidding Group          6.4    49.8     5.3  [illeg.] 
  Unattractive Bets 
    Rating Group          10.6    13.1    15.8    16.2 
    Bidding Group          4.1     7.9     8.2    54.4 


groups were remarkably similar at the 
end of the first stage. For both groups 
the most important determiner of the 
dichotomous (like-dislike) response 
was Pw. Thus, the differences found 
by analysis of the final responses must 
have occurred during Stage 2. 

The results of the Stage 2 analyses 
are presented in Part c of Table 7. It 
is apparent that, when an S in the 
bidding group found a bet attractive, 
his judgment of the relative magnitude 
of its attractiveness was principally de- 
termined by the size of $w; and when 
he disliked a bet, $L was the primary 
determinant of the degree of dislike. 
This weighting pattern was character- 
istic of about 90% of the Ss in the 
bidding group. For Ss in the rating 
group, the difference between the 
method used to estimate degree of at- 
tractiveness and that used to determine 
the degree of unattractiveness was con- 
siderably less marked, although there 
was some tendency for Pw to be used 
more when quantifying attractiveness. 

On the average, categorization of 
the entire set of bets into attractive 
and unattractive subsets reduced the 
variance of the risk dimensions by only 
15%. In addition, the mean variances 
of the risk dimensions within each of 


the attractiveness categories were vir- 
tually identical for the two groups of 
Ss. Therefore, restriction of range can 
be discounted as a possible explanation 
for the group differences shown in 
Part c of Table 7. 

Both bids and ratings presumably 
reflect the same underlying character- 
istic of a bet—namely, its worth or 
attractiveness. Why should Ss employ 
probabilities and payoffs differently 
when making these related responses? 
The introspections of one S in the 
bidding group are especially helpful 
in providing insight into the type of 
cognitive process that could lead bid- 
ding responses to be overwhelmingly 
determined by just one payoff factor. 
This S said, 


If the odds were . . . heavier in favor of 
winning . . . rather than losing ..., I 
would pay about 4 of the amount I would 
expect to win. If the reverse were true, I 
would ask the experimenter to pay me about 
: +» 4 of the amount I could lose. 


Note the initial dependence on prob- 
abilities in Stage 1 followed in Stage 
2 by a complete disregard for any 
factor other than the winning payoff 
for attractive bets or the losing payoff 
for unattractive bets. After deciding 
he liked a bet, this S used the amount 
to win, the upper limit of the amount 
he could bid, as a starting point for 
his response. He then reduced this 
amount by a fixed proportion in an 
attempt to integrate the other dimen- 
sions into the response. Likewise, for 
unattractive bets, he used the amount 
to lose as a starting point and adjusted 
it proportionally in an attempt to use 
the information given by the other 
risk dimensions. Such adjustments,
neglecting to consider the exact levels 
of the other dimensions, would make 
the final response correlate primarily 
with the starting point—one of the pay-
offs in this case. It seems likely that
the 90% of the Ss in the bidding group
who relied primarily on payoff factors 
during Stage 2 were using a similar 
starting point and adjustment proce- 
dure. 
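The starting point and adjustment procedure described above can be illustrated with a minimal Python sketch. The function name, the Stage 1 rule (a bet is called attractive when Pw exceeds PL), and the adjustment fraction of 0.5 are hypothetical choices made for illustration only; they are not values reported in the experiment.

```python
def two_stage_bid(p_win, amount_win, p_lose, amount_lose, fraction=0.5):
    """Return a signed bid: positive means S pays to play,
    negative means S asks to be paid to play.

    Hypothetical sketch: the Stage 1 rule and `fraction` are assumptions.
    """
    # Stage 1: classify the bet using the probabilities only (assumed rule).
    attractive = p_win > p_lose
    # Stage 2: anchor on a single payoff and reduce it by a fixed proportion,
    # ignoring the exact levels of the other risk dimensions.
    if attractive:
        return fraction * amount_win
    return -fraction * amount_lose


# Illustrative duplex bets: (Pw, $w, PL, $L); the values are made up.
bets = [
    (0.8, 4.0, 0.2, 1.0),
    (0.8, 2.0, 0.4, 3.0),
    (0.2, 4.0, 0.6, 2.0),
    (0.4, 1.0, 0.8, 4.0),
]
for bet in bets:
    print(bet, two_stage_bid(*bet))
```

Under these assumptions, bids for attractive bets vary only with the amount to win and bids for unattractive bets vary only with the amount to lose, which is the pattern of payoff dominance described in the text.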


EXPERIMENT II 


Earlier it was hypothesized that 
probability preferences among standard 
two-outcome bets such as those in 
Table 1 could be interpreted in terms 
of the relative importance of probabil- 
ities and payoffs to the decision maker. 
Specifically, preferences for Pw = .9 
or .7 were equated with the notion 
that probabilities were more important 
than payoffs. Preferences for long 
shots (Pw = .1) were hypothesized to 
reflect greater importance placed on 
payoffs rather than probabilities. Ex- 
periment II attempted to test these 
hypothesized relationships by compar- 
ing regression weights derived from 
duplex gambles with probability prefer- 
ences obtained from ratings and choices 
among standard two-outcome gambles. 

Seventy-eight undergraduates served 
as Ss in the first part of this experi- 
ment. They rated the attractiveness 
of a set of duplex bets similar to those 
in Table 2. They also selected their 
favorite bet from each row of the 
matrix of standard gambles in Table 1. 
At the end of the experiment they 
actually played several of their pre- 
ferred gambles to determine their sal- 
ary for participating. An S was con- 
sidered to have a stable preference for 
a particular probability if he selected 
the bet having that probability at three 
or more of the five levels of variance. 
There were 56 Ss who satisfied this 
requirement. They were further classi- 
fied into two groups on the basis of 
their regression weights derived from 
the duplex bets: one group contained 
Ss for whom either Pw or PL had the
highest regression weight, the other 


TABLE 8

RELATIONSHIP BETWEEN REGRESSION WEIGHTS AND PROBABILITY PREFERENCES

Columns: Probability Preference; Dimension Having Highest Regression Weight (Pw or PL; $w or $L). Rows are given separately for the Choice Method and the Rating Method.

Note.—Cell entries are numbers of Ss in the designated categories.


group consisted of those for whom 
either $w or $L was most important.
The upper part of Table 8 presents 
the frequency distribution of these two 
categories at each level of probability 
preference. It is clear that the hypoth- 
esized relationships between probabil- 
ity preferences and the relative import- 
ance of probabilities vs. payoffs were 
confirmed. Persons with preferences 
for high Pw, when choosing among the 
standard bets, weighted probabilities 
more heavily than payoffs when rating 
the duplex gambles. Persons who pre- 
ferred low Pw weighted payoffs more 
highly than probabilities. 
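The two classification rules used in this analysis (a stable probability preference, and the dimension with the highest regression weight) can be sketched in Python as follows. The function names, the dictionary keys, and the use of absolute values when comparing weights are assumptions of this sketch, not details given in the text.

```python
from collections import Counter


def stable_preference(choices, min_count=3):
    """Return the preferred Pw if it was chosen at `min_count` or more of the
    variance levels, otherwise None.

    `choices` lists the Pw of the bet selected at each of the five levels.
    """
    prob, count = Counter(choices).most_common(1)[0]
    return prob if count >= min_count else None


def dominant_type(weights):
    """Classify an S by the risk dimension with the largest regression weight.

    `weights` maps the (assumed) keys 'Pw', '$w', 'PL', '$L' to weights;
    absolute values are compared, which is an assumption of this sketch.
    """
    top = max(weights, key=lambda name: abs(weights[name]))
    return "probability" if top in ("Pw", "PL") else "payoff"


print(stable_preference([0.9, 0.9, 0.7, 0.9, 0.1]))
print(dominant_type({"Pw": 0.26, "$w": 0.47, "PL": 0.25, "$L": 0.51}))
```

The second call uses the average weights reported in the text for the Ss who preferred long shots.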

The nine Ss who preferred Pw = .7,
yet gave highest weight to $w or $L,
represent a conspicuous exception to
the predicted relationship. However,
examination of their regression weights
showed that the payoff dimensions
strongly dominated the probability di-
mensions for only one of them. For
the other eight Ss, the average weights
were Pw: .39, $w: .40, PL: .39, $L: .49.
Probabilities had far more influence on
their ratings than in the case of the
six Ss who preferred Pw = .1 and,
according to prediction, weighted $w
or $L most highly. The average weights
for these six Ss were Pw: .26, $w: .47,
PL: .25, $L: .51, showing a clear domi-
nance of payoffs over probabilities.
This experiment was replicated with 
one slight change. Instead of picking 
one bet in each row of the matrix of 
standard bets to indicate their prob- 
ability preferences, Ss rated the attrac- 
tiveness of these bets exactly as they 
had rated the duplex gambles. Each 
standard bet was rated twice. An S 
was considered to have a preference 
for a particular probability level if bets 
having that probability had the highest 
average rating. Eighteen of the 22 
undergraduate Ss who served in this 
replication experiment could be classi-
fied into a probability preference cate-
gory on the basis of this procedure.
The relationship between these prob-
ability preferences and regression
weights for these Ss is shown in the
lower part of Table 8. Again, Ss who
preferred standard gambles with high
Pw weighted Pw or PL most highly
on the duplex task, while Ss who pre-
ferred long shots weighted $w or $L
most highly.

These data demonstrate that weights
derived from evaluations of duplex
gambles can predict choices and ratings
of the more typical gambles where PL
= 1 − Pw. Probability preferences may
thus be reinterpreted as resulting from 
strategies based on the relative im- 
portance of the basic risk dimensions. 


Discussion 


Descriptive theories of risk taking al-
ways have been closely linked with norma-
tive models. When logical considerations
indicated that people should behave ac-
cording to certain prescribed principles,
these principles were incorporated into 
descriptive models against which perform- 
ance could be compared. While it is 
undeniably valuable to employ normative 
models as starting points for descriptive 
theory, preoccupation with normative as- 
pects seems, in this case, to have led to 
neglect of certain psychological aspects of 
the risk-taking process. The purpose of 
the present research was to investigate the 
usefulness of some neglected psychological 
variables—in particular, a person’s beliefs 
about the relative importance of prob- 
abilities and payoffs and his ability to act 
on the basis of these beliefs when process- 
ing the information contained in the de- 
scription of a gamble. 

These experiments were not designed 
to test the SEU model. The results do 
cast doubts, however, on the common in- 
terpretation of that model, which regards 
S(P) as a measure of subjective likeli-
hood and U($) as a measure of subjec-
tive worth. In Exp. I, the inequalities
found among risk dimensions with regard 
to their influence on the attractiveness of 
a gamble were often too large to be ac- 
counted for by the rather close corre- 
spondences between subjective and objec- 
tive probabilities and between utility and 
money, reported in other studies (Ed- 
wards, 1955, 1962; Robinson, 1964; Shu- 
ford, 1961; Stevens & Galanter, 1957; 
Tversky, 1967a, 1967b). The finding that
Pw has more import than PL is particu-
larly critical, since it constitutes a viola-
tion of the assumption, in the SEU model,
that there is just a single probability func-
tion, S(P), which is independent of the
payoffs. Finally, the discrepancies be-
tween ratings and bids for the same bets
suggest that the S(P) and U($) functions 
change as a result of changes in response 
mode. If S(P) and U($) functions can 
be altered drastically by a change in re- 
sponse mode (which seems quite unre- 
lated to subjective worth or subjective 
likelihood), then surely these functions 
have been misinterpreted. 

The above criticisms should not be con- 
strued as implying that the concepts of 
subjective probability and utility are ir-
relevant to risky decisions. On the con-
trary, these concepts belong in the reper-
toire of factors that influence decisions
(and, accordingly, influence regression 
weights). The authors’ objection is to 
the position that subjective likelihoods and 
subjective values are the sole determiners 
of decisions. 

Other aspects of the work reported here 
focus on the concepts of probability and 
variance preferences. The designs of 
previous experiments which purportedly 
demonstrated probability preferences con- 
founded the basic risk dimensions in a 
manner that allows for alternative ex-
planations of the results. Our interpreta-
tion of these previous results involved
Ss’ beliefs about the relative importance
of these basic dimensions. In the duplex
bets of Exp. I, where the risk dimensions
were independent, no probability prefer-
ences were found strong enough to coun-
teract the natural affinities for large Pw
and small PL. Experiment II employed
both duplex gambles and standard gam-
bles; the regression weights, derived from
the duplex bets and reflecting Ss’ dif- 
ferential use of the risk dimensions, were 
highly predictive of the probability pref- 
erences found among the standard bets. 

While it is tempting to assert that 
beliefs about the relative importance of 
probabilities and payoffs completely deter- 
mine both probability preferences and the 
weighting coefficients, these data neither 
confirm nor disconfirm this. Slight prob- 
ability preferences may actually exist and 
may be influencing the weighting coeffi-
cients. For example, a slight attraction
to Pw = .2 coupled with a slight disutility
for Pw = .8 would cause S’s responses to
change less as Pw changed from .2 to .8
than they would if such probability pref- 
erences did not exist. This would con- 
tribute to a smaller regression weight for 
the dimension of Pw. 

Confounding of the risk dimensions 
also is found in the design of studies 
showing variance preferences. Again, it 
is here argued that fairly simple strategies 
based on Ss’ differential weighting of the 
risk dimensions can account for variance 
preferences. Relevant supporting data 


may be found in Slovic and Lichtenstein 
(1968). 

The picture of the decision-making 
process that emerges from the present 
study is one of a person struggling to 
integrate several sources of information 
into a single choice or judgment. The 
decision maker is guided by certain beliefs 
(e.g., probabilities are more important 
than payoffs), which he combines with 
strategies designed to make his task less 
complex. One such strategy is to sub-
divide the evaluation of a gamble into two
stages. In the first stage, gambles are 
classified as either attractive or unattrac- 
tive. In Stage 2, the degree of attractive- 
ness is quantified. When bidding, this 
quantification seems to proceed as a crude 
adjustment of $w if the bet is attractive 
or $L if the bet is unattractive.

The notion that gambling decisions re- 
quire complex information processing 
provides a framework for understanding 
a number of other findings that might 
otherwise seem unrelated. The first of 
these is that Ss’ choices among bets be- 
come much less orderly under conditions 
that foster extremely high motivation to 
make the correct decision. Specifically, 
Slovic et al. (1965) and Pollatsek (1966) 
provide evidence indicating that choices 
become less consistent, transitive, and 
single peaked when Ss are motivated and 
interested in their task. When they are 
bored or overworked by being forced to 
make hundreds of decisions about these 
complex stimuli, they tend to use the 
simplest possible weighting strategies, 
such as “always select the gamble with 
the higher probability of winning,” or 
“always minimize the maximum loss.”
Such strategies reduce the cognitive strain 
of choosing among stimuli whose loca- 
tions on the risk dimensions are especially 
designed to maximize conflict. 

When a person is asked to choose 
among gambles having equal probabilities 
of winning and losing and equal expected 
value (such as bets within Column 1 of 
Table 1), he has only two conflicting 
dimensions to integrate in order to make 
his choice. These are $w and $L. How-
ever, when he is asked to choose among
bets equal in EV and variance (such as
those within Row 1 of Table 1), there 
are four conflicting dimensions to in- 
tegrate. The latter should be a more diffi- 
cult task. This may explain why Coombs 
and Pruitt (1960), Van der Meer (1963), 
and Pollatsek (1966) found choices 
among the latter types of gambles to be 
less consistent and less transitive than 
choices involving only the two payoff 
dimensions. 

Another related finding is that the de- 
gree to which a person maximizes EV 
when choosing among gambles decreases 
as the number of risk dimensions that 
need to be processed increases (Herman 
& Bahrick, 1966). Apparently the greater 
complexity resulting from the need to 
integrate a larger number of risk dimen- 
sions into a decision leads persons to em- 
ploy modes of processing that are incom- 
patible with the EV model. 

The finding in Exp. I that Ss rely 
more on payoffs when bidding than when 
rating a bet suggests a possible explana- 
tion for some long-puzzling inconsistencies 
in data bearing on the relationship be-
tween objective and subjective probabil-
ities. Quite a number of studies (e.g.,
Edwards, 1962; Philip, 1947; Robinson, 
1964; Shuford, 1961; Stevens & Galanter, 
1957; Teichner, 1962) have asked Ss to 
estimate subjective probabilities corre- 
sponding to relative frequencies or vis- 
ually displayed proportions. The results 
indicate a very close correspondence be- 
tween objective frequency or proportion 
and its subjective counterpart. In one of
the few attempts to have Ss estimate sub- 
jective probabilities corresponding to 
stated numerical probabilities in gambles, 
Dale (1959) found that low probabilities 
were underestimated and high probabil- 
ities were overestimated. On the other 
hand, when Ss evaluate bets by bidding, 
the subjective probability functions in- 
ferred from these bids have the reverse 
shape of those found by Dale; i.e., a given 
change in objective probability produces 
a smaller change in the corresponding 

subjective probability (Lindman, 1965; 
Nogee & Lieberman, 1960; Preston & 
Baratta, 1948; Tversky, 1967a, 1967b). 


These relatively flat subjective probability 
functions are exactly what one would 
expect to find in bidding studies, since the 
introduction of a monetary response mode 
increases the saliency of the money dimen- 
sions and decreases the relative impor- 
tance of the probabilities. 

The overdependence of bids on payoffs 
also can explain a finding that puzzled 
Lindman (1965). Lindman had Ss bid
for a number of gambles and also make 
paired-comparison choices among triplets 
of these gambles. He noted a consistent 
discrepancy between bids and paired-com- 
parison preference orderings. Examina- 
tion of the bids indicates that they were 
ordered almost perfectly according to the 
larger of the values, $w or $L, whereas the
paired-comparison choices, like the rat- 
ings and Stage 1 decisions in the present 
first experiment, were not so dependent 
on payoffs. 

The regression analysis technique used 
here serves as a convenient overall sum- 
mary of the influence of each risk dimen- 
sion on Ss’ judgments. While the results 
of such analysis may be due to a number 
of different considerations, two concepts 
have here been emphasized: (a) the 
weights may reflect Ss’ belief that some 
of the risk dimensions are more important 
for, or more relevant to, the task at hand; 
and (b) change in weights across tasks 
indicates changing strategies due to Ss’ 
attempts to ease their information-process-
ing problems. The simplicity and de-
scriptive sensitivity of the regression tech-
nique should make it useful for studying
other aspects of risk-taking behavior such 
as the effect of display variables, the rela- 
tionship between risk taking and per- 
sonality, or the relationship between risk 
taking and real life decision making.


FORGETTING CURVES WITH SEMANTIC, PHONETIC,
GRAPHIC, AND CONTIGUITY CUES¹


ALBERT S. BREGMAN 
McGill University 


Forgetting curves were obtained by presenting nouns 1 at a time in
a long sequence with recall tests interspersed in the series. 4 types of
cues were used in tests: contiguous nouns, phonetic attributes, graphic
attributes, and semantic attributes. Curves obtained with each cue
could be factored into an STM and an LTM component. Recall
with attribute cues was highest, the relative effectiveness of different
ones depending on the retention interval.


In the test phase of most memory
experiments one of two types of cue
is provided for S: (a) an instruction
to recall the most recently presented
item, or (b) the presentation of a stim-
ulus context, such as another item, with
which the target item has been arbi-
trarily associated on some prior occa-
sion. The former may be called a
“recency” cue, and the latter, a “con- 
tiguity” cue. These two cues are not 
the only, or even main, types of cues 
used for retrieval in natural life verbal 
memory situations. Such questions as 
“What is your doctor’s name?” specify 
the target information by giving some 
of its attributes or descriptive features 


¹ This research was supported in part by
0803-01 to the Harvard Center for Cognitive
search Council of Canada Grant APA-127.

—e.g., that the target is a name, that
it belongs to a doctor, and that this 
doctor treats you. Attributes and 
their effectiveness as recall cues have 
not been extensively investigated up to 
this time. However, evidence exists 
that suggests that these attributes or 
descriptive features are elementary 
building blocks in adult human mem- 
ory. Bregman and Chambers (1966) 
have found that items in a visual mem- 
ory task were not remembered all or 
none, but when scored in terms of re- 
tained attributes, retention was all or 
none. 

The present experiments study the 
memory for verbal materials when 
attribute cues are used. Specifically, 
there is a question as to whether with 
this form of cuing the same type of for- 
getting curve will be found as with 
contiguity cuing. Waugh and Nor-
man (1965), using contiguity cuing,
have found, both in their own experi-
ments and others, highly reproducible
forgetting curves over short intervals. 
These, they show, are capable of being 
resolved into two components, a “pri- 
mary” memory whose loss of material 
is exponential having a half-life of four 
or five items (intervening between 
presentation and test), and a “sec- 
ondary” memory which does not lose 
material over the delay intervals used 
in the reported experiments. The two 
together produce a forgetting curve 
which drops rapidly at first, and then 
levels out abruptly. The present ex- 
periments looked to see whether the 
same results would be found when 
attributes were used as cues. 
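The shape of the two-component curve described above can be illustrated with a minimal Python sketch. The half-life of 4.5 items is chosen from the "four or five items" range given in the text; the secondary-memory level of 0.2, the assumption that primary memory starts at 1.0 at a delay of zero, and the rule for combining the two components (treating them as independent) are assumptions of this sketch.

```python
def primary(delay, half_life=4.5):
    """Probability that an item is still in primary memory after `delay`
    intervening items, assuming exponential loss with the given half-life."""
    return 0.5 ** (delay / half_life)


def recall_probability(delay, secondary=0.2, half_life=4.5):
    """Probability of recall from either component.

    `secondary` is a constant (hypothetical) secondary-memory level; the two
    components are combined as if independent, which is an assumption here.
    """
    p = primary(delay, half_life)
    return p + secondary - p * secondary


# Delays measured in intervening items (illustrative values).
for delay in (1, 2, 3, 6, 12, 24, 48, 96):
    print(delay, round(recall_probability(delay), 3))
```

With these values the curve drops rapidly over the first few items and then levels out near the secondary-memory level.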

A theory that might lead one to ex- 
pect different retention curves with 
attribute cues is that of Tulving 
(1966). He has suggested that it is 
the type of cue used by S in retrieving 
the information, rather than the type of 
storage mechanism, that is responsible 
for the apparent difference between 
long-term memory (LTM) and short- 
term memory (STM). He has pro- 
posed that ancillary information stored 
with a target item, such as its phonetic 
pattern or conceptual category, is used 
in retrieval. According to this theory, 
some types of ancillary information 
(such as the phonetic pattern) are 
interfered with by subsequent inputs 
much more easily than are other types 
(such as the conceptual features). The 
two-factor forgetting curves are the 
resultant, then, of the forgetting, at 
different rates, of two types of stored 
materials in one and the same storage 
mechanism, rather than the product of 
two different storage mechanisms. If 
Tulving’s theory is correct, forgetting 
curves should be seen with quite dif- 
ferent decay rates, depending on the 
cue which S is given to retrieve the 
item; the contiguity cue (and perhaps 
the phonetic and graphic cues) might 


be expected to have a decay rate re- 
sembling that of the STM component 
of two-factor forgetting curves, and the 
semantic cue to have a decay rate re- 
sembling the LTM component, i.e., a 
decay that is rather gradual from the 
presentation point onwards. Each of 
these curves might well be a simple 
exponential decay function, the two- 
factor curve in ordinary recall data 
being a composite of simple curves 
decaying at slow and fast rates. 


METHOD 


The present experiment provided Ss with 
four different sorts of cues to retrieve nouns 
which had been presented earlier, and varied 
the interval between the original presentation 
and the test to obtain forgetting curves. 

Each S received a booklet containing a 
long series of nouns (one per page) which 
he exposed to himself at a fixed rate of one 
item every 3 sec. in time with an auditory 
signal. Some pages in the booklet, instead 
of having a noun printed on them, had a test 
which provided a cue for the recall of one of 
the prior nouns in the series and a space to 
write the answer. The Ss were required 
to respond in the 3-sec. period during which
that page was exposed; if they were already
writing when the tone sounded, they were al- 
lowed to complete their answer before turning 
the page. The fast rate was, in part, an 
attempt to minimize guessing effects. This 
design provided a continuous flow of mate-
rials and questions. Its basic feature is that
since tests come unpredictably, Ss keep 
working in a uniform manner. 

The delays between the presentation of a 
target item and its test were 1, 2, 3, 6, 
12, 24, 48, and 96, where a delay of 1 means 
that the test appeared on the first booklet 
page following the noun. The four types of 
cues used on test pages were the following 
(the actual cue as it appeared is shown in 
the square brackets): (a) contiguity cue— 
the noun preceding the target noun in the 
series [. . . came after cousin], (b) phonetic 
attribute—a word that rhymes with the target 
noun [. . . rhymes with thread], (c) graphic 
attribute—the first two letters of the target 
noun, with the remaining number of letters 
indicated by dots [. . . is spelled le--], (d) 
semantic attribute—a category into which the 
thing named by the noun falls [...is a 
chemical element]. 


Two aspects of the experimental design
were introduced to insure that inequalities in 
the effectiveness of the cues were not due 
to Ss being able to guess the word more 
easily from one cue than another: (a) By 
preliminary testing, a pool of nouns was es- 
tablished; each noun was associated with four 
cues that were appropriate for it but each of 
these cues elicited the noun as a first guess 
in a free guessing situation with a very low 
frequency. (b) As a further precaution, con- 
trol Ss were run who did not see the target 
nouns in the sequence at all but were in- 
structed to guess when they did not know 
the answer. The guessing probabilities as 
assessed from these control Ss were used to 
correct the data of experimental Ss. As is 
pointed out below, these procedures will only 
correct for the effects of “pure guessing.” 

There were two experiments. In Exp. I 
all cues in a given booklet were of the same 
type (e.g., contiguity) so that S could pre- 
dict in advance what type of cue would be 
given and presumably would use a homogene- 
ous encoding strategy, optimal for that cue, 
in each booklet. In Exp. II all four types of 
cues were used in each booklet; S could 
not predict in advance what type of attribute 
would be used to cue any specific noun and 
was expected to use a “neutral” or “mixed”
strategy.


Experimental Design 


Experiment I.—The Ss were given four
booklets in random order. In each booklet
only one type of cue was used. There were
eight tested items in each booklet, one at each
of the eight delays. In addition, to provide
the required delays between presentation
and testing, untested filler nouns were used.
These filler items were arranged into four 
frames, one for each booklet, which remained 
constant for all Ss. Each frame had slots 
at fixed positions in it for the presentation 
and testing of eight items, one at each of 
the eight delays used. Three of the frames 
had 81 untested nouns and the fourth had 80. 
An attempt was made to balance, across the 
four frames, the position in the frame at 
which any particular delay occurred. 

At the beginning of every frame there was 
a short series of presentations and tests of 
four or five items which were cued, at vary- 
ing short delays, by the same type of cue as 
was used in that booklet. These intro- 
ductory items were constant for all Ss and 
were not scored. They had two purposes: 
(a) to train Ss in the procedure and (b) to 
reduce the “primacy” of the earlier pre- 


sented target items. An attempt was made 
to eliminate cases of a training or filler item 
being a plausible response to any of the 
semantic, graphic, or phonetic cues used in 
the tests. 

Each group of four Ss had a new random 
assignment of items to delays in frames. 
Within each group the assignment of cues to 
frames was rotated. 

In calculating the delays, intervening tested 
items, filler items, and tests were all con- 
sidered as equal. This was done for two 
reasons: (a) The data of Tulving and Ar- 
buckle (1966) show that apart from the 
slight differences pointed out by the authors, 
the amounts of output and input interference 
are roughly the same. (b) Waugh and 
Norman (1965), in their analysis, treated 
presentations and tests as equally interfering 
and obtained consistent results across a 
variety of experiments. Delay, then, actually 
refers to the number of pages by which a test 
followed its target item in the booklet, re- 
gardless of what was on those pages. 

The booklets for control Ss were made in 
the same way except for the removal of some 
items. For the phonetic, graphic, and se-
mantic attribute conditions, the target item
was removed. For the contiguity cuing con- 
dition, the target item remained, but the 
item before it (the one used as the cue for it 
in the test) was removed. In this last case, 
control Ss had the same response pool to 
choose from as experimental Ss but had no 
information as to where the desired item 
was located. 

Experiment II.—This experiment was ex- 
actly the same as the above except that all 
four types of cue appeared in each booklet 
in a random order (with the limitation that 
each cue type be used twice in each booklet). 
Assignment of cue types to items, delays, 
and frames was balanced over groups of four 
Ss. The training items at the beginning of 
each booklet also contained all four cue types. 
There were no separate control Ss for Exp.
II.

Questionnaire.—After finishing the four
booklets, Ss in both experiments filled out a 
questionnaire which had two parts. The 
first enquired about encoding strategies and 
for awareness of the purpose of the experi- 
ment. The second asked Ss for definitions of 
the words used as test items. This was a 
check to see whether the semantic and pho- 
netic cues were relevant for each S individ- 
ually. 

Subjects.—College students unselected with
respect to age or sex served as paid Ss.
There were 92 in Exp. I and 100 in Exp. II.
In addition, there were 24 control Ss in Exp. 
I. The Ss were run in groups of two to 
six persons. 


RESULTS 


The raw data were the 32 responses 
of each S, 1 response at each of eight 
delays under four cuing conditions. 
These were screened by a comparison 
with the definitions given by S on the 
questionnaire for the 32 target items. 
If S’s response on the recall test was 
incorrect but his definition showed that 
E’s semantic or phonetic cue was in- 
appropriate, this response was not 
treated as an error but as “missing 
data.” For example, if the word “lead” 
was defined by S as “to show the way,” 
E’s semantic cue “chemical element” 
and phonetic cue “rhymes with thread” 
are inappropriate. If S answered with
a “wrong” word, but this word was 
deemed to satisfy the cue and was from 
the correct booklet, the delay between 
the item’s presentation and the test 
was calculated; if this delay was the 
same as one of the experimental delays, 
the item was treated as an item at that 
delay, and if there was no such experi- 
mental delay, the item was treated as 
missing data. “Missing data” was also 
recorded in cases where E had con- 
structed a booklet incorrectly, putting 
an item at the wrong delay. 

Analysis of variance was used on the 
data. However, individual S’s re- 
sponses were not used as the unit of 
analysis for two reasons: (a) Each S’s 
response was either correct or incor- 
rect at each interval, yielding a dichoto- 
mous dependent variable which does 
not satisfy either the requirements of 
continuity or normality for parametric 
statistics; and (b) there would be many 
empty cells because of missing data. 
Instead, “super-Ss” were created by 
adding the results of 12 consecutively 
run Ss together, for each Cue X Delay
condition. To do this, Ss with the
most missing data were discarded until 
84 remained in Exp. I and 96 remained 
in Exp. II. Then Ss were grouped by 
consecutive 12s into seven super-Ss in 
Exp. I and eight super-Ss in Exp. II. 
Each super-S could obtain a score 
ranging 0-12 on each delay for each 
cue—a reasonably continuous variable. 
The dependent variable was actually 
the proportion correct out of the total 
countable for each super-S in each 
Cue X Delay condition. 
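The pooling of individual Ss into super-Ss can be sketched as follows for a single Cue X Delay cell. The function name and the coding of responses (True for correct, False for incorrect, None for missing data) are assumptions made for this illustration.

```python
def super_subject_scores(responses, group_size=12):
    """Pool consecutive Ss into super-Ss for one Cue X Delay cell.

    `responses` holds one entry per S, in running order: True (correct),
    False (incorrect), or None (missing data; this coding is an assumption).
    Returns the proportion correct out of the countable responses for each
    complete group of `group_size` Ss.
    """
    scores = []
    for start in range(0, len(responses) - group_size + 1, group_size):
        group = responses[start:start + group_size]
        countable = [r for r in group if r is not None]
        # A group with no countable responses yields no score.
        scores.append(sum(countable) / len(countable) if countable else None)
    return scores


# Two hypothetical super-Ss: the first has one missing response.
cell = [True] * 6 + [False] * 5 + [None] + [True] * 12
print(super_subject_scores(cell))
```

With 84 retained Ss this grouping yields seven super-Ss, and with 96 it yields eight, matching the counts given in the text.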

Next, each of the raw scores ob- 
tained in the above manner was cor- 
rected for guessing by solving for 
P(true) in the following formula:


P(correct) = P(true) + [1 − P(true)] [P(guess)],     [1]


P(correct) being the observed propor- 
tion correct, P(true) the true prob- 
ability of knowing the item, and 
P(guess) the guessing probability for 
that cue, as estimated from control Ss. 
The guessing probabilities were .011 
for contiguity cues, .058 for graphic
cues, .035 for phonetic cues, and . ) 
for semantic cues. The means of the 
corrected probabilities for each delay 
and each condition of cuing are shown 
for the two experiments in Fig. 1. 
It should be noted here, parenthe- 
tically, that only a certain type of 
guessing effect (termed “pure guess- 
ing”) is eliminated by this procedure; 
it occurs only in cases where there is 
no contact between the cue and the 
specific trace of the to-be-remembered
item that was laid down at the time 
of its presentation. The assumption 
is that, even in such cases, S is free 
to generate a response that conforms 
to the description given in the cue 
alone, and hence will sometimes give 
a correct answer by this means. There 
is no implication made that the cor-
rection procedure has removed all

Fig. 1. Proportion of items correct (adjusted for guessing), as a function of delay
of test, for semantic, graphic, phonetic, and contiguity cues. Panels: Experiment 1,
one cue type per booklet; Experiment 2, mixed cuing in each booklet.


effects of different constraints by dif-
ferent types of cues in their inter-
action with the traces of the items.

Since the corrected scores are estimated
probabilities, the arc-sine transformation,
$2\sin^{-1}\sqrt{p}$, was applied to
all scores. An overall analysis of vari- 
ance done on the data for Exp. I 
showed significant effects of Cues, 
F (3, 18) = 14.69, p < .001, of De- 
lays, F (7, 42) = 36.42, p < .001, and 
Cue X Delay interaction, F (21, 126)
= 2.69, p<.01. Similarly, in Exp. 
II, there were significant effects of 
Cues, F (3, 21) = 50.17, p < .001, of 
Delays, F (7, 49) = 106.99, p < .001, 
and Cue X Delay interaction, F (21, 
147) = 1.68, p < .05. 
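
The arc-sine transformation applied before these analyses can be written out as a short Python sketch. The function name `arcsine_transform` is an assumption made for illustration; the formula itself, twice the inverse sine of the square root of the proportion, is the one named in the text.

```python
import math


def arcsine_transform(p):
    """Return 2 * arcsin(sqrt(p)) for a proportion p in [0, 1]."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie in [0, 1]")
    return 2.0 * math.asin(math.sqrt(p))


# The transformed scale runs from 0 (p = 0) to pi (p = 1).
transformed = [arcsine_transform(p) for p in (0.0, 0.25, 0.5, 0.75, 1.0)]
```

The transformation stretches the ends of the proportion scale, which is why it is commonly used to stabilize the variance of proportions before an analysis of variance.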


Since there is a Cue X Delay inter- 
action, separate analyses of variance for 
the effects of Cues were done for each 
delay point in Exp. I and Exp. II. 
The results are shown in Table 1. 
There seems to be no systematic change 
of the contribution of the cues variable 
per se with different delays.

Next, a Tukey test for multiple 
comparisons was done for the Cues 
effect in Exp. I, and separately in Exp. 
II. In each of the experiments, the 
contiguity cue was significantly differ- 
ent from the other three cues, but the 
other three did not differ significantly 
among themselves. However, one is 
not justified in concluding that the 
graphic, phonetic, and semantic attrib- 
TABLE 1

RESULTS OF ANALYSIS OF VARIANCE FOR
CUES AT EACH DELAY POINT
TAKEN SEPARATELY

               F Value
Delay     Exp. I       Exp. II
  1       4.35*        3.67*
  2       5.98**       6.13**
  3       1.92         5.16**
  6       2.97         28.43***
 12       6.78**       7.69**
 24       3.48*        9.24***
 48       5.76**       6.91**
 96       8.98***      4.09*

Note. For F values in Exp. I, df = 3 and 18; in
Exp. II, df = 3 and 21.
* p < .05.


ute cues had the same overall effects 
since there is a significant Cue X De- 
lay interaction in both Exp. I and II. 
In Exp. II, this appears to arise from 
the fact that the contiguity cue curve 
is dropping much more rapidly with 
delay than the others and reaching an 
asymptote near zero much earlier. 
This would account for the significant 
interaction of Cue X Delay. In Exp. 
I, on the other hand, the contiguity cue 
is not the only one whose efficacy rela- 
tive to the others changes with delay. 
If the contiguity cue is removed from 
the analysis, there is still a significant 
Cue X Delay interaction for the re- 
maining three cues, F (14, 84) = 
2.75, p < .005. This appears to come 
from a general tendency of the cate- 
gory cue to grow relatively better at 
longer delays, and for the graphic cue 
to become relatively worse. 

Next the question of whether the 
observed curves are simple decay 
curves or are a resultant of retrieval 
from two systems was examined. First 
of all, the data were plotted on semi- 
logarithmic coordinates. Rather than 
the straight lines that would be ob- 
tained from exponentials, the data of 
Exp. I and II yielded negatively 
accelerated curves. This curvature
could not be made to disappear by 
altering the estimate of asymptote to 
any reasonable value (i.e., to values 
between zero and the value of the curve 
at Delay 96). The curves, therefore, 
are not exponential decay functions.

Actually the retention curves can 
be fitted relatively well by the assump- 
tion of two memory mechanisms; as- 
sume (a) that information goes with 
probabilities P(STM) and P(LTM) 
into STM and LTM, respectively; (b)
that each system loses information 
exponentially, the asymptote of each 
being zero; and (c) the target item is 
retrieved from whichever system still 
contains it. 

The curves can be fitted as follows: 
First, the LTM function is estimated 
from Delays 24, 48, and 96, points 
which are clearly outside the range 
of STM. This is done by graphically 
fitting a straight line to these points 
on a semilogarithmic plot of the probability
of recall data. Then, to solve
for the STM component, Waugh and
Norman's (1965) equation is used:


R(i) = STM(i) + LTM(i) −
STM(i) LTM(i), [2]


where R(i) is the probability that an 
item with Delay i will be recalled, and 
LTM (i) is the probability of its being 
in LTM at Delay i, as estimated above. 
It is then possible to make a semilogarithmic
plot of these residual STM
points and graphically fit a straight 
line to obtain the estimated STM 
exponential forgetting curve. Since 
the estimated values for the probability 
of items being in STM and in LTM 
at all delays are now available, the 
fitted composite R(i) curve may be 
plotted using Equation 2. 
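
The fitting steps just described can be sketched in Python. This is only an illustration under stated assumptions: the original fits were made graphically, whereas the sketch substitutes an ordinary least-squares line on the semilogarithmic plot. All function names are hypothetical, and the data in the usage example are synthetic, not the experimental values.

```python
import math


def fit_log_line(delays, probs):
    """Least-squares fit of ln(p) = intercept + slope * delay (assumed stand-in
    for the graphical straight-line fit on semilogarithmic coordinates)."""
    logs = [math.log(p) for p in probs]
    n = len(delays)
    mean_x = sum(delays) / n
    mean_y = sum(logs) / n
    sxx = sum((x - mean_x) ** 2 for x in delays)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(delays, logs))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def exponential(intercept, slope):
    """Return the curve p(delay) = exp(intercept + slope * delay)."""
    return lambda delay: math.exp(intercept + slope * delay)


def residual_stm(recall, ltm):
    """Solve R = STM + LTM - STM * LTM (Equation 2) for STM."""
    return (recall - ltm) / (1.0 - ltm)


def composite(stm, ltm):
    """Equation 2: probability of recall from either system."""
    return stm + ltm - stm * ltm


def fit_two_factor(delays, recall, ltm_delays=(24, 48, 96)):
    """Fit the LTM curve to the long delays, then the STM curve to the residuals."""
    long_pts = [(d, r) for d, r in zip(delays, recall) if d in ltm_delays]
    ltm = exponential(*fit_log_line([d for d, _ in long_pts],
                                    [r for _, r in long_pts]))
    short_pts = [(d, residual_stm(r, ltm(d)))
                 for d, r in zip(delays, recall) if d not in ltm_delays]
    short_pts = [(d, s) for d, s in short_pts if s > 0]  # logs need positive values
    stm = exponential(*fit_log_line([d for d, _ in short_pts],
                                    [s for _, s in short_pts]))
    return stm, ltm


# Synthetic data generated from two known exponentials (illustrative only).
delays = [1, 2, 3, 6, 12, 24, 48, 96]
true_stm = lambda d: 0.8 * math.exp(-0.5 * d)
true_ltm = lambda d: 0.4 * math.exp(-0.005 * d)
recall = [composite(true_stm(d), true_ltm(d)) for d in delays]

stm, ltm = fit_two_factor(delays, recall)
fitted = [composite(stm(d), ltm(d)) for d in delays]
```

Each fitted curve has two parameters (an intercept and a slope on the log scale), which corresponds to the four estimated parameters per composite curve.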


Two-factor curves are shown with
the data of the two experiments in 
Fig. 2. The fit is reasonable consider- 
ing the variability in the curves. How- 
ever, because of the fact that each 
curve is based on four estimated param- 
eters (the initial probabilities and the 
decay rates of the STM and LTM 
components), the obtaining of an ade- 
quate fit is not too surprising. It 
simply shows that the present data 
are not inconsistent with the two-fac- 
tor mechanism proposed by Waugh and 
Norman. The only difference between 
the present findings and the results 
of these investigators is that in order 
to fit the data of the present experi- 
ments, the probability of recall out of 
LTM cannot be viewed as constant 
over the delay intervals used. That 


[Figure 2. Panels: Experiment 1, One Cue Type Per Booklet; Experiment 2, Mixed Cuing in Each Booklet. Vertical axis: proportion correct (adjusted for guessing); horizontal axis: delay (relative units).]

Fig. 2. Comparison of two-factor curves
with the observed proportion of items correct
(adjusted for guessing), as a function of
delay of test, for semantic, graphic, phonetic,
and contiguity cues. (Curves are horizontally
displaced for clarity; for actual
delay values, see Fig. 1.)

one must assume a decaying rather
than a constant LTM component is 
shown by an analysis of variance on 
only the 24-, 48-, and 96-item delay 
points. Recall at these delays is 
viewed as essentially free from the con- 
tribution of STM by Waugh and Nor- 
man; these LTM points taken alone 
show a highly significant effect of delay
in Exp. I, F (2, 12) = 7.37, p < .01,
and also in Exp. II, F (2, 14) = 7.37,
p < .01. (The identical F values are
coincidental.)

In Table 1 the F ratio of cue vari- 
ance to error variance is shown for 
each delay. In neither case does the 
cue component of variance seem to 
grow less in any clear-cut fashion with 
shorter delays. This suggests that it 
is not only retrieval from LTM that is 
influenced by the cue. If cues influ- 
enced retrieval from LTM only, the 
contribution of cues to variance should 
be less in the earlier delays where most 
of the recall is from STM. 


Discussion 


The most striking fact about the results 
is the general similarity of all the curves. 
Regardless of whether the cue is con- 
tiguity or some more meaningful feature 
of the item, most of the forgetting that is 
going to occur happens within a delay of 
six items, and then forgetting levels out. 
This general result confirms the idea that 
the difference between STM and LTM is 
in the nature of the holding mechanism, 
as proposed by Hebb (1949), and not pri- 
marily due to unequal decay rates of dif- 
ferent types of information, as hypothe- 
sized by Tulving (1966). 

There are, nonetheless, differences in 
the effectiveness of cues, primarily a dif- 
ference between the contiguity and at- 
tribute cues which is most marked (in 
terms of the ratio between the numbers 
of words reinstated by each) at long de- 
lays. There also appear to be some 
smaller differences among the attribute 
cues, especially when S can encode in the 
most advantageous manner for each type.
In this case, as might be expected, the 
semantic cue is best at long delays; but, 
surprisingly, the graphic cue is best with 
short delays. 

Why should an attribute cue be so 
much better than a contiguity cue? 
While it seems intuitively obvious that 
this should be the case because of the 
strong prior association between the at- 
tribute and the target item, upon closer 
examination one finds that there is no 
explicit theory to predict the observed re- 
sults. Although the attribute cue is pre- 
experimentally associated with the target 
item, it is also (and more strongly) asso- 
ciated with a large number of other items. 
Hence, the single presentation of the 
target item in the list has to raise the 
associative connection of that item with 
the attribute cue to such a high level that 
other items cannot interfere. On the 
other hand, in the case of contiguity
cuing, there is no significant preexperi- 
mental association between the cue and 
the item. The role of the presentation in 
this case is to raise an association from 
a negligible level of strength to some 
level sufficient for recall. Thus the prob- 
lem with the attribute cue is one of dis- 
crimination, whereas with the contiguity 
cue it is one of an original association. 
Why should the effects of a single presentation
facilitate discrimination more
than association formation?

It is quite important that one comes to 
understand the subtleties of the mech- 
anisms involved in retrieval via attribute 
cues because these sorts of cues are just 
simpler analogs of questions. A question 
is typically a network of propositions de- 
fining the attributes of the requested 
information. 
CATEGORY SCALES AND CONTRAST EFFECTS
WITH LIFTED WEIGHTS¹


JOHN ROSS AND VINCENT DI LOLLO


University of Western Australia 


Different series of weights were presented by the method of single 
stimuli. A 2 X 2 X 2 design used preshift level, postshift level, and
number of absolute categories of judgment as independent factors. 
The postshift judgments exhibited the usual contrast effects, the 
same series receiving “heavier” judgments after a lighter than after 
a heavier preshift series. Although the preshift judgments were 
independent of the stimulus level for the 2-category scale, the heavier 
preshift series received “heavier” judgments on the 6-category scale. 
Since category boundaries were shown to be relativistically determined, 
an irreducible confounding with contrast effects was shown to occur 
with the use of category estimation methods. 


Category scaling methods appear to 
show a contrast effect with a shift in 
range of lifted weights. The Ss 
shifted from a heavy to a light range 
give lower mean category ratings than 
light-range controls, and Ss shifted 
from a light to a heavy range give 
higher ratings than heavy-range controls.
There is also a decrease in the
difference between the shifted group
and controls with trials after shift (Di
Lollo, 1964). The “undershoot” and
“overshoot” effects have been interpreted
as evidence of perceptual contrast,
i.e., a change in the perceived
magnitude of the stimulus objects. On 
the other hand, it has been claimed that 
such effects are due to “labeling” and 
do not reflect any genuine perceptual 
contrast (Stevens, 1958). The work 
of Parducci (1965) on the relation of 
the location of category boundaries to 
the distribution of weights within a 
range suggests that category labeling 
is not absolute, and lends force to the 
argument that the relative positioning 
of category systems may underlie the 


1This research was supported by the Air 
Force Office of Scientific Research, Office 
of Aerospace Research, United States Air 
Force, under AFOSR Grant No. AF- 
AFOSR-968-66 to the second author. 


differences in category ratings observed 
under conditions of shift. 

The aim of the present study was to 
compare the effects of shifts on the 
rating of lifted weights, using a two- 
category and a six-category response 
scale, and to use the results to ex- 
amine the question of category rating 
as a technique to detect perceptual con- 
trast effects. 


METHOD 


Subjects—The Ss were 88 boys, 16-17 yr. 
old, who were randomly selected from the 
fourth-year high school population in the 
metropolitan area of Perth, Western Australia.

Apparatus—The stimulus objects con- 
sisted of two series of five weights each. 
The light series (L) ranged from 100 gm. to 
300 gm. in steps of 50 gm., and the heavy 
series (H) ranged from 400 to 600 gm., also 
in steps of 50 gm. The weights consisted of 
4-oz. polyethylene containers filled with lead 
shot and paraffin wax to the amount required. 

Procedure—The Ss were assigned to two
sets of 44 Ss each. One set judged the 
weights using a two-category response scale, 
and the other a six-category scale. The Ss 
in the two-category set were allocated to 
four groups of 11 Ss each. At the begin- 
ning of a session S was seated at a table and 
asked to put his hand through a curtain 
hiding the weights from his view. Then E 
read the following instructions: 
You will be presented with a series of
weights. The weights will be presented
one at a time. Each time I put a weight in
your hand I would like you to lift it once 
and then put it down. You should lift the 
weight with a motion of the hand and wrist 
only, leaving your forearm resting on the 
table. After you put the weight down, I 
would like you to tell me your judgment of 
the heaviness of that weight. If, in your 
opinion, the weight felt heavy, just say 
“heavy”; if it felt light, just say “light.” 
This is all I want you to do. 


The experiment consisted of 10 preshift 
trials followed by 10 postshift trials. On 
each trial S judged all five weights in the 
relevant series, randomly presented. The 
random order of presentation was the same 
for all Ss. There were four conditions in 
all: H-H, H-L, L-L, L-H (where the first 
letter refers to the preshift series of 
weights and the second to the postshift 
series). One group of Ss was assigned to 
each of these conditions. The experiment 
proceeded smoothly from the preshift to 
the postshift stage without any interruption 
or warning to S. 

The Ss in the six-category set were allo- 
cated to four groups of 11 Ss each, one 
group being assigned to each of the four 
conditions H-H, H-L, L-L, L-H. Pro- 
cedures were the same as for the two- 
category groups except for the last three 
sentences of the instructions which were 
replaced by the following: 


After you put the weight down, I would 
like you to tell me your judgment of the 
heaviness of that weight by naming one of 
the following six categories: very light, 
light, medium light, medium heavy, heavy, 
very heavy. These categories are listed 
on the sheet placed on the table just by 
your left elbow; would you read them 
to me? 


The sheet with the list of categories remained
in full view of S throughout the
experiment.


RESULTS AND DISCUSSION


Figure 1 presents the trial-by-trial
mean ratings given by each group in
the two-category and the six-category
conditions. In the two-category case
ratings are treated as 0 or 1, and in
the six-category case as 0, 1, 2, 3, 4,
or 5, the lowest category in each case
being 0. The effects normally taken as 
evidence of perceptual contrast are ob- 
servable in both cases. On the post- 
shift trials Groups L-H overshoot 
their controls (H-H) and Groups 
H-L undershoot their controls (L-L). 
Both conditions also show the progres- 
sive decline in overshooting and under- 
shooting over trials reported by 
Di Lollo (1964) in a study with nine 
categories. 

There is a marked difference be- 
tween the two conditions with respect 
to the behavior of the control groups 
over all trials and the experimental 
groups prior to shift. In the two-cate- 
gory condition the control groups do 
not differ, nor do the experimental 
groups differ prior to shift. In the 
six-category condition there is a sub- 
stantial difference between the level of 
the control groups, with the experi- 
mental groups closely following the 


appropriate control during the preshift
phase. The results thus show that preshift
judgment levels are determined
by an interaction between the heaviness
of the stimulus series and the
number of judgmental categories.


Fig. 1. Mean scale ratings for each group
in the two-category and the six-category
conditions.

Figure 2 presents the ratings of the 
shift groups only for individual weights 
in the appropriate series, averaged over 
Preshift Trials 6-10 and Postshift 
Trials 16-20. When pre- and post- 
shift ratings are compared, it can be 
seen that the top of Series L overlaps 
the bottom of Series H very considera- 
bly for Groups L-H and H-L in both 
the two- and the six-category condi- 
tions. The overlap is greater for the 
two-category condition. 

If average rating is a procedure-free 
indication of judged weight, as it has 
often been assumed, the results of Fig. 
2 would show that the top weights of 


Fig. 2. Mean preshift and postshift scale
ratings for the shifted groups and each
weight in the two-category and six-category
conditions.


Series L are judged to be heavier than
the bottom weights of Series H both 
in cases of upward (L-H) and down- 
ward (H-L) shift. It is impossible 
to reconcile this conclusion with the 
interpretation of Fig. 1 as evidence of 
an upward displacement of apparent 
weight within Series H following a 
shift from Series L, and of a corre- 
sponding downward displacement in 
the case of an opposite shift. The in- 
terpretation of average rating as a 
procedure-free indication of judged 
weight leads to a paradox and must be 
discarded. 

All the results can readily be ac- 
commodated by reasonable assumptions 
about the effect of experimental pro- 
cedures on the positioning of category 
boundaries. 

A two-category system requires one 
category boundary, a six-category sys- 
tem requires five. As the preshift and 
control data in Fig. 1 show, the single 
boundary in the two-category condition 
is set to discriminate within the initial 
series, scarcely being affected by abso- 
lute weight. The comparison of the 
two preshift series in Fig. 2 substan- 
tiates this observation. The category 
rating for the median weight of preshift 
Series L (200 gm.) is .76, and for the 
median weight of preshift Series H 
(500 gm.) is .80. Assume that on 
postshift trials the category boundary 
initially remains at the preshift level 
but changes location with experience 
of the postshift series to convey some 
discriminative information about the 
weights in the postshift series. This 
assumption is consistent (a) with the 
relativistic location of the boundary
prior to shift, (b) with the extreme
average ratings on the first postshift
trial (.98 for Series H and .09 for 
Series L), and (c) with a drift toward 
the control levels on subsequent post- 
shift trials. 
The results for the six-category system
can be understood in a similar
fashion by assuming that the initial 
locations of the five boundaries defin- 
ing the six categories are determined 
partly on relative (or discriminative) 
considerations and partly to convey 
absolute judgments of the series as a 
whole. The difference between the 
control groups throughout and between 
the shift groups on Trials 1-10 indi- 
cates the effect of absolute considera- 
tions, but the overlap of the judgments 
for Series L and Series H prior to 
shift (Fig. 2) indicates the effect of 
relative considerations. Once again 
the results support the assumption that, 
following a shift, the category boun- 
daries remain initially where they were 
before the shift but are repositioned to 
convey discriminative information. 
There is, as in the case of two cate- 
gories, an extreme rating on the initial 
trial (4.9 for Series H and 1.2 for 
Series L) and a drift toward the con- 
trol levels on subsequent trials. 

On the basis of these results it cannot
be concluded that perceptual contrast
effects do not occur, but only that category
scaling methods make it impossible
to detect them. Williams, Ross,
and Di Lollo (1966) have reported re- 
sults of a recognition task indicating 
that perceptual contrast effects are gen- 
uine. If category boundaries are set 
relatively, as the results indicate, then 
no matter how small the relativistic de- 
termination, there will be an irreduci- 
ble confounding with perceptual con- 
trast effects whenever category esti- 
mation methods are used. 
STIMULUS AND RESPONSE INTERFERENCE IN
RECOGNITION-MEMORY EXPERIMENTS¹
AND
Harvard Medical School


An analytical technique for determining the effects of stimulus- and 
test-list interference on the strengths of recent memory traces is de- 
scribed and demonstrated. The probability that an item will be 
recognized as one that recently occurred depends on the amount of 
interference attributable to the items that follow it in a study list,
and the items and responses that precede it in a test list. The exact
relationship between the 2 types of interference seems to depend on
various details of the experimental procedure used.


The act of retrieving material from 
memory may interfere with further re- 
trieval. Either assimilating information 
or retrieving it from memory may in- 
terfere with the retention of other 
material stored there. The effect of an 
item’s serial position in a study list on 
its probability of being recalled has 
been described many times. It is now 
well known, moreover, that this probability
can also depend on its position in
a test list, that is, on how many responses
precede its attempted recall
(Mackworth, 1962; Tulving & Arbuckle,
1963; Waugh & Norman,
1965). Recent developments in strength
models of memory now make it possible
to assess the exact relationship
between interference caused by stimuli 
presented by E and by responses made 
by S. 

A variety of experiments on imme- 
diate recall have been found to yield 
similar retention functions, after cor- 
rection for the effects of long-term 


1This research was supported in part by 
Research Grants MH-08083-02 and MH- 
08119-01 from the National Institute of 
Mental Health, United States Public Health 
Service, to Harvard University, Center for 
Cognitive Studies, and to Harvard Medical 
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Thomas Kreilcamp, and Marta Weigle as- 
storage (Waugh & Norman, 1965).
The results of most experiments on 
short-term memory can be considered 
to reflect the activity of two systems: 
a temporary storage system, primary 
memory (PM), and a more permanent 
secondary memory (SM), entered from 
PM by means of rehearsal. 

In conventional tests of memory with 
the recognition method, Ss are exposed 
to a list of stimulus items for one or 
more trials and are then presented a 
test list. The test list is composed of 
a mixture of items, some of which are 
old—having just occurred in the origi- 
nal stimulus list—and some of which 
are new—not having been seen pre- 
viously by Ss within the context of the 
experiment. The Ss are expected to 
work through the test list and decide 
whether or not each item was presented 
in the original list. If the process of 
testing can interfere with recent mem- 
ory traces, then conclusions drawn from 
such tests of recognition may be mis- 
leading. The results may be deter- 
mined by factors not taken into account 
by E, such as the length of the test list 
or a given item’s serial position within 
it. Thus, conclusions based on average 
recognition scores may be seriously in 
error.²


2 Rather than list the studies of recognition 
memory in which a long uncontrolled re- 




It is now possible, however, with the 
aid of existing theories of memory, to 
assess the role of response interference 
in a quantitative fashion. The purpose 
of this study is to demonstrate how this 
can be done. We do not propose any 
new theory here, but rather we wish 
to show how a slight modification of 
existing strength models of memory 
can be used to compare the effects of 
response- and stimulus-list interference 
in two new experiments in recognition. 


THEORY 


The theoretical background has been
described in three separate studies: a
general description of recall experi- 
ments (Waugh & Norman, 1965), a 
formal model for recognition-memory 
experiments (Wickelgren & Norman, 
1966), and an extension and test of 
the recognition model (Norman, 1966). 
Basically we assume the following: 
When an item is presented in a stimu- 
lus list, it receives some initial strength 
in PM, and, with the attention given 
to it between its presentation and the 
occurrence of the next item, it also 
receives some initial strength in a more 
permanent memory, SM. The present- 
ing or testing of items other than the 
one under consideration (the critical 
item) causes a decrease in the latter’s 
memory strength. The total memory 
strength of this item is given by the 
additive combination of its strengths 
in SM and PM. 
More formally, consider the strengths
in primary (P) and secondary (S)
memory of an item presented in position
s of a stimulus list of length L and
recalled in position t of the test list:
P(s, L, t) and S(s, L, t).

Initial strength—The initial strength
of items before they have been presented
is zero in both PM and SM:
P(0, L, t) = S(0, L, t) = 0.


sponse list was used, we simply refer the
reader to a typical study (Pollack, Norman,
& Galanter, 1964).


Decay in Primary Memory 


Stimulus list—Presentation of a
stimulus item different from the item
presented in position s causes the
strength of that item in PM to decrease
to some proportion $\phi$ of its previous
strength:


$P(s, k, t) = \phi P(s, k - 1, t)$, $0 < \phi < 1$.

Test list—Retrieval (or test) of an
item causes the strength of all other
items to decrease to some proportion $\theta$
of their respective strengths:


$P(s, L, t) = \theta P(s, L, t - 1)$, $0 < \theta < 1$.

Decay in Secondary Memory

No decay occurs in SM during the
course of experiments of immediate
memory, either during presentation of
other stimulus items or during testing.


Acquisition—Presentation of an item
causes its strength to increase to $\alpha$ and
$\sigma$ in PM and SM, respectively. These
strengths can be measured if the item
is tested immediately:


$P(s, s, 1) = \alpha$, $0 < \alpha$
$S(s, s, 1) = \sigma$, $0 \le \sigma$
Total memory strength.—
$M(s, L, t) = P(s, L, t) + S(s, L, t)$.


Solving these conditions on memory 
strength is a straightforward exercise 
and will not be detailed here. The result
is that:


$M(s, L, t) = \alpha \phi^{L-s} \theta^{t} + \sigma$. [1]


For a direct comparison of the effects 
of stimulus- and test-list interference, 
it is convenient to express the inter- 
fering effect of a test in stimulus units. 
It is always possible to find a number k
such that $\theta = \phi^{k}$. Substituting into
Equation 1, we have

$M(s, L, t) = \alpha \phi^{i} + \sigma$, [2]


where $i = L - s + kt$. With this formulation
it is possible to say of the
results of an experiment: each test has 
the interfering value of k stimuli. If it 
turns out that position in the test list 
is not a factor in performance, k will 
equal zero. If a test has the same 
interfering effect as a stimulus, k will 
equal 1. 

Equations 1 and 2 each have four
parameters, but only two of them, $\phi$
and $\theta$ (if we use Equation 1) or k and
$\phi$ (if we use Equation 2), will be of
any importance in the analyses that
follow. 
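
As an illustration of Equation 2, the following Python sketch computes total memory strength from the number of interfering units, i = L - s + kt, and converts a per-test decay proportion into its stimulus equivalent k. The function names and the parameter names `alpha` (acquired PM strength), `phi` (proportion retained per stimulus), `theta` (proportion retained per test), and `sigma` (SM strength) are labels chosen for this sketch, and the numerical values are made up.

```python
import math


def interference_units(s, L, t, k):
    """Number of stimulus-equivalent interfering units, i = L - s + k * t."""
    return (L - s) + k * t


def memory_strength(s, L, t, alpha, phi, k, sigma):
    """Equation 2: a PM component decaying geometrically in i, plus a constant SM term."""
    return alpha * phi ** interference_units(s, L, t, k) + sigma


def k_from_theta(theta, phi):
    """Find k such that theta = phi ** k."""
    return math.log(theta) / math.log(phi)


# Hypothetical values: one test interferes as much as two stimuli.
k = k_from_theta(0.81, 0.9)
strength = memory_strength(s=30, L=30, t=1, alpha=2.0, phi=0.5, k=1.0, sigma=0.5)
```

With k = 0 the strength no longer depends on test position, and with k = 1 a test and a stimulus interfere equally, which matches the two special cases noted in the text.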

One more theoretical statement is 
necessary due to the probabilistic na- 
ture of data obtained in recognition 
experiments. Because Ss do make 
false recognitions, we must assume two 
distributions of strengths, one for items 
that were presented in a stimulus list 
(old items), and another for items that 
were not (new items). A number of 
recent studies of recognition memory 
indicate that it is not unreasonable to 
assume that these distributions are nor- 
mal with unit variance (Egan, 1958; 
Murdock, 1965; Norman & Wickel- 
gren, 1965; Wickelgren & Norman, 
1966). In order to make this assump- 
tion consistent with the hypothesis that 
strengths in PM and SM are additive, 
we could assume that strengths in PM 
are distributed normally with unit vari- 
ance, while strengths in SM are con- 
stant (follow point distributions). An 
alternative assumption, one completely 
equivalent for the purpose of these 
analyses, is to assume strengths in both 
PM and SM are algebraic (are given 
by point distributions) and that statisti- 
cal deviations enter the recognition 
process as normally distributed noise
occurring in the decision process.
Either assumption gives the same result,
namely, that the strengths for both
old and new items are normally dis- 
tributed with unit variance. Our mea- 
sure of the strength of an old item will 
be the separation of the two distribu- 
tions in units of the standard deviation. 
This measure is equivalent to the d’ 
of signal detection theory (Green & 
Swets, 1966). 
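
Under the equal-variance normal assumption just stated, this separation can be computed from a recognition rate and a false-recognition rate as the difference of their normal deviates. The Python sketch below is an illustration, not the authors' computing procedure; the function name `d_prime` is an assumption. The example uses the first cell of Table 1 (66% correct recognitions, 25% false recognitions), for which the table lists a strength of 1.08.

```python
from statistics import NormalDist


def d_prime(p_recog, p_false):
    """Separation of old and new distributions in standard-deviation units."""
    z = NormalDist().inv_cdf  # inverse of the standard normal distribution function
    return z(p_recog) - z(p_false)


# First cell of Table 1: p(recog) = .66, p(false recog) = .25.
strength = d_prime(0.66, 0.25)
```

The result is close to the tabled 1.08. Proportions of exactly 0 or 1 have no finite normal deviate, so `inv_cdf` raises an error for them; this may be why the cells with 100% recognition show a dash in place of a strength.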


Recognition 


The response strategy attributed to 
S in recognition experiments is a sim- 
ple one: he picks some criterion 
strength; if the strength of the test item 
in his memory exceeds the criterion, he 
says that he recognizes it (responds 
“yes”), otherwise he does not (re- 
sponds “no”). Various theories exist 
as to how S selects a criterion strength ; 
they are not necessary for our purposes 
and will not be considered here. 
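
The criterion rule can be made concrete with a short Python sketch, assuming unit-variance normal variability around the item's strength as described in the Theory section. The function name `p_yes` and the criterion value are hypothetical; new items are assigned a mean strength of zero in keeping with the initial-strength condition.

```python
from statistics import NormalDist


def p_yes(strength, criterion):
    """Probability that a unit-variance normal strength exceeds the criterion."""
    return 1.0 - NormalDist().cdf(criterion - strength)


criterion = 0.5                        # hypothetical criterion chosen by S
p_hit = p_yes(1.5, criterion)          # old item with mean strength 1.5
p_false_alarm = p_yes(0.0, criterion)  # new item with mean strength 0
```

Moving the criterion changes both probabilities together, while the difference between their normal deviates stays equal to the strength of the old item.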


EXPERIMENT I 
Procedure 


A stimulus list of 30 unrelated mono- 
syllabic words was read to Ss at a rate of 
one word per second. A high-frequency 
tone followed the last word and was in turn 
followed by a test list, consisting of 20 
monosyllabic words read at a rate of one 
word every 2 sec., of which one-third were 
old and two-thirds new, on the average. The 
S’s task was to denote whether he thought 
each test item was old or new by marking 
the appropriate column on an answer sheet.
The lists were prerecorded and presented by 
means of a tape recorder and high-quality 
loudspeaker. 

Words in Serial Positions 1, 2, 4, 8, 20, 26, 
27, 28, 29, and 30 on the stimulus list were 
tested in Positions 1 through 6, 8, 10, 15, 
and 20 of the test list. Each combination 
of stimulus- and test-list positions was tested 
three times so that a total of 45 lists was 
presented. The experiment was performed 
twice, with different Ss and a different 
ordering of the lists: the stimulus lists were 
the same for both groups of Ss, but the test 
lists were different. A practice list and test 
preceded each session. The words were 
selected from Waugh’s (1961) collection and 
TABLE 1
FREQUENCIES OF RECOGNITION AND MEMORY STRENGTHS FOR EXP. Iᵃ

Position on                 Position on Stimulus List (s)
Test List (t)     1     2     4     8    20    26    27    28    29    30     p(false recog)ᵇ
66 54 62 64 73 7 97 97 |100 |100 25 
: 1,08) .78| .98| 1.04] 1.28] 2.56] 2.56] 2.56] — — 

2 8 75 50 41 58 97 97 7 96 97 16 
1.76| 1.66] .99) .76| 1.19| 2.87) 2.87| 2.12| 2.74| 2.87 

3 71 5 53 64 64 90 86 94 95 19, 
1.44| 1.26| .96| 1.24| 1.24| 2.16| 1.96] 1.96| 2.43| 2.52 

4 71 3 52 65 80 76 88 89 89 12 
1.73| 1.25| 1.43| 1.22] 1.56| 2.02] 1.88] 2.35| 2.40] 2.40 

5 73 49 51 68 78 79 83 7 19 
1.49| .86] 114| | 1.35| 1.65| 1.96| 1.68| 1.83| 2.01 

6 73 54 1 53 53 62 9 68 68 3 13 
1.74| 1.23| 1.16| 1.20| 1.20| 1.44] 1.64] 1.60| 1.60) 1.74 

8 2 8 9 42 50 67 56 7 17 16 
1.30| 1.46| .96| .79| .99| 1.43| 1.50] 1.14] 1.43] 1.73 

10 3 0 52 62 55 51 8 47 6 59 11 
1.56| 1.23| 1.28| 1.54| 1.36| 1.26] 1.70] 1.16] 1.38] 1.46] 

15 65 9 3 68 62 56 59 1 47 17 
1.34| 92| 1.02| 1.42] 1.05| 1.26| 1.10] 1.18| 1.23 .88 

20 9 63 9 52 53 49 50 9 69 15 
1.27} 1.37] 76| 1.09| 1.12| 1.02] 1.14] 1.04] 1.02] 1.54 


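* Top number of each entry is p(recog); bottom number is M(s, 30, t).
Relative frequencies have been multiplied by 100 to eliminate the decimal point.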


were carefully processed to make the lists 
as homogeneous as possible.

A total of 39 Ss participated in the experi- 
ment, 19 in the first session, and 20 in the 
second. All were Harvard University
undergraduates and were paid for their services.


Results 


The results are summarized in Table 
1, which shows the proportion of words 
correctly recognized for each combina- 
tion of stimulus- and test-list positions, 
along with the proportion of words 
falsely recognized at these test-list posi- 
tions. The recognition rate is more or 
less invariant with test position for 
items that appear early in the stimulus 
list, but decreases systematically with 
test-list position for items at the end 
of the stimulus list. This interaction is
illustrated in Fig. 1A and 1B. The curves
in Fig. 1A show the relative frequency 
of correct recognition, p(recog), as a 
function of stimulus-list position,
estimated across Test-List Positions 1
through 5 and 6 through 20, respec- 
tively. These data indicate that early 
items in a list are recognized somewhat 
more frequently (but not significantly
so) when they are tested early rather
than late. Late items, on the other 
hand, benefit dramatically from early 
testing. 

Figure 1B illustrates recognition as 
a function of test-list position. The
two curves show the relative frequency 
of correct recognition as a function of 
test-list position (t), estimated across 
early original list positions (s=1, 2, 
4, 8, 20) and late ones (s = 26 through 
30), respectively. 

It is quite evident that p(recog) is
essentially independent of test-list
position for the early items on the
stimulus list. For items that appear
late on the stimulus list, p(recog) is
a decreasing function of test-list
position (t), approaching an asymptote
at around t = 10. For t > 10, p(recog)
seems to be independent of both
stimulus and test positions.

FIG. 1. Proportion of words recognized as
a function of serial position in the stimulus
list (s) and test list (t). (The two upper
curves in each figure show the proportion
recognized for early and late positions in the
test list, Fig. 1A, and early and late positions
in the stimulus list, Fig. 1B. The proportion
of false recognitions as a function of test-list
position is also shown in Fig. 1B.)


Strength Measures 


Table 1 also shows the recognition 
scores transformed into strength scores 
by entering the correct and false
recognition probabilities p(yes | old) and
p(yes | new) into tables of d’ (Elliott,
1964). (Note that the two positions 
with relative recognition frequency of 
1.00 could not be transformed.) Ac- 
cording to the present model (Equation 
1), an item’s memory strength should 


be an exponentially decreasing function 
of its position from the end of the 
stimulus list and its position in the test 
list, approaching an asymptote as the 
separation between its initial presenta- 
tion and test becomes very large. 
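
The transformation from recognition proportions to strength scores can be illustrated with a minimal sketch. It assumes the usual equal-variance signal-detection definition, d' = z(hit rate) - z(false-recognition rate); the text obtains its values from printed tables, so small rounding differences are to be expected. The function name `d_prime` is introduced here for illustration only.

```python
from statistics import NormalDist


def d_prime(p_hit: float, p_false: float) -> float:
    """Equal-variance d': z(p_hit) - z(p_false).

    Proportions of exactly 0 or 1 have no finite z-score, which is why
    cells with a recognition frequency of 1.00 cannot be transformed.
    """
    if not (0.0 < p_hit < 1.0 and 0.0 < p_false < 1.0):
        raise ValueError("proportions must lie strictly between 0 and 1")
    z = NormalDist().inv_cdf  # inverse of the standard normal CDF
    return z(p_hit) - z(p_false)


# A hit rate of .97 with a false-recognition rate of .25
print(round(d_prime(0.97, 0.25), 2))  # 2.56
```

A proportion of 1.00 raises an error in this sketch rather than returning an infinite strength, mirroring the two untransformed cells mentioned above.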

It is possible to analyze the results 
in two ways: by a rough graphical 
analysis which appeals to intuition, and 
by a more thorough iterative least- 
squares analysis. Considered first is 
the graphical analysis. 

We can estimate the asymptotic level 
of performance from the mean-strength 
value of conditions where we do not 
expect retention in PM to attain any 
appreciable magnitude, that is, where 
the separation between presentation 
and test exceeds 10 items, or where 
30 − s > 10. These values of s give us
an estimate for λ of 1.22.

The interference caused by test items
is assessed by examining a given item's
decay in PM strength as its position
on the test list increases. Similarly,
interference caused by stimulus items
is tested by examining the decay in
PM strength at fixed test positions for
various stimulus positions. The
comparison is shown in Fig. 2. PM
strength, P(s, 30, t), has been
estimated by subtracting the estimate for
λ, 1.22, from the strength values of
Table 1. To reduce the variability of
the data, the points shown in Fig. 2
have been obtained by averaging across
Stimulus Positions 26 through 30 (for
the analysis of test-list interference)
and across the first five test positions
(for the analysis of stimulus-list
interference). Averaging can be justified
only if the effects of stimuli and those
of tests can each be described by
exponential functions with the same
decay parameters, such that θ = φ. That
this is so is shown by the fact that in
the iterative least-squares analysis to
be described below no averaging was
done, yet the results are very similar
to those shown in Fig. 2.

FIG. 2. The results of Exp. I, showing decay in PM strength as a function of test-list
interference (t) for late stimulus items (left-hand curve) and decay in strength as a
function of position from the end of the stimulus list (30 − s) for early items (right-hand
curve). (The slopes of the curves are almost identical.)

A least-squares fit to the points of
Fig. 2 (i.e., to the logarithms of the
PM strength values) gives an estimate
of .76 for θ (the parameter for
test-list interference) and .78 for φ (the
parameter for stimulus-list
interference). The fact that these data are
well described by straight lines with
similar slope on the semilogarithmic
plot indicates that in this experiment
a stimulus and a test exert a similar
interfering effect on memory.
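
The graphical fit amounts to an ordinary least-squares line through the logarithms of the PM strengths. The sketch below assumes PM strength falls off as scale × rate^position, which is what a straight line on a semilogarithmic plot implies; the data are hypothetical and noise-free, not the paper's values, and the function name is illustrative.

```python
import math


def fit_exponential_decay(positions, strengths):
    """Fit strength = scale * rate**position by least squares on ln(strength).

    Returns (scale, rate). All strengths must be positive.
    """
    xs = [float(x) for x in positions]
    ys = [math.log(s) for s in strengths]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                      # slope of the semilog line
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), math.exp(slope)


# Hypothetical, noise-free PM strengths (not the paper's data).
test_positions = [1, 2, 3, 4, 5, 6, 8, 10]
pm_strengths = [4.0 * 0.75 ** t for t in test_positions]
scale, rate = fit_exponential_decay(test_positions, pm_strengths)
print(f"scale = {scale:.2f}, decay rate = {rate:.2f}")
```

The exponential of the fitted slope is the per-item decay rate, so two curves with similar slopes on the semilog plot correspond to similar decay parameters.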

The data have also been analyzed by
a more exact procedure. A computer
program continually evaluated
Equation 2, comparing predicted with
measured memory strength for the 98
points shown in Table 1 as it adjusted
the four parameters of the equation so
as to minimize the sum of the squared
deviations of the predicted from the
obtained values. The iterative
procedure used a combination of a
gradient method (the method of steepest
descent) and a “cuboid” method (a
gridding technique), stopping when
the parameter estimates were accurate
to three significant figures. The
estimates of the four parameters of
Equation 1 obtained in this way are α =
3.95, θ = .739, φ = .741, and λ = 1.20.
In terms of Equation 2, k = 1.01.
Thus, in this experiment, the decay of 
strength for a memory trace is affected 
in a very similar way by each stimulus 
item and each test. 
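
The value of k can be recovered from the two decay estimates. Equation 2 is not reproduced in this part of the text, so the sketch below assumes it expresses the test-list decay parameter as the stimulus-list decay parameter raised to the power k, which gives k as a ratio of logarithms. The function name is hypothetical.

```python
import math


def interference_ratio(test_decay: float, stimulus_decay: float) -> float:
    """Return k such that test_decay == stimulus_decay ** k.

    Assumes both decay parameters lie strictly between 0 and 1.
    """
    for value in (test_decay, stimulus_decay):
        if not 0.0 < value < 1.0:
            raise ValueError("decay parameters must lie between 0 and 1")
    return math.log(test_decay) / math.log(stimulus_decay)


# Exp. I least-squares estimates: .739 (test list), .741 (stimulus list)
k = interference_ratio(0.739, 0.741)
print(round(k, 2))  # 1.01
```

A k near 1 means one test item costs about as much PM strength as one stimulus item.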


EXPERIMENT II 


It is possible that the equivalence
between stimulus- and test-list interference
found in Exp. I may reflect details of 
experimental procedure, rather than a 
basic property of recognition experi- 
ments in general. To test this possibil- 
ity, a second recognition experiment 
was designed which, although very 
similar to Exp. I, differed in one im- 
portant respect. The stimulus items in 
Exp. I were selected from a large set 
of alternatives, and Ss were never ex- 
posed to the same words in more than 
one list. In Exp. II a limited set of
stimulus words was used; Ss had
previously memorized this set, and the
entire set was used in every pair of
stimulus- and test-list presentations.
The tests in this experiment
emphasized either stimulus- or response-list
interference. Tests of varying mix- 
tures of stimulus- and test-list inter- 
ference were also included to minimize 
the likelihood that Ss would notice the 
selection of conditions and establish 
idiosyncratic strategies. The positions 
tested are shown in Table 2. 


Procedure 


The basic procedure was similar to Exp. I. 
The stimulus list consisted of 16 monosyllabic 
words and the test list of 10 words. Exactly 
2 words on the test list were “old,” having 
just occurred on the stimulus list. The 
words came from a specially selected set of 
24 three-letter words. Each word started 
with a different letter of the alphabet (except 
for Q and X), and Ss were required to mem- 
orize the list before the start of the experi- 
ment. At least 24 hr. elapsed between the 
learning of the lists by Ss and the first day 
of experimentation. At the start of each ex- 
perimental day Ss were tested for their 
ability to recall all 24 words. 

Two rates of presentation were used. In 
one condition the stimulus words were read 
at the rate of one word every second and the
test words at the rate of one test every 2 sec. 
In the other condition both stimulus and test 
words were read at the rate of one word 
every 1.5 sec. Within a given session, all 
lists were read at the same rates. The varia- 
tion in rates is of no interest for present 
purposes, and it will therefore be ignored in 
the analyses which follow. The S’s task was 
to denote whether he recognized the test item 
as having occurred in the preceding stimulus 
list by pushing the appropriate key (marked 
“yes” or “no”) on a portable card punch 
which was placed on his desk. The lists were 
prerecorded and were presented by means 
of a tape recorder and a high-quality loud- 
speaker system. 

Ten Harvard undergraduates (male and 
female) served as Ss. Each condition was 
tested by 10 different lists. The Ss were 
divided into two groups of five, each group 
receiving different orders of presentation of 
the various conditions. 


Results 


The results are summarized in Table 
2, which shows the proportion of words 
correctly recognized for the relevant 
stimulus- and test-list positions, along 
with the proportion of words falsely 
recognized at these test positions. 
Memory strength, estimated in the 
same way as in Exp. I, is also shown 
in Table 2. 

The comparison of the effects of 
stimulus- and test-list interference was 
carried out by both the graphical and 
the least-squares iterative procedures 
described previously. The results of 
the graphical procedure are shown in 
Fig. 3. The effects of pure test inter- 
ference are seen by examining the 
recognition score for the last stimulus 
position (s=16) with variable test 
positions. Similarly, pure stimulus in- 
terference is assessed by examining the 
recognition scores from the first test 
position with varying stimulus posi- 
tions. Both interference conditions ap- 
pear to produce exponential functions 
with an asymptote of .70 (estimated 
from conditions where the separation 
between presentation and test exceeded 
nine items and either s = 16 or t = 1).
The data shown in Fig. 3 represent PM 
strengths, P(s, 16, t), estimated by 
subtracting .70 from the strength 
values of Table 2. Although the re- 
sults from both interference conditions 
in Fig. 3 appear to be reasonably well 
described by linear functions on the 
semilogarithmic plot, the slopes of the 
lines differ considerably. Thus, in
contrast to the results of Exp. I, stimulus
and test interference have very differ- 
ent effects. 

When the parameter values are esti- 
mated by the least-squares iteration 
described previously, we are able to 
assess the exact relationship between 
the two forms of interference. The 
parameter estimates were obtained for 


558 DONALD A. NORMAN AND NANCY C. WAUGH 


TABLE 2
FREQUENCIES OF RECOGNITION AND MEMORY STRENGTHS FOR EXP. II

Test List (t)   Stimulus List (s)   p(yes)*   M(s, 16, t)   p (false recog.)
      1                 4              60         .84              28
                        8              79        1.38
                       10              85        1.62
                       12              90        1.86
                       13              97        2.46
      2                15              97        2.35              32
      3                11              83        1.20              40
                       13              89        1.48
      4                14              91        1.52              43
                       16              88        1.36
      5                11              78         .92              44
                       16              90        1.43
      6                12              77         .94              42
                       14              78         .97
      7                16              77         .89              44
      8                11              77         .94              46
      9                13              75         .78              46
                       16              76         .80
     10                15              69         .56              48


* Relative frequencies have been multiplied by 100.


all the experimental conditions of Exp.
II simultaneously, using all 19
experimental conditions. The results are as
follows: α = 4.31, θ = .68, φ = .83,
and λ = .68. The decay parameter
for test-list interference is almost
exactly equal to the square of the decay
parameter for stimulus-list
interference: (.83)² = .69. Thus, the k of
Equation 2 = 2.0. In Exp. II,
therefore, if we define each stimulus item to
have one unit of interfering value, each
test produces two units of interference.

FIG. 3. The results of Exp. II, showing decay in PM strength as a function of test-list
interference (t) for the last stimulus item (lower curve) and the decay in strength as
a function of position from the end of the stimulus list (16 − s) for the first test item
(upper curve). (The slope of the curve for test-list interference is approximately twice
the slope of the curve for stimulus-list interference.)
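
The factor of two can be checked by hand from the rounded estimates. Equation 2 is not reproduced in this part of the text; the worked arithmetic below assumes it relates the two decay parameters as θ = φ^k.

```latex
\[
\theta = \phi^{k}
\;\Longrightarrow\;
k = \frac{\ln\theta}{\ln\phi}
  = \frac{\ln .68}{\ln .83}
  \approx \frac{-0.386}{-0.186}
  \approx 2.07,
\qquad
\phi^{2} = (.83)^{2} \approx .69 .
\]
```

With the two-decimal estimates the ratio comes out slightly above 2, which agrees with the reported value of 2.0 to within the rounding of θ and φ.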


Discussion 


Both experiments have produced sup- 
port for the model summarized by Equa- 
tion 1, but they have yielded different 
estimates for the relative effects of stim- 
ulus- and test-induced interference. In 
Exp. I, stimulus and test interference ap- 
pear to exert equivalent effects on mem- 
ory strength, whereas, in Exp. II, a test 
appeared to have twice the effect of a 
stimulus. These differences perhaps re- 
flect the fact that in Exp. I the items 
were drawn from a large set of stimulus 
alternatives which were not previously 
known to Ss and none of which was ever 
repeated within the experiment. In Exp. 
II a small set of well-learned stimulus 
items was used, and Ss were exposed to 
the entire set on every trial. 

Intuitively, the results of Exp. II are 
perhaps easier to understand: each test 
consists of one stimulus presented by E
plus a response made by S. The E’s
operations during both acquisition and
test trials are identical. Thus, if S makes
no overt response during the test trial,
we would expect every item on that trial 
to function exactly as one stimulus item. 
According to the results of Exp. II, each 
recognition response made by S, therefore, 
then functions as exactly one additional 
unit of interference. According to the 
results of Exp. I, however, none of S’s 
responses to test items interferes with the 
retention of earlier items. Just why the 
procedural differences between the two
experiments should produce such different 
results is not at all clear. 

Both sets of data show, however, that 
the nature of the interference produced 
by items which are to be stored and items 
which are to be recognized is very
similar: both apparently produce an
exponential decay in the PM strength of earlier
items. This general result is consistent 


with the results of recall experiments 
(Waugh & Norman, 1965). Its practical 
implications for the design of experiments 
on recognition memory are obvious. 


REFERENCES 


Egan, J. P. Recognition memory and the
operating characteristic. (AD-152650,
AFCRC-TN-58-51) Indiana University,
Hearing and Communication Laboratory,
1958.

Elliott, P. B. Tables of d’. In J. A. Swets
(Ed.), Signal detection and recognition by
human observers. New York: Wiley,
1964.

Green, D. M., & Swets, J. A. Signal detection
theory and psychophysics. New York:
Wiley, 1966.

Mackworth, J. F. The effect of the
response upon the immediate memory span.
Canadian Journal of Psychology, 1962, 16,
120-127.

Murdock, B. B., Jr. Signal-detection theory
and short-term memory. Journal of
Experimental Psychology, 1965, 70, 443-447.

Norman, D. A. Acquisition and retention in
short-term memory. Journal of
Experimental Psychology, 1966, 72, 369-381.

Norman, D. A., & Wickelgren, W. A.
Short-term recognition memory for single
digits and pairs of digits. Journal of
Experimental Psychology, 1965, 70, 479-489.

Pollack, I., Norman, D. A., & Galanter,
E. An efficient non-parametric analysis of
recognition memory. Psychonomic
Science, 1964, 1, 327-328.

Tulving, E., & Arbuckle, T. Y. Source
of intertrial interference in immediate
recall of paired associates. Journal of
Verbal Learning and Verbal Behavior, 1963, 1,
321-334.

Waugh, N. C. Free versus serial recall.
Journal of Experimental Psychology, 1961,
62, 496-502.

Waugh, N. C., & Norman, D. A. Primary
memory. Psychological Review, 1965, 72,
89-104.

Wickelgren, W. A., & Norman, D. A.
Strength models and serial position in
short-term recognition memory. Journal
of Mathematical Psychology, 1966, 3,
316-347.


(Received November 22, 1967) 


Journal of Experimental Psychology 
1968, Vol. 78, No. 4, 560-568 


TRANSITIVITY OF VISUAL JUDGMENTS 
OF SIMULTANEITY 
Judgments of the simultaneity of pairs of brief light flashes are not
in general veridical, i.e., objective and subjective simultaneity relations 
are not identical. To explain this discrepancy, it has been proposed 
that there exists a “simultaneity center” in the brain where paths 
of excitation from the visual system must coincide to produce the 
experience of simultaneity. The present experiment was designed 
to test an implication of the simultaneity center hypothesis: that simul- 
taneity judgments are transitive. 3 experienced Os were monocularly 
presented with pairs of 5-msec. light flashes. The 3 stimulus lights, 
F, N, and E, were presented pairwise in 3 sessions: F vs. N, F 
vs. E, and N vs. E. Thresholds of simultaneity were obtained for 
each session in terms of the interstimulus interval required to produce 
subjective simultaneity. Within the experimental variability obtained, 
the transitivity relation was confirmed. This finding provides support
for the simultaneity center hypothesis.


It often appears that there are two 
distinct species of time: the real, as 
measured by objective instruments; and
the apparent, as determined by subjec- 
tive experience. Real and apparent 
time seem particularly different when 
brief stimulus pairs are involved. When 
two brief light flashes are presented to 
the same retinal area but with a tem- 
poral separation of up to 100 msec., 
they are perceived, under certain con- 
ditions, as having no apparent temporal 
separation. Conversely, two events that
are simultaneous in terms of objectively 
measured time may be subjectively 
nonsimultaneous. For example, objec- 
tively simultaneous brief shocks to toe 
and forehead are subjectively non- 
simultaneous, with the forehead stimu- 
lus consistently reported as having pre- 
ceded the toe stimulus (Efron, 1963, 
p. 261). Hence, objective and
subjective simultaneity are not, in general,
identical relations.

On the other hand, objective simul- 
taneity is always a transitive relation. 
That is, if Events A and B are simul- 
taneous in objectively measured time, 


and if B and C are likewise objectively 
simultaneous, then A and C must also 
be objectively simultaneous. Does the 
subjectively simultaneous relation share 
this transitive property? Not neces- 
sarily, since, as shown above, the ob- 
jective and subjective simultaneity re- 
lations do not coincide under certain 
conditions. It is, therefore, entirely 
possible that the subjectively simul- 
taneous relation is nontransitive under 
these same conditions. The present 
study was designed to test this possi- 
bility. 

An appropriate way of accounting 
for a discrepancy of temporal order in 
real time vs. apparent time is in terms 
of measurable nervous system delays. 
If the neural pathway from receptor to 
central projection area excited by one 
of two stimuli is sufficiently longer or 
has a slower transmission rate than 
the pathway excited by the other stimu- 
lus, it can be assumed that simultaneous 
peripheral stimulation of the two path- 
ways will be perceived as nonsimul- 
taneous, with the stimulus exciting the 
longer or slower path occurring later 
in apparent time. Presumably, the two
stimuli then could be made to appear 
simultaneous by introducing an objec- 
tively measured delay in the apparently 
earlier stimulus equal to the relative 
neural delay between the two paths. 

The concept that neural path length 
differences cause objectively simultane- 
ous stimuli to be perceived as non- 
simultaneous does not logically imply 
that the two paths must converge at 
any point. Sherrington (1911, p. 384) 
considered this a matter of fundamental 
importance, stating that “Pure con- 
junction in time without necessarily 
cerebral conjunction in space lies at 
the root of the solution of the problem 
of the unity of the mind.” Later, in 
discussing the properties of correspond- 
ing retinal points, Sherrington (1964, 
p. 207) asserted that “corresponding 
right-eye and left-eye perceptions are 
contemporaneous. Their contempo- 
raneity fuses them. There is no need 
of spatial coupling in the brain.” 

It is, however, difficult to accept this 
conclusion. It does not seem reasona- 
ble that the experience of simultaneity 
could arise directly from the activity of 
two entirely noncoincident brain re- 
gions. For, if a response is to be based 
on neural activity in two sensory pro- 
jection areas, this response must itself 
be based on neural activity in a third 
region capable of being modulated in 
some way by the activities of the first 
two regions. It is difficult to see how 
this modulation could be possible with- 
out some structural interconnections in 
which collateral paths from each sen- 
sory area coincide in some neural inte- 
gration area. The existence of such 
a brain region as a mediator of judg- 
ments of simultaneity and temporal 
order has been explicitly proposed by 
Efron (1963, p. 262). On the basis 
of experiments on simultaneity
judgments for stimuli of different
modalities and for pairs of light flashes
presented dichoptically, he concluded 
that a region exists in the temporal 
lobe of the speech-dominant cerebral 
hemisphere which reacts to the simul- 
taneity of two or more sensory inputs 
through some form of coincidence de- 
tection involving convergent neural 
paths. This has been called the “simul- 
taneity center” hypothesis. 

If this hypothesis is true, then it 
must be possible to show that simul- 
taneity judgments for pairs of light 
flashes are transitive. To illustrate this, 
consider Fig. 1. Suppose Retinal 
Points A, B, and C communicate with 
the Simultaneity Center S along three 
neural paths with respective transmis- 
sion delays of 80, 50, and 60 msec. 
For Stimulus Pair AB to appear simul- 
taneous, the light flash to Location A 
would have to “lead” (precede) the 
flash to B by (80—50) = 30 msec. 
Similarly, for Stimulus Pair BC to 
appear simultaneous, B would have to 
lag C by 10 msec. (i.e., to lead C by 
—10 msec.), and for Stimulus Pair AC 
to appear simultaneous, C would have 
to lead A by —20 msec. The algebraic 
sum of the three lead times required 
for subjective simultaneity over the 
three stimulus pairs must equal zero. 
As can be seen from Fig. 1, this relation 
is a property of the simultaneity center 
structure, and is independent of the 
particular values of transmission delays. 
In short, it is an intrinsic property of 
the simultaneity center model that the 
algebraic sum of the lead times for 
three stimulus pairs is zero, which im- 
plies that the simultaneity relation is 
transitive, and, given any two of the 
interstimulus delays, the third delay 
can be predicted. 
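
The zero-sum property can be checked numerically. The sketch below uses the transmission delays of the worked example (80, 50, and 60 msec.) and takes the lead required for a pair to be the first path's delay minus the second's; the function names are illustrative and not part of the original report.

```python
def required_lead(delay_first: float, delay_second: float) -> float:
    """Lead (msec.) the first flash needs over the second so that both
    excitations reach the simultaneity center together. A negative
    value means the first flash must lag the second."""
    return delay_first - delay_second


def leads_around_cycle(delays: dict) -> dict:
    """Required lead for each successive pair, taken around the cycle
    (A-B, B-C, C-A for three locations)."""
    labels = list(delays)
    pairs = zip(labels, labels[1:] + labels[:1])
    return {
        first + second: required_lead(delays[first], delays[second])
        for first, second in pairs
    }


delays_msec = {"A": 80, "B": 50, "C": 60}
leads = leads_around_cycle(delays_msec)
print(leads)                # {'AB': 30, 'BC': -10, 'CA': -20}
print(sum(leads.values()))  # 0
```

Because each delay enters the sum once with a plus sign and once with a minus sign, the total is zero for any choice of delays, and any two leads determine the third.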

The present experiment tested the
transitivity of the three possible pairs
of three retinal points by measuring the
interstimulus delays required for
subjective simultaneity for each pair and
determining whether the algebraic sum
of these three delay times differed
significantly from zero. A significant
difference would have disproved the
simultaneity center model. The results
obtained, however, support the model.

FIG. 1. Example illustrating how the
simultaneity center hypothesis implies the
transitivity of simultaneity judgments.
(Stimuli A, B, and C are presented to three
separate retinal locations. Time delays, in
msec., shown in the figure:)

$$t_B - t_A = d_A - d_B = 30$$
$$t_C - t_B = d_B - d_C = -10$$
$$t_A - t_C = d_C - d_A = -20$$


METHOD 


Apparatus—Figure 2 shows the appa- 
ratus of the experiment. The O was seated 
in a lightproof booth, 62 in. from a circular 
panel on which the stimulus lights were 
mounted. Viewing was monocular with the 
right eye. The three stimulus lights con- 
sisted of a foveal light “F” located on O’s 
line of sight, and peripheral lights “N” and 
“E” located 3° above and to the right of the 
line of sight, respectively. 

The light sources were microminiature 
“Pinlites,” yy in. in diameter, powered by 
a regulated 12-v. dc source. An annular
fixation light, subtending a visual angle of 1°, 
was reflected from the top of the booth to O 
through a sheet of glass oriented at 45° 
from his line of sight. From O's point of 
view the fixation annulus was concentric with 
the central stimulus “F” and at the same 
distance from him as the stimuli. 


The sequence of stimulus presentation 
was controlled by means of a solid state
electronic timer, accurate to within 1 μsec.
During the first 5 sec. of each 15-sec. trial
sequence, the fixation light was on. One
second after the fixation light was turned off the
first stimulus light was presented, followed 
by the second stimulus light after an inter- 
stimulus interval (ISI) previously deter- 
mined by E and set manually by him on the 
timer. All stimulus flashes were 5 msec. 
in duration. Intensities of the stimulus 
lights were approximately 1 log unit above 
foveal threshold for all Os. The fixation
light was set at a comfortable low level. 

Subjects—Three Os with normal uncor- 
rected vision and with considerable pre- 
vious experience in visual psychophysical 
observations were used. All Os were dark- 
adapted for at least 5 min. prior to each
experimental session. Head positions were
fixed by means of an adjustable chair and 
rotatable headrest mounted at the back wall 
of the booth. The left eye was occluded 
with an eyepatch. 

Psychophysical method —In the first phase 
of the experiment measurements were made 
in replication of the method reported by 
Efron (1963, p. 264) and called by him a 
“descending method of limits from alter- 
nate directions.” In this method the tem- 
poral interval between the flashes in a stim- 
ulus pair was reduced by successive 5-msec. 
steps from an initial interval large enough 
to be clearly nonsimultaneous. In all trials 
of a series, the ISI was always reduced in the 
direction of subjective simultaneity. In no 
case was the interval reversed during a series,
although “catch trials” were included in
which there was no reduction in the interval.
The Os were allowed to request the
repetition of any trial on which they had
difficulty deciding on a response. A series ended
when the first “simultaneous” judgment was
obtained. The process then was repeated
with a series of trials using “negative” ISIs,
i.e., with Flash B before A, rather than A
before B, thereby obtaining a second point
of subjective simultaneity.

FIG. 2. Schematic drawing of apparatus
for stimulus presentation. (Inset shows
configuration of stimulus lights; dotted ring
represents fixation annulus concentric with
foveal stimulus light F.)

In general, the points of subjective simul- 
taneity obtained from “positive” and “nega- 
tive” series differed from one another by an 
interval which may be called the “simul- 
taneity span.” To establish the end points 
of this simultaneity span to within a satis- 
factory confidence level, positive and nega- 
tive series were performed 10 times each in 
alternating fashion. The initial ISI of 
each series was varied at random over several 
5-msec. steps to minimize possible “anchor- 
ing” effects, and to prevent step counting. 
Simultaneity span measurements were ob- 
tained in this way for all Os for the three 
possible stimulus pairs: F-N, N-E, and E-F. 

In the phase of the experiment just de- 
scribed, Os were not required to judge the 
temporal order of the two stimulus flashes, 
but only to detect the presence or absence 
of simultaneity. In the second phase, all 
simultaneity spans were remeasured using a 
modified staircase method in which judg- 
ments of temporal order were required. 

In this second method, eight response 
categories were allowed: +3, +2, +1, +0, —0, 
—1, —2, and —3. The “3” responses were 
given when distinct and definite time
ordering was perceived, and “1” responses were
given when the time ordering was just barely 
perceptible, with “2” responses for inter- 
mediate cases. A plus response was given 
when Flash A was perceived as preceding 
Flash B, and a minus response when B 
was perceived as preceding A. 

For the subjectively simultaneous situa- 
tion a forced choice was required between 
the +0 and —0 responses, despite the fact 
that no time ordering had been perceived. 
As before, O could request to have any
“uncertain” trials repeated. Responses were
relayed from a hand-held bank of eight
microswitches to E’s console outside of the
booth. After some experience, Os found
no difficulty in understanding and using this
response system with confidence.

Within each 60-trial experimental session,
positive and negative ISIs were alternated
randomly. In this way both ends of the
simultaneity span were being measured in the
same session, by approaching the point of
subjective simultaneity from both directions.
The two initial ISIs, one positive and one 
negative, were chosen to lie well outside of 
the O’s previously determined simultaneity 
span. On successive trials, these positive 
and negative ISI values were reduced in the 
direction of subjective simultaneity by 5- 
msec. steps, except for a small percentage of 
randomly chosen trials in which the ISIs 
were increased. When a +0 or —0 re- 
sponse was obtained, the magnitude of the 
ISI for the next trial from that direction was 
always increased (i.e., in the direction of
nonsimultaneity). 

From O’s point of view this method was 
of the constant stimulus variety in that the 
response given for a specific trial was not 
predictably related to the response ap- 
propriate for succeeding trials, whereas for 
E the session consisted of determining two 
thresholds at once by the up-and-down 
method with random alternations from one 
schedule to another. 

As in the first phase, this process was re- 
peated for all three stimulus pairs. Each 
session occurred on a different day, and 
the entire series of measurements was per- 
formed twice for all Os. 


RESULTS 


Figure 3 is an example of the data 
obtained in the E vs. N condition using 
Efron’s descending method of limits. 
Bars in the lower part of the figure 
represent the ISI for which judgments 
of “nonsimultaneous” were obtained. 
Each bar represents a separate sequence 
of trials; a bar ends at the ISI value 
where the first “simultaneous” judg- 
ment occurred. Twenty such sequences 
were performed, 10 in each direction 
(indicated by arrows). In the upper
part of the figure is a plot of the
percentage of all responses which were
nonsimultaneous for each ISI. The 
end points of the simultaneity span can 
be defined as the ISI values at which 
50% of the responses are nonsimul- 
taneous. 
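
As a rough illustration of the 50% criterion, the following Python sketch finds the ISI at which a percentage curve first crosses 50%. Linear interpolation between neighbouring ISIs is an assumption made here (the text does not say how the crossing was read off the curves), and the function name and sample values are hypothetical.

```python
def fifty_percent_crossing(isis, pct_nonsimultaneous, criterion=50.0):
    """Return the first ISI at which the curve crosses the criterion.

    isis: ISI values in msec., in ascending order.
    pct_nonsimultaneous: percentage of nonsimultaneous responses at each ISI.
    Uses linear interpolation between adjacent points (an assumption).
    Returns None if the curve never crosses the criterion.
    """
    points = list(zip(isis, pct_nonsimultaneous))
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        crosses = (y0 - criterion) * (y1 - criterion) <= 0
        if crosses and y0 != y1:
            return x0 + (criterion - y0) * (x1 - x0) / (y1 - y0)
    return None


# Hypothetical curve for one end of a span.
end_point = fifty_percent_crossing([0, 5, 10, 15], [0, 20, 80, 100])
```

Applying the function separately to the positive and negative sides of the curve would give the two end points of the span.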

Fig. 3. Simultaneity span measurement for E vs. N condition using descending method of limits from alternate directions. (Lower graph: bars represent ISIs for which “nonsimultaneous” responses were obtained, ending at the ISI at which the first simultaneous response was given. Upper graph: same data plotted as percentage of “nonsimultaneous” responses for each ISI [thin lines] smoothed over a 5-msec. interval [thick lines with crosses]. Arrow indicates midpoint of simultaneity span.) 

Since the variable being manipulated 
was the lag time of the later stimulus 
light with respect to the earlier light, 
all ISIs are expressed as lags. In each 
of the three stimulus pairs, one stimulus 
was assigned a positive lag time and 
the other a negative lag time (i.e., lead 
time). Thus, in Fig. 3, N lags are 
plotted as positive and E lags as nega- 
tive ISIs. For the other stimulus pairs 
a consistent sign convention was used 
(E lags positive in F-E pair, and F 
lags positive for N-F pair). 

Under this convention, if the ISI 
corresponding to the midpoint of the 
simultaneity span for each stimulus pair 
is taken as the point of optimum sub- 
jective simultaneity, then the algebraic 
sum of these midpoints must be zero 
if transitivity holds (cf. Fig. 1). That 
is, if $a_i$ and $b_i$ are the ISI values of 
the right and left ends of the simultaneity 
span for Stimulus Pair i, then 
transitivity implies 

$$\sum_{i=1}^{3} (a_i + b_i)/2 = 0.$$

But this means no more than that the 
algebraic sum of the six simultaneity 
span end points be equal to zero, i.e., 

$$\sum_{i=1}^{3} a_i + \sum_{i=1}^{3} b_i = 0.$$

Hence in order to test for transitivity, 
it is not necessary that the lengths of 
the three spans be equal, provided only 
that the point of optimum subjective 
simultaneity divides the simultaneity 
span into some constant ratio for all 
three stimulus conditions. 
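
The transitivity condition can be illustrated numerically. The sketch below, in Python, sums the span midpoints over the three stimulus pairs and checks that this equals half the sum of the six end points. The function names and the end-point values are hypothetical and are not data from the experiment.

```python
def span_midpoint(a, b):
    """Midpoint (msec.) of a simultaneity span with end points a and b."""
    return (a + b) / 2.0


def transitivity_sum(spans):
    """Algebraic sum of span midpoints over stimulus pairs.

    spans: list of (a_i, b_i) end-point pairs in msec., using one consistent
    sign convention. A value of zero is what transitivity predicts.
    """
    return sum(span_midpoint(a, b) for a, b in spans)


# Hypothetical end points for three stimulus pairs (not the observed data).
spans = [(40, -20), (50, -10), (0, -60)]   # midpoints: 10, 20, -30
total = transitivity_sum(spans)            # 0.0 for these made-up values
end_point_sum = sum(a + b for a, b in spans)
```

Because the midpoint sum is exactly half the end-point sum, the test depends only on where the end points lie and not on the lengths of the spans.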

Figure 4 shows the form taken by 
the data obtained using the modified 
staircase method, for the F-N stimulus 
condition. The graphs combine the 
results of two experimental sessions. 

Fig. 4. Simultaneity span measurement for N vs. F condition using modified up-and-down method. (Upper graph: histogram of responses given at each ISI, two sessions combined. Plus responses denote “N first,” minus responses denote “F first,” and 0 responses denote “simultaneous.” Lower graph: same data plotted as percentage of “nonsimultaneous” responses obtained for each ISI smoothed over a 5-msec. interval.) [Axes: Number of Responses and % “Nonsimultaneous” Responses against North Lag and Foveal Lag, msec.] 

In the upper graph, data are plotted 
in histogram form with the specific 
responses made indicated by signed 
numbers under the distributions. For 
the condition indicated, plus responses 
denote “F before N” and minus responses 
denote “N before F.” The 
lower curves of the figure are derived 
from the histograms by computing the 
percentage of all responses for a specific 
ISI value which were nonsimultane- 
ous. Responses of 0, whether plus or 
minus, were considered simultaneous 
responses. In addition, “false alarm” 
responses (plus responses in the left- 
hand histogram or minus responses in 
the right-hand histogram) were counted 
as simultaneous responses since they 
indicate a confusion of temporal se- 
quence. The percentage curves were 
smoothed over a 5-msec. interval. 
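
The scoring rule in this paragraph can be sketched in Python. The representation of a response as a (sign, rating) pair, the function name, and the sample responses are assumptions made for illustration. The rule itself (0 responses and wrong-sign “false alarm” responses both count as simultaneous) is taken from the text, and the 5-msec. smoothing step is not reproduced.

```python
def percent_nonsimultaneous(responses, expected_sign):
    """Percentage of responses at one ISI scored as nonsimultaneous.

    responses: list of (sign, rating) pairs, where sign is '+' or '-' and
        rating is 0 for "simultaneous", 1 for barely nonsimultaneous, and
        so on (assumed encoding).
    expected_sign: sign of the correct temporal order at this ISI.
    A response counts as nonsimultaneous only if its rating is above 0 and
    its sign matches the correct order; 0 responses and false alarms are
    scored as simultaneous.
    """
    if not responses:
        return None
    nonsimultaneous = sum(
        1 for sign, rating in responses
        if rating > 0 and sign == expected_sign
    )
    return 100.0 * nonsimultaneous / len(responses)


# Hypothetical responses collected at one positive ISI.
sample = [('+', 2), ('+', 0), ('-', 1), ('+', 1)]
pct = percent_nonsimultaneous(sample, '+')
```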

A comparison of data obtained from 
the two psychophysical methods yielded 
similar results, whether or not tem- 
poral ordering judgments were re- 
quired. There was in fact a tendency 
for simultaneity spans measured using 
temporal ordering judgments to be 
somewhat narrower than when measured 
using simple simultaneous/nonsimultaneous 
judgments. This implies 
that whenever O was able to perceive 
some degree of nonsimultaneity in the 
stimulus pair, he also was able to judge 
the temporal order of the stimuli consistently. 
The number of confusions 
in temporal order was in general small 
for all Os, averaging 20% for the barely 
nonsimultaneous response category (1), 
and only 27% for the simultaneous re- 
sponse category (0) where a 50% false 
alarm rate might have been expected. 
That is, even when simultaneous judg- 
ments were being made, the forced- 
choice technique indicated that some 
degree of information about the tem- 
poral order of the stimuli was available. 

No systematic differences were found 
between the simultaneity span mid- 
points as determined by the two psy- 
chophysical methods used. This implies 
that, with respect to determining the 
point of optimal subjective simultaneity, 
the two types of judgment are equivalent. 
The doubts raised by Efron 
(1963, p. 279) on this question, therefore, 
were not confirmed by the present 
results. 

Two general features of the varia- 
bility of measured simultaneity spans 
emerged from this study. Measured 
span lengths varied considerably between 
Os. For the three Os, span 
lengths averaged over the three experi- 
mental conditions were 51 msec., 120 
msec., and 195 msec. This indicates 
that simultaneity span length is to a 
large extent determined by O’s criteria 
rather than by individual sensitivities 
to time intervals. This conclusion is 
further supported by the fact that si- 
multaneity spans for the E-N stimulus 
pair, in general, were somewhat longer 
for all Os than their spans for the other 
two stimulus pairs (which included 
the foveal stimulus), so that the greater 
“difficulty” of this condition was re- 
flected in longer simultaneity spans. 

In contrast, there was remarkably 
small inter-S variability in the range of 
ISI values defining individual simultaneity 
span end points, i.e., in the 
range between the ISI at which the first 
simultaneous response was obtained and 
the ISI at which the first nonsimul- 
taneous response was obtained. These 
transition ranges were approximately 
50 msec. for all Os under all three ex- 
perimental conditions. This indicates 
that once O has set his response cri- 
terion, he is able to exhibit a relatively 
unvarying sensitivity to time intervals. 
This finding is especially fortuitous, 
since a test of the transitivity hypothe- 
sis requires only that the end points of 
the simultaneity spans be known accu- 
rately and does not depend on their 
lengths as such. 

Figure 5 is a plot of all of the ex- 
perimental results for one O (RMB). 
The origin of the graph is the point of 
objective simultaneity, and the horizon- 
tal axis is the ISI value in milliseconds 
between each of the three stimulus 
pairs, using the sign convention de- 
scribed earlier. The horizontal lines 
represent the measured simultaneity 
spans for each session. Small vertical 
lines at the outer ends of the bars represent 
the ISI value at which the first 
simultaneous response was obtained, 
and the inner vertical lines represent 
the ISI at which the last nonsimultaneous 
response was obtained. Arrows 
indicate simultaneity span midpoints. 

Fig. 5. Simultaneity spans of all three stimulus conditions for O (RMB). 

Lines labeled Exp. I were obtained 
using the descending method of limits 
from alternate directions, and those 
labeled Exp. II were obtained using 
the randomly alternating up-and-down 
method. The filled circles represent 
E-N span midpoints predicted from 
measured F-E and N-F midpoints 
under the transitivity hypothesis. 

The data consist of simultaneity span 
midpoints for each stimulus pair mea- 
sured three times for each O (one 
measurement using Efron’s descending 
method of limits and two replications 
using the modified staircase method). 

An analysis of variance was carried 
out on the data from each O and on 
their pooled data (Table 1). Stimulus 
Pair × Replication interaction was used 
as an estimate of within-cell error variance 
under the assumption that a true 
Stimulus Pair × Replication effect was 
unlikely to have been present in the 
experimental method, since Os did not 
receive stimulus-pair sequences in the 
same order for each replication. 

TABLE 1 

TESTS OF DIFFERENCE BETWEEN SUBJECTIVE 
AND OBJECTIVE SIMULTANEITY, AND OF 
THE TRANSITIVITY OF SUBJECTIVE 
SIMULTANEITY 

         Main Effect due to        Planned Comparison of Simultaneity Span 
         Stimulus Pairs            Midpoints Averaged over Stimulus Pairs, 
                                   against 0 Msec. 
O        F      | df   | p      |  F    | df   | p 
CS       24.3   | 2,4  | <.01   |  1.   |      | 
RF       16.3   | 2,4  | <.05   |  2.28 | 3,4  | >.20 
RMB      118.0  | 2,4  | <.01   |  1.94 | 3,4  | >.25 
Pooled   109.0  | 6,12 | <.01   |  1.70 | 9,12 | >.15 

The F tests of the main effects due 
to stimulus pairs were significant for 
all three Os, indicating that the points 
of subjective simultaneity of one or 
more of the three stimulus pairs do 
differ from the objectively simultane- 
ous. Therefore, it is possible to con- 
clude that real and apparent simul- 
taneity were distinctly different for the 
stimulus pairs, and that the basic con- 
ditions for the proposed test of transi- 
tivity were met. 

To test the transitivity hypothesis it 
was necessary to determine if the mean 
of the measured simultaneity span mid- 
points over stimulus pairs differed sig- 
nificantly from 0 msec. For this, a 
planned comparison test (Winer, 1962) 
was performed on mean simultaneity 
span midpoints summed over stimulus 
pairs, against a value of 0 msec. Table 
1 shows the results of these tests for 
each O and for their pooled data. 

In no case do these tests allow the 
rejection of the null hypothesis that the 
true mean of simultaneity span mid- 
points is 0 msec. Hence, the transitivity 
hypothesis may not be rejected. 


Discussion 


The present study can be related to that 
of Hirsh and Sherrick (1961) who in- 
vestigated judgments of the temporal or- 
der of pairs of 5-msec. light flashes, pre- 
sented binocularly at eccentricities of 5°, 
10°, and 20°, using a forced-choice meas- 
urement technique. Both horizontal and 
vertical stimulus pairs were used. They 
found that “approximately the same time 
is required to tell which of the two stim- 
uli came first under all conditions,” i.e., 
that there was approximately a 50% con- 
fusion rate of time sequence at the point 
of objective simultaneity under all experi- 
mental conditions. The results of the 
present study strongly contradict the gen- 
erality of these findings. 

An underlying assumption of the pres- 
ent study was that there were no neural 
interactions among the three neural path- 
ways excited by the stimulus flashes other 
than at their presumed common region of 
coincidence. Any interactions specific to 
the particular pathways of the stimuli 
used would have obscured or obliterated 
the transitive property being investigated. 
Although the retinal loci used were far 
enough apart to insure the absence of con- 
ventional “masking” interactions (meta- 
contrast or paracontrast), there was no 
guarantee that the stimuli would not be 
subject to masking effects reported to 
exist between more widely separated 
visual stimuli under simultaneous pres- 
entation conditions (Boynton & Bush, 
1957). Since these latter interference ef- 
fects vary considerably with retinal loca- 
tion, if they had been present, the pre- 
sumed neural delay in one stimulus path- 
way would have been a function of the 
retinal location of the second stimulus 
pathway. It is probably safe to say that 
if these interference effects had been pres- 
ent, a distinctly nontransitive relationship 
might have been found. That, in fact, no 
significant nontransitivity was found indi- 
cates that locus-specific neural interac- 
tions were not important under the condi- 
tions of the present experiment. 

If the results of the present experiment 
had shown visual simultaneity judgments 
to be nontransitive, Efron’s simultaneity 
center hypothesis would have been clearly 
disconfirmed, and some alternate form 
of explanation for the phenomenon of 
temporal judgments (perhaps the Sherringtonian 
notion) would have been necessary. 
The results found, however, 
strongly support Efron’s hypothesis, 
which contends that for a pair of stimuli 
to appear simultaneous there must be tem- 
poral coincidence of two streams of neural 
activity within a single localized brain 
region. Further verification of the simul- 
taneity center hypothesis would do much 
to provide a more complete explanation 
of the phenomenon of “apparent simultaneity.” 


The effects of different methods of payoff, ratios of value, and load 
upon the detection and report of targets were investigated in a visual 
search task. Ratios of value were 2:1, 4:1, and 8:1 points; loads were 
4, 8, and 12 letters; and methods of payoff were total amount of 
points accumulated (Method 1), total amount of points accumulated 
plus competition for most points accumulated (Method 2), and total 
amount of points accumulated plus penalties for incorrect responses 
(Method 3). The results indicated that Ss paid off under Methods 
1 and 2 employed a strategy aimed at maximizing hit rate, while Ss 
paid off under Method 3 employed a strategy aimed at maximizing 
the hit/error ratio. 


Recent investigations by Taub (1965) 
and Christ (1965) have indicated that 
human information processing ability 
may be less affected by input variables 
(load and category) than by conditions 
involved in the retrieval of information 
stored in short-term memory. Taub, 
using differential letter value (high vs. 
low) as a between-S variable, found 
that as value ratio increased, there was 
a selective recall from short-term mem- 
ory indicated by the number of high- 
valued initial reports. The Ss’ accuracy 
in reporting high-valued items remained 
constant, but the accuracy of report of 
low-valued items decreased, reflecting 
an overall decrement in performance. 

1 This paper is part of a thesis by the 
first author submitted in partial fulfillment 
of the requirements for the Master of Science 
degree in the Department of Psychology, 
University of Massachusetts. This 
research was sponsored by the Office of 
Naval Research under Contract Nonr 3357 

Christ also investigated the effect of 
differential letter value upon short-term 
memory and has shown that as the 
number of letters (load) presented 
within a value category increased, the 
percentage of recall decreased although 
the total number of letters recalled 
increased. Results from this experiment 
suggest that value ratio had no 
effect on either the initial reports or the 
overall number of reports given by Ss. 
These findings were inconsistent with 
Taub’s results and imply that these Ss, 
unlike Taub’s, did not adjust initial re- 
ports as a function of ratio of value. 
The differences between the results of 
these two studies may be due to the 
fact that Christ penalized Ss for targets 
omitted while Taub did not. Hence, 
the method of payoff may influence 
S’s performance by determining his 
strategy. 

The purpose of this study is to in- 
vestigate method of payoff as the major 
variable in conjunction with load, value, 
and value ratio in determining strategy 
in a visual search task. It was pre- 
dicted that when Ss were paid for the 
total number of points accumulated or 
were competing with all other Ss for a 
monetary prize for the best perform- 
ance in accumulating points, they would 
employ a general and random response 
strategy. There would be an overall 
increase in responding but a decrease 
in the accuracy of responding. When 
Ss were paid for points accumulated 
and penalized for incorrect responses, 
however, it was predicted they would 
employ a selective response strategy. 
There would be an overall decrease in 
responding but an increase in the accu- 
racy of responding. 


METHOD 


Subjects—The Ss were 45 undergraduate 
students enrolled in the introductory psy- 
chology courses at the University of Massa- 
chusetts. All Ss received course credit 
for their participation and monetary rewards 
based on their performance in the experi- 
ment. 

Apparatus—The stimulus displays con- 
sisted of 135 different slide projections of 
white alphabetic characters on a black background. 
The slides varied in number of 
categories (load) with each letter of the 
alphabet representing a category. The three 
levels of load were 4, 8, and 12 letters per 
slide. Each level of load was replicated 45 
times, each with a different random sample 
of the alphabet, with the restriction that 
within each half of the alphabet the letters 
appear equally often over the 45 slides. After 
each letter was chosen, it was randomly 
assigned to a location within a 10 × 10 matrix 
with the restriction that, on the average, half 
of the letters selected from each half of the 
alphabet were placed in the center 36 posi- 
tions of the matrix and the other half placed 
in the peripheral 64 positions. Following 
position placement, capital letters were typed 
on unlined white cards, using typewriter 
spaces as matrix cells. The cards were then 
photographed on 35-mm. negative film, from 
which slides were made for projection. 

A slide projector (Kodak Carousel), wired 
for automatic control of slide exposure and 
interslide interval, was used to present the 
stimulus displays. The projected size of the 
slide was 40 × 60 in., while the projected size 
of the letters was 4 in. The Ss sat in alter- 
nate chairs in a small classroom in groups of 
five each. The average distance from Ss’ 
eyes to the projected letters was 10 ft. 

Design.—To allow for a comparison of val- 
ues, a distinction was made between the first 
and second halves of the alphabet. The let- 
ters A through J comprised one value cate- 
gory, the letters Q through Z, the other. 
The letters K through P were not used, thus 
creating a clear distinction between the two 
categories of letters. 

The experimental design was a 3 × 3 × 2 × 
3 × 9 mixed design with method of payoff 
and value ratio as between-Ss variables and 
value, load, and trial blocks as within-S variables. 
Methods of payoff were: (a) Method 
1—a monetary remuneration for the total 
points accumulated for correct responses, (b) 
Method 2—the same remuneration as above 
plus a $5.00 prize for the most points accum- 
ulated within each value ratio group, and (c) 
Method 3—monetary remuneration for total 
points accumulated for correct responses after 
penalties had been subtracted for incor- 
rect responses. The Ss were penalized on 
the same basis used for accumulated points, 
i.e., losing the same number of points for 
an incorrect response as they would have re- 
ceived for a correct response of the same 
value. Value refers to the two subsets of 
letters designated as high and low valued for 
each group. Value ratios were 2:1, 4:1, 
and 8:1. Load refers to the number of let- 
ters per slide (4, 8, and 12). Trial blocks 
refer to successive blocks of a 15-slide 
presentation. Five Ss were run in each combination 
of the between-Ss variables with 
value ratio and method of payoff being the 
same for all Ss in a group. 

Procedure—The Ss were told that they 
would be shown briefly exposed, projected 
arrays of randomly positioned letters. They 
were instructed to search, and later (im- 
mediately after the removal of each projected 
array) to report on prepared answer sheets 
as many of these letters as they could remem- 
ber. All Ss received the following general 
instructions: 


The stimuli you are about to see are 
divided into two major categories. The 
A-J stimuli are (low-, high-) valued stim- 
uli and have a value of (2, 4, 8:1) points. 
The Q-Z stimuli are (high-, low-) valued 
stimuli and have a value of (2, 4, 8:1) 
points. Each point is worth ½ cent. This 
means for every correct high-valued target 
reported you will receive (1, 2, 4) cents, 
and for every correct low-valued target 
reported, ½ cent. 


Additional instructions were given to those 
Ss working under the different payoff conditions: 


(Method 1) You will not be penalized 
for incorrect responses. 

(Method 2) You will not be penalized 
for incorrect responses. Also, whoever ac- 
cumulates the most points will receive 
a $5.00 prize. 

(Method 3) You will be penalized for 
incorrect responses the same number of 
points you would have received had the 
responses been correct. Thus, you will 
be penalized (1, 2, 4) cents for an incorrect 
high-valued response and ½ cent for an 
incorrect low-valued response. 
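
The three payoff rules can be sketched in Python as follows. This is an illustrative sketch only: the function name, argument names, and sample counts are hypothetical. The value of ½ cent per point is inferred from the instructions above (a high-valued target worth 2, 4, or 8 points earns 1, 2, or 4 cents), and the $5.00 competition prize of Method 2 is left out because it depends on the scores of the other Ss in the group.

```python
CENTS_PER_POINT = 0.5  # inferred: 2, 4, or 8 points pay 1, 2, or 4 cents


def earnings_cents(hits_high, hits_low, errors_high, errors_low,
                   value_ratio, method):
    """Earnings in cents for one S under payoff Method 1, 2, or 3.

    value_ratio: points per high-valued target (2, 4, or 8); a low-valued
        target is worth 1 point.
    method: 1 or 2 pay for correct responses only (the Method 2 prize is
        not included here); 3 also subtracts the same number of points for
        each incorrect response as a correct one would have earned.
    """
    if method not in (1, 2, 3):
        raise ValueError("method must be 1, 2, or 3")
    points = hits_high * value_ratio + hits_low
    if method == 3:
        points -= errors_high * value_ratio + errors_low
    return points * CENTS_PER_POINT


# Hypothetical tallies for one S in the 4:1 value-ratio group.
pay_method_1 = earnings_cents(10, 6, 2, 4, value_ratio=4, method=1)
pay_method_3 = earnings_cents(10, 6, 2, 4, value_ratio=4, method=3)
```

With these made-up tallies, the same set of responses earns less under Method 3 than under Method 1, which is the pressure toward selective responding that the penalty was meant to create.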


To insure that Ss knew which conditions 
they were working under and would be 
familiar with the task in general, each S was 
given a card listing the two alphabet halves 
and designating which was of high value, 
what the ratio of value was, and what the 
method of payoff was (including cost for 
incorrect responses). As training, all Ss 
viewed three slides, one of each load condition. 
The E discussed the relevant information 
on the cue cards, while the slides remained 
in view, pointing out specifically on 
each slide how points (and penalties) would 
be tallied. 

During the experimental session, all Ss 
were presented with three groups of 45 slides 
in random order and given a 5-min. rest be- 
tween groups. The Ss performed under 
conditions of dim illumination which was 
considered bright enough to enable them to 
write their responses, but not bright enough 
to interfere with slide clarity. The Ss wrote 
down their reports to each slide on three 
numbered scoring sheets, each containing 
three blocks of 15 trials. Within each block 
of 15 trials, there were five slides from each 
load condition arranged in a random se- 
quence. Each slide was exposed for .75 sec. 
Immediately after the slide was turned off, 
Ss were given approximately 8 sec. to record 
their responses. This was followed by a .5-sec. 
warning buzzer and a 1-sec. interval 
before the onset of the next slide. 


RESULTS 


In order to assess S’s response 
strategy, accuracy, and item processing, 
the following three dependent measures 
were obtained: (a) the total number 
of high- and low-valued targets re- 
ported, (b) the percentage of the tar- 
gets correctly reported, and (c) the 
number of correct first reports. Analy- 
ses of variance were performed on the 
first two measures, while t tests were 
performed on the last measure. 

TABLE 1 

MEAN TOTAL RESPONSES PER TRIAL BLOCK FOR PAYOFF METHOD, TARGET 
VALUE, VALUE RATIO, AND LOAD 

               Payoff Method 1       Payoff Method 2       Payoff Method 3 
Load  Value    High-     Low-        High-     Low-        High-     Low- 
      Ratio    Valued    Valued      Valued    Valued      Valued    Valued 
 4    2:1       7.16      5.71        7.49      7.00        5.24      4.78 
 4    4:1       7.79      6.96        7.42      7.42        5.31      4.73 
 4    8:1       4.96      5.78        5.58      5.96        4.22      3.38 
 8    2:1       8.98      5.40        9.18      7.11        5.84      6.53 
 8    4:1       9.20      9.31        8.67      6.91        5.91      6.04 
 8    8:1       7.24      6.16        7.33      6.18        4.24      4.64 
12    2:1       9.69      6.60        8.91      7.62        7.07      [illegible] 
12    4:1      10.29      9.76        9.62      8.22        6.56      [illegible] 
12    8:1       7.42      6.93        7.80      7.29        5.20      [illegible] 

Table 1 summarizes the total number 
of targets reported. Each cell 
represents the respective total responses 
for each load and value within a block 
of trials averaged across Ss and trial 
blocks. As load increased, the total 
number of targets reported increased 
significantly for all methods and values, 
F (4, 72) = 6.064, p < .01. Methods 
1 (points) and 2 (points plus competition) 
yield approximately equal numbers 
of total responses at all loads, and 
both clearly differentiate target value; 
high-valued targets are reported more 
frequently than low-valued targets. 
Method 3 yields fewer responses than 
Methods 1 and 2 at all loads and results 
in no clear differentiation of target 
value. There was also an increase in 
total responding as a function of load 
for all methods of payoff under all 
value-ratio conditions with Methods 1 
and 2 yielding consistently higher re- 
sponding, F (8, 72) = 3.160, p < .01. 

A trend analysis of the total response 
data for the Method of Payoff × Value 
× Trial Blocks interaction shows that 
Methods 1 and 2 yielded a significant 
linear increase in responding, F (2, 12) 
= 45.553, p<.01, over trial blocks 
that may be attributed to Ss’ learning 
the task. These methods of payoff also 
produced a consistent though nonsig- 
nificant linear trend differentiation of 
target value since both methods are 
based on the total number of points 
accumulated. Method 3 produced a 
consistent response depression and non- 
differentiation of target value over trial 
blocks, both of which may be explained 
by the penalty imposed for incorrect 
responding associated with this method 
of payoff. 

Fig. 1. The percentage correct of the 
targets reported as a function of load with 
method of payoff as a parameter. 


Figure 1 summarizes the percentage 
of targets correctly reported as a func- 
tion of load with method of payoff as 
a parameter, F (4, 72) = 6.798, p < 
.01. Accuracy increased as load increased 
under Methods 1 and 2 but 
remained relatively constant under 
Method 3. Further, Method 3 pro- 
duced a significantly greater percentage 
of correct target reports than Method 
2 or Method 1 at all loads. 

If a correct first report is simply 
a reflection of which target was de- 
tected first, there should be no target 
value differentiation (high vs. low) 
other than that which can be expected 
to occur by chance. Any consistent 
departure from chance would present 
convincing evidence of differential item 
processing. Therefore, t tests were 
performed between the number of cor- 
rect high- and low-valued first reports 
for each load under each method of 
payoff. The results of these analyses 
indicated that significantly more correct 
high-valued targets were reported first 
at Load 8 under Methods 1, t(15) = 
2.460, p < .05, and 2, t(15) = 4.233, 
p < .01, and at Load 4 under Method 
3, t(15) = 3.257, p < .01. None of 
the remaining comparisons attained 
significance. 


Discussion 


The results indicate, as Edwards (1961) 
has suggested, that method of payoff pro- 
vides Ss with instructions concerning the 
choice of strategies to be used for max- 
imizing total point accumulation. The 
predicted increase in the total number of 
targets reported as a function of load, 
combined with a consistent target value 
differentiation found under Methods 1 and 
2, suggest that these Ss were utilizing a 
strategy aimed at maximizing their hit/ 
error ratio. Because of the penalty im- 
posed for errors, Ss were reporting fewer 
but more correct targets under Method 3, 
thus producing, as predicted, a higher 
percentage of targets correctly reported 
than under Methods 1 or 2. Since this 
reduced rate of responding reflects no con- 
sistent value pattern it is further sug- 
gested that the strategies utilized by these 
Ss were, in effect, minimizing all sources 
of error. Thus the imposition of penalties 
for incorrect responses, where low-valued 
incorrect responses were minimally penal- 
ized in relation to high-valued incorrect 
responses, produced a suppression of both 
high- and low-valued responses. 

Since Ss working under Method 3 are 
guarding against errors while reporting 
what they see, it is reasonable to assume 
that the processing of each target takes 
more time as compared to those Ss working 
under Methods 1 and 2. Consequently 
greater differential item processing 
for the fixed exposure should occur 
at a lower load under Method 3 as com- 
pared to Methods 1 and 2. Support for 
this argument is found in the analysis of 
the first-report data; there were signif- 
icantly more high-valued correct first re- 
ports at Load 8 under Methods 1 and 2 
and at Load 4 under Method 3. 

It was suggested earlier that Christ’s 
inability to reproduce Taub’s differential 
value effect, i.e., depressed responding to 
low-valued targets as value ratio increased, 
might be due to differences in method of 
payoff. The method of payoff in the pres- 
ent study was similar to that used by 
Taub and produced effects more akin to 
those obtained by Christ. In fact, analysis 
of the total response data indicates 
that Ss did not adjust their strategy as a 
function of value ratio under any of the 
payoff conditions. This result is not sur- 
prising since Ss had no knowledge of the 
value ratios other than the one to which 
they were exposed. For example, Ss 
working under the 2:1 condition were 
unaware of the existence of the 4:1 and 
8:1 conditions. Since value ratio was 
manipulated as a between-Ss variable, 
Ss had no basis for differential respond- 
ing other than the common reference of 
1 within each group. While it has been 
demonstrated that differences in the rela- 
tive amount of reward associated with two 
values produce differential responding in 
a two-choice situation (Taub & Myers, 
1961), the results of the present study and 
those obtained by Christ do not lend sup- 
port for its effect in a task of the type 
used in the present experiment. 


JUDGMENT OF RECENCY UNDER 
STEADY-STATE CONDITIONS* 


Judgment of recency (JOR) of sequentially presented items was 
assessed in 8 Ss. Each S received 8 different sequences of 250 
items, each item being 1 of 8 letters of the alphabet. The items 
were distributed within the sequences so that each of the possible 
intervals, 1-15, between successive presentations of the same item oc- 
curred approximately equally often. Mean JOR was found to be a 
negatively accelerated increasing function of the actual recency, with 
the proportion of perfect judgments declining from 100% to chance 
performance after about 9 intervening presentations. The Weber- 
Fechner psychophysical function gave a very accurate description of 
the mean, median, and variance of recency judgments and of the proportional 
constant error function. 


How does one judge the recency of 
events from memory? The dating of 
most of one’s memories undoubtedly 
occurs from reconstruction, i.e., by ref- 
erence to other memories and known 
logical relationships. But when the 
logical cues and sequential relationships 
which guide perception of time are 
minimized, how is recency judged? 

Judgment of recency (JOR) is the 
ability of an individual to judge the 
recency of occurrence of an event in a 
sequence of similar events. One pro- 
cedure for studying JOR consists of 
presenting a list of items and requiring 
S to make a numerical estimate of the 
number of items presented between suc- 
cessive occurrences of the test item. 

Although James (1890, p. 610) long 
ago expressed considerable interest in 
the ability to judge recency, direct empirical 
investigations of JOR are sparse. 
Peterson (1967) tested and rejected 
the hypothesis that an individual based 
his JOR of an item on the amplitude 
of the strength of the memory trace for 
that item. Yntema and Trask (1963)
found that when Ss were asked to pick 
the more recently presented word from 
a test pair of words, accuracy was bet- 
ter (a) the longer the interval between 
the initial presentation of the two test 
words, and (b) the shorter the interval 
between the presentation of the more 
recent of the two words and the presen- 
tation of the test. Brelsford, Freund, 
and Rundus (1967) examined JOR in 
a paired-associate task and found that 
JOR was more accurate for recent 
stimuli when the response was correct 
than when it was incorrect. 

This study is concerned with testing 
the ability to judge the recency of 
presentation of members of a small set 
of items which are repeatedly presented. 
The primary goal is the detailed speci- 
fication of the JOR function under con- 
ditions approaching a steady state. 


METHOD 


Materials—The items in the stimulus list 
presented to Ss for recency judgments were 
8 letters of the alphabet: Q, R, S, T, W, X,
Y, and Z. This set was chosen to minimize 
interitem similarity, both visual and acoustic, 
and still maintain a well-defined set of items. 
The stimulus set contains the last 10 letters 
of the alphabet except for U and V. 

The eight letters were arranged into a se- 
quence of 250 individual presentations ac- 
cording to the uniform lag algorithm devel- 
oped by Bjork (1966). The sequence con- 
structed by this algorithm distributes the N 
items of a stimulus list (here, N = 8) over a
series of M presentations (M = 250) so that
all intervals between presentations of the same
item, ranging in length from 1 to 2N - 1, are
approximately equally likely. The intervals 
(called lag values) are only approximately 
equally likely within any given sequence be- 
cause the random selection of lag values re- 
sults in some departure from a perfectly uni- 
form distribution. Therefore, one sequence 
of 250 presentations provides interpresenta- 
tion lags of 1 to 15 items inclusive, with an 
approximately equal number of occurrences 
of each lag value. One undesirable conse- 
quence of the algorithm is that whichever 
item is designated as the last item in the 
original stimulus list is used more often in 
the constructed sequence. An adjustment for 
the overrepresentation of the last item was 
made by rotating the eight letters through the 
eight stimulus list positions and constructing 
eight lists, so that each letter was the more 
frequent in one list, thus counterbalancing 
the overrepresentation across the eight items 
of the stimulus set. 
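
The counterbalancing step can be illustrated with a short sketch. It assumes that rotating the letters through the list positions means a cyclic shift of the ordered stimulus list; the name `rotated_lists` is hypothetical, and the uniform lag algorithm of Bjork (1966) is not reproduced here.

```python
LETTERS = ["Q", "R", "S", "T", "W", "X", "Y", "Z"]


def rotated_lists(letters):
    """Return one cyclic rotation of `letters` for each starting position."""
    n = len(letters)
    return [letters[k:] + letters[:k] for k in range(n)]


lists = rotated_lists(LETTERS)

# The last list position is the overrepresented one, so collect the
# letter that lands there in each of the eight rotated lists.
last_items = [lst[-1] for lst in lists]
```

Under this assumption each letter falls in the last position of exactly one list, which is the property the adjustment relies on.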

Subjects—The Ss were eight undergrad- 
uate students at Stanford University who 
were paid for their participation. Each S 
was tested individually and served for three
sessions, separated by at least 1 day.

Apparatus.—The letter sequences were pre-
sented on a Bina-View display cell pro-
grammed by a punched tape reader so that
one letter was presented every 3 sec. The S 
and the presentation equipment were in two 
adjoining soundproof rooms. The S viewed 
the display cell through a window and E 
recorded the lag judgments spoken by S over 
an intercommunication system. 

Procedure—The Ss were instructed that 
as each successive letter was presented they 
were to respond with a number from 1 to 15
indicating their best judgment of the recency
of the current test letter. The correct re-
cency judgment, they were instructed, is the
number of letters since the most recent prior
presentation of the test letter including itself
or, in other words, the number of intervening
letters plus one. The correct response to a
letter occurring immediately following itself 
would be “one”; with one intervening letter, 
“two”; etc. The Ss were instructed to 
respond “zero” to the first presentation of 
a letter within a sequence. After the initial 
presentation of each letter, all later presenta-
tions constituted both a test for its previous
exposure and another presentation for its 
later test. The S’s numerical judgments are 
called the “judged lags,” and they represent 
the basic data for JOR analysis. The begin- 
ning of a sample protocol might look like the 
following:


Letter presented:  R T W T T S R W R
Correct lag:       0 0 0 2 1 0 6 5 2
Judged lag:        0 0 0 2 1 0 8 4 3
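
The scoring rule for the correct lag can be written out as a small sketch. The function name `correct_lags` is hypothetical; the rule itself (intervening letters plus one, and zero for a first presentation) is taken from the instructions described in the Procedure.

```python
def correct_lags(sequence):
    """Return the correct lag for every presentation in `sequence`.

    The lag is the number of intervening items plus one, and 0 when
    the item has not appeared earlier in the sequence.
    """
    last_seen = {}
    lags = []
    for position, letter in enumerate(sequence):
        if letter in last_seen:
            lags.append(position - last_seen[letter])
        else:
            lags.append(0)
        last_seen[letter] = position
    return lags


# The letter string from the sample protocol.
protocol = "RTWTTSRWR"
lags = correct_lags(protocol)
```

Applied to the sample protocol, the function reproduces the "Correct lag" row digit by digit.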


After reading the instructions, S received a 
short practice sequence to ascertain that he 
understood the procedure. The first session 
then concluded with two complete sequences 
of 250 items. The remaining two sessions 
each consisted of three lists, each list re-
quiring about 12.5 min. The order of
presentation of the eight lists was counter- 
balanced across the eight Ss. Subtracting 
the eight initial exposures of each letter from 
250 total presentations leaves 242 observa- 
tions per sequence. With the eight lists 
for eight Ss, a total of 15,488 observations 
were gathered, distributed over the 15 lags. 
Multiplying the small deviations from a uni- 
form distribution in the original sequence, 
generated according to the uniform lag 
algorithm, by 64 repetitions of the sequence 
results in the number of observations for one 
lag ranging 576-1,408, with most of the lags 
having about 1,000 observations. 
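
The observation counts follow from simple arithmetic, sketched here as a check. The variable names are illustrative only; every number is taken from the Procedure.

```python
ITEMS_PER_SEQUENCE = 250
N_LETTERS = 8
N_LISTS = 8
N_SUBJECTS = 8
N_LAGS = 15

# First presentations are scored "zero" and carry no lag information.
scored_per_sequence = ITEMS_PER_SEQUENCE - N_LETTERS

# Every S received every list, so each lag structure recurs 8 x 8 times.
repetitions = N_LISTS * N_SUBJECTS
total_observations = scored_per_sequence * repetitions
mean_per_lag = total_observations / N_LAGS

# The reported extremes per lag are whole multiples of the repetitions.
min_per_sequence, min_remainder = divmod(576, repetitions)
max_per_sequence, max_remainder = divmod(1408, repetitions)
```

On this reading, a single sequence contained between 9 and 22 occurrences of any one lag, and the average of roughly 1,000 observations per lag corresponds to about 16 occurrences per sequence.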


RESULTS AND DISCUSSION


The average judged lag as a function
of the actual lag is shown in Fig. 1. 
The curve rises rapidly at first, with 
negatively accelerated slope, then be- 
comes approximately linear with mod- 
erate slope. The dashed diagonal line 
represents the “true” lag, i.e., perfect 
performance. The mean performance 
curve shows an initial overestimation of 
recency and a later underestimation, 
crossing the true line at approximately 
7.3. 

Fig. 1. Mean judgment of recency as a
function of the actual recency interval (lag).
(The dashed diagonal line represents per-
fect performance.)

Practice effects.—Without knowl-
edge of results, will recency judgments
systematically change as a result of
experience? The question of practice 
effects can be examined in two ways: 
within each sequence of 250 items and 
across successive sequences. Across 
sequences, the question is whether there 
is general improvement (or deteriora- 
tion) in performance with increasing 
experience. Within sequences, the 
question is whether performance shifts 
as a function of position within the 
presentation list, perhaps reflecting a
changing bias in JOR. First, across 
sequences, analysis of variance reveals 
a barely significant order of presenta- 
tion effect, F (7, 49) = 2.23, p < .05. 
The mean judgment across successive 
sequences shows a small increase over 
the first three sequences, but is rela- 
tively constant thereafter. Within se- 
quences, an analysis of variance found 
no significant order effect over succes- 
sive fifths within the combined eight 
lists for each S, F (4, 28) = 2.39. In
both these analyses, the effect of lag 
magnitude was highly significant, p < 
.001. Changes in mean JOR as a func- 
tion of experience are evident in these 
statistical analyses, but they appear to 
be small and, with the exception of a 
slight increase in the mean judgment 
over the first three sequences, do not 
follow a systematic trend. 

Probability of correct judgment—
One result derivable from the distribu- 
tion of recency judgments is the proba- 
bility of an exactly correct judgment. 
If a correct judgment can be said to 
represent “retention” of the actual lag, 
then this is a memory function. The 
probability of a correct judgment as 
a function of the lag in Fig. 2 bears a 
strong resemblance to typical retention 
functions. At Lag 1, performance is 
essentially perfect, but there is a sharp 
decline, reaching an asymptote of
chance performance (1/15) at about Lag
10.

Psychophysical analysis—The ex- 
perimental situation for testing JOR 
closely resembles time perception ex- 
periments involving an estimation pro- 
cedure. One form of this procedure 
requires S to make a numerical estimate 
of the time duration between two sig- 
nals. The JOR paradigm requires an 
estimate of the number of items since 
the last presentation of the test item. 
With regular presentation intervals, 
such as used in this study, there is a 
perfect correlation between elapsed time 
and lag number. It is not too surpris- 
ing, therefore, to find that JOR results 
are similar to those from studies of 
time perception.

Fig. 2. Proportion of exactly correct recency
judgments as a function of the lag.

In particular, a typical
result in time perception studies is
overestimation of the duration of short 
intervals and underestimation of the 
long intervals (Fraisse, 1963; Wood- 
row, 1951). This same result occurred 
in the JOR results (Fig. 1). The 
“indifference point,” the point at which 
the judgment curve crosses the true 
line, is known to be highly variable and 
subject to situational and individual 
differences (Boring, 1942; Woodrow, 
1951).

Given the methodological and em- 
pirical similarity between JOR and 
time estimation, it is worthwhile to 
attempt to apply to JOR the same 
theoretical interpretation which has 
been applied to time estimation. The 
most common description of time esti- 
mation results is in terms of the 
classical Weber-Fechner psychophysical 
function. The usual statement of the 
Weber-Fechner function is that the 
magnitude of the sensation (or re- 
sponse) is a logarithmic function of 
the stimulus intensity, or

    R = a log S + b.    [1]


This “law” has a long history of appli- 
cation to dozens of stimulus dimensions 
in hundreds of experiments with vary- 
ing degrees of success (Boring, 1942). 
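
For readers who want the step behind Equation 1, the usual textbook derivation is sketched below; it is not given in this article. It assumes Weber's law (the just noticeable increment in S is proportional to S) and assumes that each jnd adds a constant amount to the response R. The constant a' is introduced only for this sketch.

```latex
\frac{dR}{dS} = \frac{a'}{S}
\quad\Longrightarrow\quad
R = a' \ln S + b = a \log_{10} S + b,
\qquad a = a' \ln 10 .
```

The choice of logarithm base changes only the value of the slope constant, not the form of the function.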

Applying the Weber-Fechner func- 
tion to JOR data requires some as- 
sumptions about psychophysical re- 
cency judgments. First, the “sensa- 
tion” which is being judged is the lag 
number. Hence, one must assume that 
the sensation of recency is proportional 
to the lag number n. Second, the 
observed judgments, J, are assumed to 
be proportional to the cumulative num- 
ber of jnd’s for JOR. With these 
assumptions, the Weber-Fechner law 
for JOR may be written as 


    J(n) = c log n + d,    [2]


where c and d are the slope and inter-
cept constants.

Fig. 3. Mean, median, and variance of recency judgments as a function of the
lag. (The theoretical functions are the best fitting Weber-Fechner
functions. Curves labeled in the figure: variance, σ²(n) = 6.76 log n + 3.17;
mean, J(n) = 6.45 log n + 1.66.)

The fit of Equation 2 to the mean
recency judgments is shown in Fig. 3.
The parameters were determined by a 
least square criterion for the observed 
means for n = 2 to n = 15, with c =
6.45 and d = 1.66. The regression line
accounts for 99.5% of the variance 
observed in the JOR means. The stan- 
dard error of the estimate is .125. Most 
of the observed deviation is due to the 
first few points. The sum of the 
squared deviations from the prediction 
line is .191 and the mean absolute 
deviation is .084. Fits for individual 
Ss are almost as good: The regression 
line accounts for at least 95% of the 
variance in each one of the eight indi- 
vidual mean JOR curves. The semilog 
function also holds across stages of 
practice with only small shifts in the 
level of performance. 
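
A minimal sketch of what the fitted function predicts is given below. It uses only the reported constants c = 6.45 and d = 1.66. The base-10 logarithm is an assumption: it keeps the predictions inside the 1 to 15 response range, whereas natural logarithms would not. The function names are hypothetical.

```python
import math

C, D = 6.45, 1.66  # slope and intercept reported for the mean judgments


def predicted_mean_jor(lag):
    """Equation 2 with the reported constants (base-10 log assumed)."""
    return C * math.log10(lag) + D


def indifference_point(lo=2.0, hi=15.0, tol=1e-6):
    """Bisection for the lag where the predicted judgment equals the lag."""
    def excess(n):
        return predicted_mean_jor(n) - n

    while hi - lo > tol:
        mid = (lo + hi) / 2
        if excess(mid) > 0:
            lo = mid  # still overestimating: crossing lies to the right
        else:
            hi = mid
    return (lo + hi) / 2


predictions = {lag: round(predicted_mean_jor(lag), 2) for lag in range(2, 16)}
crossing = indifference_point()
```

Under these assumptions the fitted curve overestimates short lags, underestimates long ones, and crosses the true line a little above Lag 7, in line with the crossing read from the observed means.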

Also shown in Fig. 3 are the ob- 
served medians and variances for Lags 
2 to 15 and their corresponding least 
square fits. The medians, determined 
by linear interpolation from the judg- 
ment distribution for each lag, are often 
used in psychophysical measurement as 
the best estimate of central tendency 
when extreme skews are encountered, 
as they are in judgments of Lags 2, 3, 
and 4. The medians are also well de-
scribed by Equation 2, with all indi-
cators suggesting a fit as good as that
for the means; e.g., the standard error
of the estimated medians is .141. For
the variances, the best fit line approxi- 
mately parallels the function for the 
means, but this does not necessarily 
imply a direct relationship between the 
mean and variance of recency judg- 
ments. Part of the rise in the variance 
is attributable to the skewness in the 
short lag judgments which would tend 
to curtail some of the possible varia- 
bility. 

Stevens (1957) has proposed a new 
version of the psychophysical law which 
implies a power function relating sen-
sation and physical magnitude. The
log-log relationship implied by the 
power function has proved to be supe- 
rior in many instances to the semi- 
logarithmic relationship of the Weber- 
Fechner function (Stevens, 1957). 
This is not the case for JOR, however. 
Figure 4 shows the mean judgments 
plotted in log-log coordinates with the 
best fitting straight line. There are 
systematic departures from the best fit 
line: The predictions underestimate the
means for intermediate lags and over- 
estimate the means for the shorter and 
longer lags. The fit of the semilog 
Weber-Fechner function is clearly su- 
perior to the log-log Stevens power 
function. 
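
The two candidate functions differ in the coordinates that make them linear, which is why a straight-line fit in each plot serves as the test. In the notation of Equation 2, and with k and β as the conventional symbols for the power law (they do not appear in this article):

```latex
\text{Weber-Fechner:}\quad J(n) = c \log n + d
\qquad (\text{linear in } \log n)

\text{Stevens:}\quad J(n) = k\, n^{\beta}
\;\Longrightarrow\;
\log J(n) = \beta \log n + \log k
\qquad (\text{linear in log-log coordinates})
```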

A sensitive indicator of the adequacy 
of a psychophysical function is the 
proportional constant error (PCE), 
computed by dividing the amount of 
overestimation or underestimation in 
judgment by the stimulus value. The 
PCE for the JOR data and the pre- 
dicted PCE from the Weber-Fechner 
function and the power function are 
shown in Fig. 5. Again, the fit for 
the Weber-Fechner form of psycho- 
physical model is quite accurate and 
superior to the power law. 
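
The predicted PCE for the Weber-Fechner function can be sketched directly from the definition above. The constants are the reported fit to the means; the base-10 logarithm and the name `predicted_pce` are assumptions. The power function parameters are not reported in the text, so that curve is not reproduced.

```python
import math

C, D = 6.45, 1.66  # reported Weber-Fechner constants for the means


def predicted_pce(lag):
    """Proportional constant error: (predicted judgment - lag) / lag."""
    predicted = C * math.log10(lag) + D
    return (predicted - lag) / lag


pce_by_lag = {lag: predicted_pce(lag) for lag in range(2, 16)}
```

With these constants the predicted PCE is positive at short lags, passes through zero near Lag 7, and is negative at long lags, falling steadily over the whole range from 2 to 15.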

Fig. 4. Mean judgment of recency as a
power function (Stevens, 1957). (The theo-
retical equation is the best fitting log-log
function by a least squares criterion.)

Fig. 5. Proportional constant error (PCE) of the
mean judgment of recency and two theo-
retical PCE functions. (The theoretical
functions are based on the same parameters
used in the theoretical mean curves of Fig.
3 and 4.)


The experimental paradigm for JOR
used in the present investigation is
very similar to a hypothetical mecha-
nism postulated in a recent time esti-
mation theory (Treisman, 1963). Ac-
cording to this theory, a counter records
the number of pulses generated in a 
regular manner by a pacemaker. Via 
a decision mechanism, counter readings 
are translated into responses (time esti- 
mations), dependent upon the past 
history of the decision mechanism. To 
implement this model, among other re- 
quirements, it is necessary for Treisman 
to postulate a hypothetical pacemaker 
which produces regular pulses. The 
function of the pacemaker feature is 
identical to the experimental paradigm 
for JOR: events are presented at regu- 
lar intervals and Ss are asked to judge 
the number of intervening events be- 
tween particular events. In this way 
the JOR paradigm makes explicit one 
of the hypothetical components of 
Treisman’s timekeeping mechanism.
Another important feature of the Treis- 
man model is that it produces the 
Weber-Fechner form of the psycho- 
physical function relating time estima- 
tion to actual duration. This is also 
consistent with JOR results in the 
present investigation. Further investi- 
gation of JOR may help to develop time 
estimation models along the lines pro- 
posed by Treisman, particularly when 
JOR can explicitly evaluate processes 
which only can be inferred for time 
estimation procedures. 


OSGOOD DIMENSIONS AS AN ENCODING CLASS
IN SHORT-TERM MEMORY*

3 experiments were conducted to determine whether or not words
are encoded for short-term memory (STM) storage by the connota- 
tive meaning of the extremes of the 3 dimensions of the Osgood 
semantic differential. 50 experimentals and 50 controls served in 
each experiment. After reciting 3 words (a triad) twice and a 
backwards counting task, Ss attempted recall. On the 5th trial 
experimental groups were presented a triad from the opposite end of 
the same differential scale; control groups received a triad from their 
usual end of the dimension. In all instances the experimental 
groups excelled the control groups to a significant degree on the 
shift trial, or, in other words, a release of proactive inhibition (PI) 
occurred. The results imply that the words at each end of the
semantic differential scale are homogeneously encoded in STM and 
represent a psychological class differing from words at the other end
of the scale.


In the short-term memory (STM) 
situation employing the Peterson and 
Peterson (1959) technique, proactive 
inhibition (PI) builds up rapidly (Kep- 
pel & Underwood, 1962), reaching a 
maximum after three or four trials. 
It has been shown by Wickens, Born,
and Allen (1963), however, that the 
PI will be eliminated if the item pre- 
sented is different from that used in 
the previous trials. In that experiment, 
one group of Ss was presented with a 
series of consonant trigrams (CCCs) 
and another group with a series of 
three-digit numbers (NNNs); each 
group was then shifted to a test item of 
the other class. Both groups showed a 
marked improvement on the shift trial, 
performing as well on that trial as on 
the first one of the experiment. Sub- 
sequently, Loess (1967) has demon- 
strated the same build-up and release 
of PI when Ss shift from a homogene-
ous class of word triads (such as species
of birds) to a new class of word triads 
(species of trees). 

Together these experiments indicate 
that in the STM situation, triads or 
trigrams, all elements of which are 
homogeneous with respect to conceptual 
class, seem to be encoded not only as 
unique items, but also as members of 
the same conceptual class. If another 
item is drawn from a different con- 
ceptual class, then competition no 
longer occurs or, at least, is minimal. 
A reduction in amount of retroactive 
inhibition is also found in long-term 
memory if the originally learned list 
and the interpolated list are composed 
of items drawn from different classes 
of material (Friedman & Reynolds, 
1967; Postman, Keppel, & Stark, 
1965). 

The results of the Wickens et al. 
(1963) and of the Loess (1967) ex- 
periments suggest to the authors the 
possibility that the shift procedure may 
be used as something of a projective 
technique of organization in STM; a 
way of asking S what response classes
are being employed without requiring 
him to identify and label them. 

The semantic differential developed 
by Osgood, Suci, and Tannenbaum 
(1957) describes a three-dimensional 
verbal space identified by the three 
characteristics of Evaluation, Potency, 
and Activity. Using the Wickens et al. 
method, the present experiment asks 
whether or not meaningful verbal mate- 
rials are encoded in STM by the 
Osgood dimensions. 

The general plan of the experiment 
consisted of presenting word triads 
drawn from one end of a scale and then 
shifting to the other end, for the ex- 
perimental groups, while the control 
groups remained on triads drawn from 
the same scale end. If the two ends of 
the semantic scale represent different 
classes in STM storage, the accumu- 
lated PI should be reduced by a shift 
from one end of the scale to the other 
of the experimental groups, and the 
performance should be superior to that 
of the control groups. In this manner, 
the STM situation can be used as a 
partial validation of the Osgood se- 
mantic differential. 


METHOD 


Subjects—The Ss were 300 male and fe- 
male introductory psychology students at 
Ohio State University who chose this ex- 
periment to meet part of their course re- 
quirement. There were 100 Ss, 50 experi- 
mental and 50 control, on each of the three 
Osgood scales. 

Materials—Three separate experiments 
were conducted, one on each of the three
semantic differential dimensions: Evalua- 
tion, Potency, and Activity. Fifteen words 
were selected from the extreme ends of each 
dimension according to the profile of the word 
obtained by Heise (1965). A word was 
chosen if it fell on the extreme end of one 
dimension and was relatively neutral on the 
other two dimensions. No synonyms or 
antonyms of any words were used. Because 
similarity of word sounds may be responsible
for interference or noninterference effects
(Conrad, 1964; Wickelgren, 1965), an ef- 
fort was made to include only words with 
different initial sounds. However, to meet 
the criteria of extremity along the dimen- 
sion and to avoid the use of synonyms or 
antonyms, a few words were included which 
contained the same initial letter. These 
words, however, neither appeared in the same 
group of three words on a single trial nor in 
the same position in the group of words (i.e.,
the first, second, or third word) on subse-
quent trials.

Apparatus—The equipment used in the 
experiment consisted of a Carousel slide pro- 
jector, a Gerbrands tape timer, an electric 
metronome, and 2 × 2 in. slides on which
the experimental material was printed. 

Procedure—All materials were projected
on a wall about 3 ft. in front of S. Follow- 
ing an asterisk which appeared for 2 sec., the 
triad was presented for about 1.5 sec. The 
S recited it twice, the numerical slide 
appeared, and S counted backwards by 3s 
from a three-digit number to the beat of a 
metronome at a 1-sec. rate. The three-digit
numbers were selected so that a single digit 
never appeared in the same position in any 
three-digit sequence. 

At the end of 15 sec. of counting, a question 
mark signaled the end of the retention inter- 
val and the beginning of the 10-sec. interval 
to recall the three words. After an intertrial 
interval of 6 sec., an asterisk appeared for 
2 sec., serving as a ready signal for the next 
trial. 

On the fifth trial the experimental group 
was presented words (the critical test items) 
from the opposite end of the same differ- 
ential scale. Equal numbers of Ss changed 
from the ends of each dimension, positive to 
negative and vice versa. The control group 
received five trials of words from the same 
end of the dimension. 

Order of appearance of the trials was coun- 
terbalanced so that each group of words ap- 
peared equally as often on each of the five 
trials for different Ss. The appropriate con- 
trol S was run after each experimental S 
(i.e., the same triad was presented on the
critical fifth trial). To enable S to gain 
facility in counting backwards at the 1-sec. 
rate, a single three-digit number was given 
before the start of the experiment. The sub- 
traction numbers were counterbalanced so 
that the same number was neither in the 
same ordered position throughout the ex-
periment, nor followed the same triad.


RESULTS


The results of the experiment on the 
Evaluation dimension are presented in 
Fig. 1. As is usually true in these 
types of experiment, a reduction in 
accuracy of performance occurred pro- 
gressively during the first four trials 
for the experimental group, followed 
by a marked increment in performance 
on the critical shift trial. The control 
group, on the other hand, showed no 
such increment. 

A t test for independent groups with 
equal n’s indicated a significant differ- 
ence, t (98) = 3.39, p < .005, between 
performance on Trial 5 for the experi- 
mental group and for the control group. 
A t test for dependent groups indicated 
a significant difference, t (98) = 3.28, 
p < .005, between performance on Trial
4 and on Trial 5 for the experimental 
group and no difference for the control 
group, t (98) = .66. 
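
The independent-groups comparison can be sketched as follows. This is an illustration under stated assumptions, not the authors' computation: the function name `independent_t` is hypothetical, and the scores are invented placeholders (words recalled out of three), NOT data from the study. The sketch shows the pooled-variance t statistic and why two groups of 50 give 98 degrees of freedom.

```python
import math
from statistics import mean, variance


def independent_t(group_a, group_b):
    """Pooled-variance t statistic and df for two independent groups."""
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    pooled = ((n_a - 1) * variance(group_a)
              + (n_b - 1) * variance(group_b)) / df
    standard_error = math.sqrt(pooled * (1 / n_a + 1 / n_b))
    return (mean(group_a) - mean(group_b)) / standard_error, df


# Hypothetical Trial 5 scores for 50 experimental and 50 control Ss.
experimental = [3, 2, 3, 3, 2] * 10
control = [2, 1, 2, 3, 1] * 10

t_value, df = independent_t(experimental, control)
```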

Fig. 1. The results for the experimental and
control groups on the Evaluation scale.

Fig. 2. The results for the experimental and
control groups on the Potency scale.

The results for the Potency scale,
shown in Fig. 2, are essentially the
same as for the Evaluation scale, result-
ing in a significant difference, t (98)
= 3.27, p < .005, between retention on
Trial 5 for the experimental group and
for the control group; and a significant
difference, t (98) = 4.23, p < .005, be-
tween performance on Trial 4 and on
Trial 5 for the experimental group and
no difference for the control group,
t (98) = .47.

The results for the Activity scale
were similar with a significant differ-
ence, t (98) = 4.15, p < .005, between
performance on Trial 5 for the experi-
mental group and for the control group.
These results are plotted in Fig. 3.
Again a significant difference, t (98)
= 3.13, p < .01, was found between
performance on Trial 4 and on Trial 5
for the experimental group. However,
the results also indicated a significant
fall, t (98) = 1.87, p < .05, in perform-
ance for the control group from Trial
4 to Trial 5. This latter result might
be attributable to chance, or to some
characteristic of the words chosen for
this scale.

Fig. 3. The results for the experimental and
control groups on the Activity scale.


DISCUSSION


Since the three dimensions of the scale 
show approximately the same results in 
this experiment, they will be discussed to- 
gether. All groups show the usual PI ef- 
fects found in similar experiments (Kep- 
pel & Underwood, 1962; Loess, 1967; 
Wickens et al., 1963). Specifically, for 
the three scales combined, the percentage 
correct on Trial 1 was 95, and it dropped 
to 65 on Trial 4. 

All three experimental groups showed 
a significant increase in performance 
when they were shifted from one to the 
other end of the scale. This increase in
performance is interpreted as evidence of 
a reduction in PI as a consequence of the 
shift. The sharp PI release associated 
with a change in class of material has 
been demonstrated in the Wickens et al. 
experiment where the shift was from con- 
sonants to numbers and vice versa; and 
also by Loess (1967) when a shift was 
made from classes such as birds, vege- 
tables, and countries. It seems safe to 
conclude, on the basis of these experi- 
ments, that PI release will occur if there 
is a shift in materials from the class in 
which the PI had accumulated to another 
class. The results of the present experi- 
ment are, therefore, best interpreted by 
assuming that Ss will encode verbal mate-
rials by some meaning characteristic
which is associated with the extremes on 
each of the Osgood scales. The finding 
further suggests that the dimensions of 
the scale are in fact bipolar or at least 
that the sample of words chosen from the
two scale ends clearly represent different 
classes of connotative meaning. 

The build-up of PI during the preshift 
trials, together with the performance 
change resulting from the shift, implies, 
in addition, that the words at each end of 
the scale are relatively homogeneous with
respect to at least a portion of their
underlying meaning or the psychological
process which represents this portion.
Thus, in a sense, the results of the experi-
ment offer some support for the validity
of the bipolarity of the scale.

One possible interpretation of the in- 
crease in performance on the shift trial 
is that, because the material is new, it is 
perceived better and hence learned better. 
However, when Ss were questioned at 
the conclusion of the experiments on the 
Potency and Activity dimensions as to 
whether or not they perceived a difference 
in any of the trials, only a few of them 
answered in the affirmative. Thus, the 
improvement in performance consequent 
upon the shift was not usually associated 
with a mechanism of which S was aware,
or, at least, which he could verbalize. In 
other words, the Wickens et al. technique 
is sensitive to psychological structures 
within the individual which he himself 
does not readily recognize. 

The above fact seems to argue for an 
interference interpretation of the results 
rather than for a heightened perceptual 
interpretation, since one would expect that 
a perceptual heightening would be asso- 
ciated with an awareness of the external 
conditions responsible for the perceptual 
change. 


STIMULUS CATEGORIZING IN THE GENERALIZATION
OF A VOLUNTARY RESPONSE¹


ERIC ARONSON AND ALBERT ERLEBACHER


Northwestern University 


A test was made of the hypothesis that labeling the original stimulus 
in a voluntary response paradigm would either increase or decrease 
stimulus generalization depending upon the appropriateness of the 
label to the test stimuli. 3 different label conditions were employed. 
Results indicated a significant labeling effect, but only in the form 
of decreased generalization; there was no evidence for the enhance- 
ment of generalization due to labeling. These results were discussed 
in relation to previous research suggesting enhancement of generaliza-
tion due to labeling.


Indirect evidence for a stimulus label- 
ing effect in human stimulus generali- 
zation (SG) has been provided by 
Thomas and Mitchell (1962) and Na- 
kamura and Kaswan (1962). In these 
studies, the obtained SG gradients sug- 
gested that Ss were labeling the train- 
ing stimulus such that SG was enhanced 
or depressed, depending upon the ap- 
propriateness of the label to the test 
stimuli. Thus, Thomas and Mitchell 
(1962) found that when the original
stimulus was 510 mμ (a bluish-green),
maximum responding during SG test-
ing shifted from 510 mμ to 520 mμ (a
purer green). These results suggested
that Ss labeled the 510 mμ stimulus
“green” and responded more often to 
test stimuli of a greener hue because 
the label served as a response mediator. 

Thomas and DeCapito (1966) per- 
formed a slightly more direct test of 
the labeling hypothesis by having their 
Ss label the original stimulus, which 
was of a hue intermediate to two pri- 
mary hues. The SG gradients were 
shown to be predictable from the pri- 
mary color name which an S gave to 
the original stimulus.

¹ Research conducted during the senior

Stimulus generalization was enhanced when the test
stimuli were more representative of the 
label than was the original stimulus, 
and depressed when the test stimuli 
were less representative. A cause- 
effect relationship between labeling and 
SG cannot be inferred from this study, 
of course, because the labeling variable 
was not an experimental one. It can 
be argued, e.g., that some unknown 
mechanism is responsible for both label- 
ing and SG differences, or perhaps that 
differences in SG tendencies cause the 
variability in labeling. In the present 
experiment, E provided Ss with a label 
for the original stimulus. The original
and test stimuli were designed so that 
one label would be appropriate to the 
original stimulus and less appropriate 
to the test stimuli, while a second label 
would be more appropriate to the test 
stimuli than to the original. A third 
label, employed as a neutral control, 
was equally appropriate to all stimuli. 
This experimental procedure consti- 
tutes a direct test of the hypothesis that 
labeling the original stimulus affects 
subsequent SG behavior. 


METHOD 


Design—The original stimulus was a 
square rotated 45° from the normal upright 
position, so as to appear as a diamond. There
were five test stimuli, these also being squares
of the same size as the original stimulus, 
rotated 35°, 25°, 15°, 5°, and 0° from the 
normal upright position. Three independent 
groups of Ss were supplied with one of 
three labels for the original stimulus. The 
three labels which were applied to the 
original stimulus were “diamond,” “square,” 
and “form.” Across test stimuli (going from 
35° to 0°) there ought to be an increased 
tendency for Ss to attach the label square, 
a decreased tendency to attach the label 
diamond, and no change in the tendency to 
attach the label form to a stimulus. 

Subjects—The Ss were 120 introductory 
psychology students at Northwestern Univer- 
sity. Each was randomly assigned to one 
of the three treatment conditions. 

Apparatus.—The Ss were seated at a table 
directly facing a wooden panel which sepa- 
rated them from E. Stimuli were projected 
on a 24 X 53 in. rectangular window screen in
the center of the panel. The stimulus appeared
as a 1}-in. white square on a black
background. The response manipulandum 
was a telegraph key located on the table 
directly in front of S. 

Procedure—The Ss were instructed to respond
as quickly as possible, by raising a
finger off the key when they saw the diamond,
square, or form. The label occurred
four times in the course of these instructions.
Each S then received three training trials
with the original stimulus. The pretest instructions
immediately followed these trials.
The Ss were instructed to respond to the 
exact same stimulus they had just seen. 
Actually, half of the Ss received instructions 
which placed heavier emphasis on a negative 
response set (not responding to stimuli other 
than the original). The effect of this vari- 
able has been omitted from the present 
analysis. 

The test series consisted of five consecutive 
sequences of each of the five test stimuli, the 
sequences being taken from a randomly 
chosen Latin square. These sequences were 
also counterbalanced across Ss, so that an 
equal number of Ss in each condition received 
one of five sequence orders. Eight booster
trials, i.e., further original stimulus presentations,
were included in the test series. So
that S would not be likely to discover a recurring
pattern to the series, from one to
three booster trials occurred after each of the
first four test stimulus sequences. Exposure
time for all stimuli was 4 sec., with 10 sec.
intervening between stimulus presentations.
The E called out “ready” to alert S to the
forthcoming stimulus presentation, the length
of this foreperiod varying randomly from 3
sec. to 5 sec. A response was recorded only 
if S released the key within 1 sec. after the 
onset of a stimulus. 
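
The construction of the test series described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the report does not say how the Latin square was drawn or how the eight booster trials were distributed, so the helper names (`random_latin_square`, `booster_counts`, `build_test_series`) and the sampling scheme are hypothetical.

```python
import random

STIMULI = [35, 25, 15, 5, 0]  # rotation (degrees) of the five test squares
ORIGINAL = 45                 # orientation of the original "diamond" stimulus


def random_latin_square(items, rng):
    """Return a Latin square over items (assumed method: shuffle a cyclic square)."""
    n = len(items)
    base = [[(r + c) % n for c in range(n)] for r in range(n)]
    rows = rng.sample(range(n), n)      # random row order
    cols = rng.sample(range(n), n)      # random column order
    symbols = rng.sample(items, n)      # random assignment of stimuli to symbols
    return [[symbols[base[r][c]] for c in cols] for r in rows]


def booster_counts(rng, gaps=4, total=8, low=1, high=3):
    """Draw 1 to 3 boosters for each of the first four sequences, summing to 8."""
    while True:
        counts = [rng.randint(low, high) for _ in range(gaps)]
        if sum(counts) == total:
            return counts


def build_test_series(seed=0):
    """Five sequences of the five test stimuli with booster trials interleaved."""
    rng = random.Random(seed)
    square = random_latin_square(STIMULI, rng)
    boosters = booster_counts(rng)
    series = []
    for i, row in enumerate(square):
        series.extend(row)
        if i < len(boosters):
            series.extend([ORIGINAL] * boosters[i])
    return series


series = build_test_series()
```

Under these assumptions each series contains 25 test presentations plus 8 booster presentations, 33 trials in all.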


RESULTS AND DISCUSSION


An S’s score on each of the five test 
stimuli was simply the number of re- 
sponses given out of a possible five. 
An arc-sin transformation was per- 
formed on these data to satisfy analysis 
of variance assumptions. 
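
As a rough illustration of the transformation just mentioned, the sketch below converts a response count into an arc-sine score. The exact variant used is not stated in the report; expressing the angle in degrees as arcsin of the square root of the proportion is an assumption, chosen because the transformed means are later reported in degrees. The function name is hypothetical.

```python
import math


def arcsine_transform(responses, possible):
    """Arc-sine score in degrees for a proportion of responses (assumed variant)."""
    proportion = responses / possible
    return math.degrees(math.asin(math.sqrt(proportion)))


# Scores for 0 through 5 responses out of a possible five.
scores = [arcsine_transform(k, 5) for k in range(6)]
```

On this variant the scale runs from 0 degrees (no responses) to 90 degrees (a response on every trial), and differences near the extremes are stretched relative to the raw proportions.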

The SG gradients for the three label 
conditions are presented in Fig. 1. The 
main effect of labeling is significant,
F (2, 60) = 5.25, p < .01. This effect
is clearly due to less responding in the 
diamond condition, the square and form 
groups being roughly equal in response 
level. A trend analysis over the test 
stimuli indicates that the label by devia- 
tions from linearity interaction is also 
significant, F (6, 240) = 2.40, p < .05. 
This effect is indicated by the greater 
curvature in the “diamond” gradient. 
Since the response measure has a lower 
limit of zero responses, this curvature 
DECREASED GENERALIZATION FROM STIMULUS LABELING


may merely reflect a basement effect 
in the data, suppressing what might 
otherwise be a steeper, and essentially 
linear diamond gradient. The differ- 
ence between the diamond and the other 
two conditions is certainly to be attrib- 
uted to generalization since the gradi- 
ents have different shapes and there 
are no appreciable differences in re- 
sponse levels to the original stimulus. 
Based on eight such original stimulus 
presentations, the mean responses in 
degrees were 88.73, 78.96, and 76.91 
for the diamond, square, and form 
groups, respectively, F (2, 117) =
1.27, p > .10. It thus appears that
labeling does affect SG, supporting the 
interpretation of Thomas and Mitch- 
ell (1962), Nakamura and Kaswan 
(1962), and Thomas and DeCapito 
(1966), although these data provide 
no evidence for the enhancement of SG 
due to labeling—the square condition 
produced no more SG than the form 
condition. 

Thomas and DeCapito (1966) have 
suggested that labeling the original 
stimulus can enhance, as well as depress,
SG. It is possible, of course, that
the test stimuli in the present experiment
do not in fact produce an increased
tendency to attach the label
square; yet there is no apparent reason
why the assumption is any less valid
for this case than for colors, as em- 
ployed by Thomas and DeCapito 
(1966). Given the difficulty of remem- 
bering a color without a verbal label, 
however, it may be that spontaneous 
labeling is most likely to occur in the 
color-stimulus situation. The Ss may
use a label which increases “errors” 
in a test series because they have no 
good recall for the original color other 
than that provided by the label. As 
long as S can recall the original stimu- 
lus independently of its verbal referent 
(i.e., form a visual image), he has some 
basis for knowing when he is commit- 
ting errors, and thus is not likely to 
produce a mediated similarity effect. 


College Ss performed memory search through circular sequences.
Stimuli were presented and the target was rule defined as a trans- 
formation a specified number of steps away from the stimulus. 
Reaction time (RT) was related to transformation size (−2 through
+3), transformation direction (positive or negative), and to materials 
transformed (letters or digits). In each case there were statistically 
significant effects. It was predicted that RT would be a linear 
function of the number of sequence steps between stimulus input and 
required response. The prediction was only partially confirmed. For 
some combinations of condition and practice RT was linear, indi- 
cating a serial search process in which a constant amount of time 
is required for each item in a sequence; at other times there were 
significant departures from linearity. Practice effects included both 
a decrease in total RT across trial blocks and a change in the slope 
of the RT functions. Peak search rates of about 3-5 characters per
second were observed, suggesting that search through circular sequences
requires relatively deep processing of sequence items.


It is possible to specify four dis- 
tinct task variants that require search- 
ing through lists or sequences: (a) 
External search for a particular target 
or set of particular targets. The proto- 
type would be the work of Neisser 
(1963) and Neisser, Novick, and Lazar 
(1963) in which Ss search for, say, a 
letter K embedded in a visually pre- 
sented list of other letters. With this 
type of task Ss are capable of process- 
ing letters at rates in excess of 40/sec 
(based on Neisser, 1963). (b) Ex- 
ternal search for a rule-defined target. 
In this case the target might be defined 
as any proper name or any animal name 
(Neisser & Beller, 1965) that appears 
in a visually presented list of words. 
Search rates are estimated at about 4 
words/sec. (c) Internal search for a 
particular target. Here Ss store in 
memory a short list of digits and then 
indicate whether a probe stimulus is or 
is not present in the stored set (Morin,
Derosa, & Stultz, 1967; Sternberg, 
1966, 1967). Sternberg (1966) 
assumes exhaustive sequential search 
and estimates search rates at about 
25 to 30 digits per second. (d) 
Internal search for a rule-specified 
target. The present experiment 
deals with this category. Stimuli 
are presented and the target is rule 
defined as a transformation a specified 
number of steps away from the stimulus.

A transformation as employed here 
has the meaning of a change from one 
specifiable state to another (Weber & 
Woodward, 1966). Transformations 
of various sizes are applied to the 
separate stimulus items of a circular, 
or continuous, sequence. The result 
of a transformation is a target or term- 
inal response, also in the sequence, a 
given distance away from the starting 
state. The steps required in going
from the starting stimulus to the terminal
response are said to be searched
through. By varying the size of the
transformation it then becomes possible 
to vary the amount of search required 
(at least as a lower bound) and assess 
the rate at which it takes place. 

The variables employed include size 
and direction of transformation and 
type of stimulus materials. Of par- 
ticular interest is the form of the func- 
tion relating RT and size of transfor- 
mation. If linear search occurs, as pre- 
dicted, the function will be linear, but 
predictions of magnitude of effects 
cannot be made in advance. 


METHOD 


Subjects—The Ss were 48 male under- 
graduate volunteers from Kenyon College. 
They were assigned at random, 8 per ex- 
perimental group. 

Apparatus and materials—The apparatus 
consisted of a Gerbrands Model B electronic 
voice key with two microphones and a Stand- 
ard Electric timer calibrated to .01 sec. The 
materials were circular sequences of five 
digits (3, 4, 5, 6, 7, 3, 4, . . .) and five letters
(E, F, G, H, I, E, F, . . .). The sequences
were printed in circular array on 4 × 6 in.
cards with unidirectional arrows separating 
sequence items, as shown in Fig. 1. 

These particular letter and digit sequences 
were chosen because the experiment required 
verbal production of stimuli and responses, 
and early letters of the alphabet can be 
readily confused in an aural setting. Hence 
the letter sequence in Fig. 1 represented the
earliest alphabetic sequence not likely to 
lead to perceptual confusions. Since there 
is evidence (Fitts & Switzer, 1962) to indicate
that familiarity with a subset of materials
influences RT, it was also decided to
choose a relatively unfamiliar subset of digits,
in order to make the two types of stimulus
materials more comparable.

FIG. 1. Letter and digit sequences presented
to Ss on 4 × 6 in. cards during instructions.

Procedure—A 6 × 2 factorial design was
employed, with the first factor subsequently
broken into positive and negative components.
The six independent groups were T-2, T-1, T0,
T1, T2, T3, corresponding to type of input
transformation that was required. In addition,
there was the repeated-measures variable
which was type of material making up
the sequence, letters vs. digits. Within a
block of 10 items the five digits and the
five letters were presented aurally in a predetermined
random order. There was a total
of 20 blocks for each S. Immediately on
presentation of a stimulus item S was to
produce the appropriate verbal response,
which differed from group to group.

The Ss in Group T0 had only to repeat
the identical letters and digits after they were
spoken by E, a zero transformation of the
stimulus items. Group T1 was instructed to
give the letter or digit, as the case might be,
that was one step forward in the direction of
the arrows. Thus if successive stimuli were
"B72 “BS” 3. 35,5 was tovsay: “4” (Go) 
respectively, in order to obtain correct responses.
For Group T2 each stimulus item
was to be transformed two forward steps in
the direction of the arrows. For Group T3
the correct responses were three forward
steps away from the stimuli. In the negative
conditions, T-1 and T-2, the transformation
was in the direction opposite the arrows, i.e.,
opposite the direction normally learned. A
T-3 condition was not run because its input-output
relations between stimulus and response
would have been identical to T2.
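
The transformation rule can be expressed as modular arithmetic on position in the five-item circular sequence. The sketch below is illustrative only; the function name `transform` and the list representation are assumptions, not part of the original procedure.

```python
DIGITS = ["3", "4", "5", "6", "7"]
LETTERS = ["E", "F", "G", "H", "I"]


def transform(stimulus, steps):
    """Return the item `steps` positions from `stimulus` in its circular sequence.

    Positive steps follow the arrows; negative steps go against them.
    """
    sequence = DIGITS if stimulus in DIGITS else LETTERS
    position = sequence.index(stimulus)
    return sequence[(position + steps) % len(sequence)]


# One step forward wraps around at the sequence break.
forward_examples = [transform("3", 1), transform("7", 1), transform("I", 1)]

# With five items, three steps backward and two steps forward coincide.
same_pairs = all(transform(s, -3) == transform(s, 2) for s in DIGITS + LETTERS)
```

Because the sequences contain five items, any transformation is equivalent to one that differs from it by five steps, which is why a three-step negative condition would duplicate the stimulus-response pairs of a two-step positive one, and why the two-step negative and three-step positive groups share identical pairs.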

With the foregoing procedure, stimulus- 
response digram frequencies could not be 
held constant across all groups. Instead the 
resulting digram frequencies for letters were 
assessed by recourse to normative data 
(Mayzner & Tresselt, 1965). Based on a 
20,000-word sample the mean digram frequency
for each group, T-2 through T3, was,
respectively, 148.40, 56.20, 115.40, 280.80,
670.60, and 148.40.

Instructions to Ss emphasized both ac- 
curacy and speed. Figure 1 displays were 
present during instructions, but they were 
removed as soon as the instructions were 
terminated. Each S was run individually 
and was informed of errors with a cor- 
rection made by E; but no knowledge of RT 
was given. The foreperiod (between the 
click of the relay reset button on the voice 
key and vocal presentation by E of a letter 
or digit stimulus) varied unsystematically 
from 1 to 6 sec. The intertrial interval (between
S’s response and pressing the reset
button) was approximately 12 sec.; during 
this time E recorded on a prepared sheet the 
RT to the preceding stimulus. Only RT for 
correct choices was recorded. Errors 
(wrong responses or “ahs”) were tabulated 
separately as check marks. 


RESULTS


Descriptive statistics for RT as a 
function of transformation condition 
and materials are presented in Table 1. 
Mean RTs in Table 1 were determined 
by averaging over number of correct 
choices and number of Ss. 

Both size of transformation and type 
of stimulus material proved to be effective
variables. For the positive transformation
groups (T1, T2, T3) RT
increased monotonically for letters and
for digits. For the negative transformation
groups (T-1, T-2) RT
also increased monotonically with size
of transformation for both letters and
digits. Except for Group T0 there was
a substantial difference in RT for type 
of stimulus materials. Digits were 
processed consistently more rapidly 
than letters. The form of interaction 
observed indicated that the larger the 
absolute size of transformation the 
greater the difference between letters 
and digits. Significance tests confirmed
the foregoing. A 6 × 2 analysis
of variance revealed a significant size
of transformation effect, F (5, 42) =
54.55, p < .001, a significant materials
effect, F (1, 42) = 198.45, p < .001,
and the significant interaction of Transformation
Size × Stimulus Materials,
F (5, 42) = 17.15, p < .001.


TABLE 1

RT DESCRIPTIVE STATISTICS AS A FUNCTION
OF MATERIALS AND TRANSFORMATION
SIZE AND DIRECTION

                               Group
Materials    T-2     T-1     T0      T1      T2            T3

Letters
  M*         1.88    1.36    0.36    0.77    1.41          [illegible]
  SD         0.37    0.18    0.06    0.14    0.21          0.32

Digits
  M*         1.10    0.89    0.35    0.62    [illegible]   [illegible]
  SD         0.13    0.10    0.06    [illegible]   [illegible]   [illegible]

* Means determined by averaging over number of Ss (N = 8 for each group) and number of correct responses.

With respect to direction of transformation,
comparison of Groups T1,
T2 vs. Groups T-1, T-2 showed larger
RTs for negative transformations, irre- 
spective of whether letters or digits 
were considered. Separate Scheffé 
tests for the two types of stimulus ma- 
terials indicated significant differences 
for letters, F (1, 42) = 135.02, and for 
digits, F (1, 42) = 13.76. These F 
values exceeded the required F values 
at the .01 and .05 levels, respectively. 
Also comparison of Groups T-2 and
T3, which required identical S-R pairs,
showed by the Scheffé test a significant
difference for digits, F (1, 42) =
13.76, p < .05, but not for letters,
F (1, 42) = 1.57, p > .05. 

Examination of Fig. 1 indicates that 
the stimulus sequences depart from the 
normally learned order of letters and 
digits in that they have “sequence 
breaks” between “I” and “E” and be- 
tween “3” and “7.” Those stimulus 
items on the one side of the sequence 
that require a response on the other 
side of the sequence may be termed 
“sequence break” items; remaining 
items with the usual letter or digit 
order between stimulus—response pairs 
may be termed “normal” items. It was 
possible that the effects obtained in 
Table 1 occurred because of the greater 
number of sequence break items in the 
larger transformation conditions. A 
separate partitioning of data into sequence
break vs. normal items is shown
in Fig. 2. Sequence break items show
larger RTs, but the general form of the
function for normal items is much the
same as for all items in Table 1. Sequence
break items did differ from normal
items in that they produced more
errors. Because error data was for 
the most part comparable to the RT 
data, except for being more variable, 
it is not reported here. Significant de- 
partures from linearity in the top 
panels of Fig. 2 are starred. All such 
departures are for negative transforma- 
tion conditions. 

Finally, it is noted that there are 
practice effects evident in Fig. 2. 
There is an overall reduction in RT 
across blocks. In particular, the RT 
functions flatten over blocks; hence,
search rates increase with practice.
Peak search rates of about three and
five characters per second are observed
for positive letters and digits, respectively,
in Block 20. (This estimate
was obtained by finding the reciprocal
of the slopes.)
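
The reciprocal-of-slope estimate can be illustrated with a short least-squares sketch. The Block 20 values are not tabulated in the report, so the example below uses the overall letter means from Table 1 for T0, T1, and T2 (0.36, 0.77, and 1.41 sec.) instead; including the zero transformation in the fit is an assumption, and the function name is hypothetical. The resulting rate is therefore lower than the peak rate quoted for Block 20.

```python
def least_squares_slope(xs, ys):
    """Slope of the ordinary least-squares line through the points (xs, ys)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    numerator = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    denominator = sum((x - mean_x) ** 2 for x in xs)
    return numerator / denominator


steps = [0, 1, 2]                 # transformation size in sequence steps
letter_rt = [0.36, 0.77, 1.41]    # overall mean RT (sec.) for letters, Table 1

slope = least_squares_slope(steps, letter_rt)   # seconds per sequence step
search_rate = 1.0 / slope                       # characters per second
```

With these overall means the slope is 0.525 sec. per step, or about 1.9 characters per second; the flatter functions late in practice yield the higher peak rates reported above.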


DISCUSSION


The primary effects of transformation 
size, direction, and materials are not sur- 
prising. Neither is the interaction of 
Size X Materials. But the magnitudes of 
effects and their functional forms are not 
obvious; nor is an explanation thereof. 

With the exception of the departures 
from linearity in Fig. 2 it would seem that 
S searches through the sequences one step 
at a time at a fairly constant time per item
until the terminal item is reached; S 
then responds overtly. One way the ter- 
minal item can be identified is if S 
engages in some kind of metered search 
through the sequence. An overt response 
is made when the appropriate “meter read- 
ing” shows up. This explanation seems 
compatible with the data and is in accord 
with introspective evidence. The one 
difficulty would be the departures from 
linearity in Fig. 2. The departures are 
mostly in the same direction (concave 
down) and may suggest that S’s search 
speed increases as he travels through the 
sequence. A within-Ss design for size of 
transformation would have bearing on 
this point.

An alternative explanation is some 
variety of short-circuiting in which S 
does not process intervening items be- 
tween starting state and terminal re- 
sponse; he simply bypasses them by pair- 
ing the ends of the stimulus—response 
sequence. Although this is possible, ex- 
amination of RTs for individual Ss on 
individual displays shows no indication 
of a discontinuous and sustained shift in 
performance that might be expected if 
short-circuiting occurred. Moreover there 
is little relation between the digram frequencies
cited earlier and the results of
Table 1 and Fig. 2. If short-circuiting
occurred, it ought to be helped by high-frequency
stimulus-response pairings; this
is not the case.

Because Fig. 2 indicated practice ef- 
fects on both overall RT and the slopes 
of the RT functions there is evidence that 
the memory search studied here is differ- 
ent from the memory search studied by 
Sternberg (1966, 1967) who found no evidence
for practice effects. Another important
difference concerns search rates.
Sternberg obtained search rates of 25 to
30 characters per second, in marked contrast
to the peak rates of 3 to 5 characters
per second obtained here. However the
present search rates are quite comparable
to (a) the rates obtained in searching
through word lists for words of a given
meaning class (Neisser & Beller, 1965)
and (b) the rates for silent and oral 
speech (Landauer, 1962) where every 
word is processed. The rate differences 
seem to reside in the depth of processing 
required. If only a few distinctive fea- 
tures of a character must be processed 
(as in Sternberg’s recognition task), the 
rate is much higher than if the entire 
character must be generated or synthe- 
sized (Neisser, 1967) as seems to be the 
case here. 


INITIAL IMPROVEMENT IN SIMPLE REACTION TIME


IN-MAO LIU AND SHANG-WU KUO


National Taiwan University 


In order to study the factors responsible for the improvement occur- 
ring in the initial few trials of a simple reaction time (RT) task, 
9 experimental groups of 20 Ss each had 3 kinds (hearing alone, 
pressing alone, and responding to a light) and three degrees (1, 3, 
and 10 trials) of pretraining. A control group of 20 Ss was given 
0 amount of pretraining. In the subsequent simple RT task, Ss 
had to press a key on hearing a tone. It was found that the groups 
that received 3 and 10 hearing alone pretraining trials started with 
shorter mean RTs and then took about the same number of trials to 
reach the performance asymptote (compared with the control group) 
and that the groups given 3 and 10 pretraining trials of responding 
to the light reached the asymptote on the 2nd trial. The results 
pointed to the operation of 2 processes: information-checking mechanism
and acquisition of responding to stimulus onset.


In simple reaction time (RT) experiments,
though the average S continues
to improve for several hundred
trials spaced out over several days, the
largest improvement usually occurs in
the first few trials. A question of
interest is why a simple task such as
simple RT requires a few trials for the
initial improvement.

The concept of uncertainty, as recently
applied to RT studies (Hick,
1952; Hyman, 1953; etc.), cannot be
used directly to explain the initial improvement
taking place in simple RT
training. This is because the uncertainty
associated with either the stimulus
or response per se in simple RT
is clearly zero, though there is a certain
amount of time uncertainty with
respect to the stimulus onset (Klemmer,
1957).

On the other hand, starting with an
analysis of events in a simple RT situation
seems promising. Quite early in
the history of RT work, Lange (cited
in Woodworth & Schlosberg, 1954)
found it possible to train S (a) to be
prepared to receive the stimulus, (b)
to be prepared to make the response,
or (c) to make the response on reception
of that stimulus. In other words,
S could also be trained to concentrate 
his attention either on the stimulus to 
be received or the response to be made. 
Later studies indicated that S is not 
likely to follow such instructions con- 
sistently unless the experimental condi- 
tions favor one or the other type of 
preparation. 

However, S’s preparation can be 
manipulated directly by E through a
procedure somewhat analogous to that 
of sensory preconditioning (Brogden, 
1939). During pretraining trials (ana- 
logous to preconditioning trials) S 
practices only some part of what con- 
stitutes a simple RT task. The effect 
of this specific pretraining is then deter- 
mined. 

By using the above procedure, there 
are three possibilities to account for the 
initial improvement in simple RT 
training. One possibility is that in the 
course of practice the stimulus only 
loses its novelty, since the response of 
pressing a key is so simple. The 
orientation reaction associated with a 
novel stimulus is known to cause 
simultaneous vasoconstriction in the 
extremities and vasodilation in the 
head. Sokolov (1963) believes that
this represents the diversion of blood
from the periphery to the brain, which
must be ready to analyze incoming information
first and then organize
action. As a consequence of this effect,
the reaction time to a novel stimulus 
may be longer. If this explanation is 
correct, then S’s experience with the 
stimulus alone during the pretraining 
trials will have an effect on the later training
trials. Another alternative is that
repeating the response of pressing a 
key alone contributes to the initial im- 
provement, since pressing a key may 
not be so simple a task as one might 
expect. The second possibility can be 
easily tested by requiring S to practice 
pressing a key alone (without any 
stimulus presented) on the pretraining
trials. Still another alternative is
that S, in the course of practice, learns
to respond to the general nature
of the stimulus onset (not necessarily 
restricted to the onset of the stimulus in 
question), which cannot be explained 
in terms of experience with either stim- 
ulus or response alone. The third pos- 
sibility can be assessed by requiring S 
to press a key to another stimulus 
during the pretraining trials. 


METHOD 


Subjects—The Ss were 200 junior high 
school girls. They had not served in any
psychological experiment before. Each S
was rewarded with a piece of rock candy
for participation in the experiment.

Apparatus and procedure—The tones and
lights used as stimuli were produced by a
Maico Audiometer through the right ear- 
phone and a 45-w. lamp through a red glass 
of 6 cm. in diameter, respectively. The stim- 
ulus durations and intertrial intervals were 
controlled by Grason-Stadler Programming 
Equipment. 

According to order of their appearance, Ss 
were divided into one control and nine experimental
groups of 20 Ss each. The S
was seated in a soundproof room. An air-conditioner
in the room produced a constant
level of noise. An S in the control group
was instructed to press a telegraph key as
soon as possible on hearing a tone (200 msec.,
75 db., with reference intensity of .0002 dyne/cm²).
An S in the experimental groups was
told that she would receive practice trials 
first, followed by formal experimental trials. 
As soon as the practice trials were over, the 
message of “formally start” was given to S. 
The same average intertrial interval of 10 
sec. (the range being 8-12 sec.) was used 
throughout. Therefore, there was no inter- 
ruption between the practice (pretraining) 
session and the experiment proper (training) 
session. 
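
For readers who wish to convert the stated tone level into physical units, the following sketch applies the standard decibel relation between a sound pressure and its reference pressure. It is offered only as worked arithmetic; the function name is hypothetical and the report itself gives no pressure value.

```python
REFERENCE_PRESSURE = 0.0002  # dyne/cm^2, the reference intensity given in the text


def level_to_pressure(level_db, reference=REFERENCE_PRESSURE):
    """Sound pressure (dyne/cm^2) for a level in db. re the reference pressure."""
    return reference * 10 ** (level_db / 20)


tone_pressure = level_to_pressure(75)  # the 75-db. tone used as the RT stimulus
```

On this relation a 75-db. tone corresponds to a pressure of roughly 1.12 dyne/cm².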

There were three kinds and three degrees 
of pretraining to form a 3 X 3 factorial design 
by using the nine experimental groups. As 
to the first kind of pretraining (E-t groups), 
S was presented the tones a certain number 
of times (1, 3, or 10, as a degree of pre- 
training, resulting in Group E-t-1, E-t-3, or 
E-t-10, respectively). During the pretrain- 
ing trials, S was instructed not to press the 
key. During the training trials, S was to 
press the key as soon as possible on hearing 
the tone. With respect to the second kind
of pretraining (E-p groups), S was only to
press the key a certain number of times
(again, 1, 3, or 10, and given the group 
abbreviation of E-p-1, E-p-3, or E-p-10, re- 
spectively). In order to keep the average 
intertrial interval of 10 sec. for S’s pressing 
of the key, S was to press the key when E
delivered “now press.” The E’s voice was 
as soft as possible, and S was not instructed 
to press the key as soon as possible on hear- 
ing E’s voice. As for the last kind of pre- 
training, E-(1-p) groups, S was instructed to 
press as soon as possible on seeing the 200-msec.
light for a certain number of trials—1,
3, or 10, giving rise to Group E-(1-p)-1, 
E-(1-p)-3, or E-(1-p)-10, respectively. 

Each S was run 20 trials altogether. This 
means that if S received 3 pretraining trials 
of a certain kind she was given 17 training 
trials. Of course, S in the control group re- 
ceived no pretraining trial and received 20 
training trials. 
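
The allocation of trials across the ten groups can be laid out as in the sketch below. The group labels follow the abbreviations used in the text; the dictionary layout and the function name are assumptions made for illustration.

```python
from itertools import product

TOTAL_TRIALS = 20
KINDS = {
    "t": "hearing the tone alone",
    "p": "pressing the key alone",
    "(1-p)": "pressing the key on seeing the light",
}
DEGREES = [1, 3, 10]  # number of pretraining trials


def build_groups():
    """Map each group label to its pretraining and training trial counts."""
    groups = {"control": {"pretraining": 0, "training": TOTAL_TRIALS}}
    for kind, degree in product(KINDS, DEGREES):
        groups[f"E-{kind}-{degree}"] = {
            "pretraining": degree,
            "training": TOTAL_TRIALS - degree,
        }
    return groups


groups = build_groups()
```

Every group receives at least 10 training trials, so the first six training trials analyzed in the results are available for all groups.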


RESULTS 


All groups showed no further im- 
provement after six training trials, the 
asymptotic performance being in the 
neighborhood of 300 msec. This is 
about 100 msec. higher than that usu- 
ally reported for tone stimulus of 
medium intensity. The reasons for the 
higher RT might be (a) no ready signal
was used, (b) stimulus was presented
to one ear alone, and (c) Ss
were junior high school girls instead
of adults. Therefore, the data of the
initial six trials for each group were
used in the present analysis. An
analysis of variance applied to the data 
of the sixth trial of all groups indi- 
cated that there was no difference 
among the group mean RTs, F (9, 
190) = .30, meaning that, after six 
training trials, all groups reached more 
or less the same performance asymptote 
irrespective of the various treatments 
prior to the training trials. 

Effects of differential treatment dur- 
ing pretraining—When confined to the 
data of the experimental groups, ac- 
cording to an analysis of variance, 
Kind of Pretraining, Degree of Pre- 
training, and Training Trial were all 
significant, F (2, 171) = 3.43, p < .05;
F (2, 171) = 4.94, p < .01; and F (5,
855) = 60.48, p < .01, respectively.
The interaction of Kind of Pretraining 
X Training Trial was also significant, 
F (10, 855) = 2.30, p < .05. 
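
The degrees of freedom reported for these tests follow from the group sizes and the six training trials analyzed. The sketch below reproduces them for a groups-by-trials design with repeated measures on trials; the function name and the dictionary keys are hypothetical.

```python
def mixed_anova_df(n_groups, n_per_group, n_trials):
    """Degrees of freedom for a groups x trials design, trials repeated within Ss."""
    between_error = n_groups * (n_per_group - 1)
    trial = n_trials - 1
    return {
        "groups": n_groups - 1,
        "between_error": between_error,
        "trial": trial,
        "groups_x_trial": (n_groups - 1) * trial,
        "within_error": between_error * trial,
    }


# Nine experimental groups of 20 Ss over six training trials.
nine_groups = mixed_anova_df(n_groups=9, n_per_group=20, n_trials=6)

# One experimental group compared with the control group.
pairwise = mixed_anova_df(n_groups=2, n_per_group=20, n_trials=6)
```

For the nine experimental groups the error terms have 171 and 855 degrees of freedom, and each three-level factor (kind or degree of pretraining) has 2, so its interaction with the six trials has 10. A two-group comparison gives 1 and 38 between Ss and 5 and 190 within Ss, matching the values reported for the comparisons with the control group.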

Pretraining by hearing alone.—Since 
different kinds of pretraining produced 
significant treatment effects, the authors 
are now in the position to assess the 
significance of the effect of each kind of 
pretraining in the light of the data of 
the control group. For this purpose, in 
the following, an analysis of variance 
will be applied each time to the data 
obtained from each experimental group 
and control group. 

With respect to Group E-t-1, a com- 
parison with the control group shows 
that only Training Trial was signif- 
icant, F (5, 190) = 16.88, p < .01. 
When the pretraining trials were in- 
creased to three (Group E-t-3), the 
significant interaction of Pretrain- 
ing X Training Trial, F (5, 190) = 
6.22, p < .01, as well as the significant 
effect of Training Trial, F (5, 190) = 
16.30, p < .01, was obtained. However,
Pretraining was not significant,
F (1, 38) = 1.06. When only the data
of the first training trials were used for
the comparison, the mean RT of Group
E-t-3 was significantly shorter than
that of the control group, t(38) = 2.92,
p < .01. With intensive pretraining
(Group E-t-10), Pretraining, Training
Trial, and their interaction were all
found to be significant, F (1, 38) =
5.09, p < .05; F (5, 190) = 20.47, p <
.01; and F (5, 190) = 4.46, p < .01,
respectively. Again, on the first training
trial, Group E-t-10 showed a
shorter mean RT than that of the control
group, t(38) = 3.06, p < .01.

The results of the statistical analysis 
are easily recognized by examining 
Fig. 1. According to Fig. 1, when in- 
creasing the number of the pretraining 
trials, though the overall RT decreases, 
the performance asymptote is reached 
in approximately the same number of 
trials for all the E-t groups as in the 
case of the control group. 

FIG. 2. Reaction time as a function of
training trial for the control group and the
groups that received pressing-alone pretraining.


FIG. 3. Reaction time as a function of
training trial for the control group and the
groups that received pretraining of responding
to another stimulus. (Curves: E-(1-p)-1,
E-(1-p)-3, E-(1-p)-10, and control; abscissa:
trials 1 to 6.)

Since the Pretraining × Training
Trial interaction was significant in the
two comparisons made above, the overall
variation due to each interaction
was further divided for trend analyses.
As is clear from Table 1, Groups E-t-3 
and E-t-10 differed significantly from 
the control group with respect to the 
linear, quadratic, and quartic curva- 
tures. The differences in the quad- 
ratic trends were due mainly to the fact 
that although these groups started from 
short mean RTs, they improved in 
slower rates than that of the control 
group. 
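
The linear, quadratic, cubic, and quartic components examined in the trend analyses correspond to orthogonal polynomial contrasts over the six equally spaced training trials. The following sketch is an illustration only and is not the authors' computation. It derives integer contrast coefficients by Gram-Schmidt orthogonalization of the powers of the trial index; the function name is hypothetical.

```python
from fractions import Fraction
from functools import reduce
from math import gcd


def orthogonal_polynomial_contrasts(n_levels, max_degree):
    """Return integer contrast coefficients for polynomial trends of
    degree 1..max_degree over n_levels equally spaced levels."""
    xs = [Fraction(i) for i in range(n_levels)]
    basis = []  # orthogonal vectors, starting with the constant term
    for degree in range(max_degree + 1):
        v = [x ** degree for x in xs]
        # Remove the projection onto every lower-order component.
        for b in basis:
            coef = sum(vi * bi for vi, bi in zip(v, b)) / sum(bi * bi for bi in b)
            v = [vi - coef * bi for vi, bi in zip(v, b)]
        basis.append(v)

    contrasts = []
    for v in basis[1:]:  # skip the constant term
        # Clear denominators, then divide out the common factor.
        scale = reduce(lambda a, b: a * b // gcd(a, b),
                       (f.denominator for f in v), 1)
        ints = [int(f * scale) for f in v]
        common = reduce(gcd, (abs(i) for i in ints))
        contrasts.append([i // common for i in ints])
    return contrasts


names = ["linear", "quadratic", "cubic", "quartic"]
trend_contrasts = orthogonal_polynomial_contrasts(n_levels=6, max_degree=4)
for name, coefficients in zip(names, trend_contrasts):
    print(f"{name:>9}: {coefficients}")
```

Each set of coefficients sums to zero and is orthogonal to the others, so a Groups X Trials interaction can be partitioned into independent single-degree-of-freedom trend components of the kind tabulated in Table 1.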

Pretraining by pressing the key
alone.—As is clear from Fig. 2, none
of the E-p groups showed an
appreciable difference from the control
group: Each started with a
mean RT close to that of the control group
and improved gradually as the control group
did. When each of the E-p groups
was compared with the control group
by means of statistical analyses, no sig-
nificant pretraining effect was obtained,
F (1, 38) = .003, F (1, 38) = .01,
and F (1, 38) = 1.49, respectively. In
each case, Training Trial was signif-
icant, F (5, 190) = 30.93, p < .01;
F (5, 190) = 20.88, p < .01; and F (5,
190) = 19.86, p < .01, respectively.
The interaction of Pretraining X
Training Trial was not significant in
any comparison, F (5, 190) = .82,
F (5, 190) = 1.25, and F (5, 190) =
1.59, respectively.

Pretraining by responding to another
stimulus.—When Group E-(1-p)-1
was compared with the control group,
the pretraining effect was found to be
insignificant, F (1, 38) = .37. Training
Trial was of course significant, F (5,
190) = 35.03, p < .01. The interac-
tion of Pretraining X Training Trial
was not significant, F (5, 190) = 1.59.


TABLE 1

TREND ANALYSES APPLIED TO THE CONTROL GROUP AND EACH EXPERIMENTAL GROUP LISTED BELOW

                        F (1, 38)
Source       E-t-3      E-t-10     E-(1-p)-3   E-(1-p)-10
Linear       16.70      9.50**     6.51*       11.25**
Quadratic    8.11**     5.34*      .20         .16
Cubic        2.04       .11        1.85        .78
Quartic      15.59**    23.48**    6.02*       4.25*

After increasing the number of the 
pretraining trials of pressing the key 
in response to the light to three, the 
Pretraining effect became significant, 
F (1, 38) = 4.17, p < .05. The inter- 
action of Pretraining X Training Trial, 
F (5, 190) = 3.21, p < .01, as well as 
Training Trial, F (5, 190) = 25.07, 
p < .01, was significant. According to
Fig. 3, the significant interaction is 
shown as one-trial learning in Group 
E-(1-p)-3. For this group, however, 
the pretraining effect on the first train- 
ing trial was not statistically significant, 
t(38) = 1.43.

The effect of the 10 pretraining trials
was about the same as that of the 3
pretraining trials. In the comparison
of Group E-(1-p)-10 with the control
group, Pretraining, Training Trial,
and their interaction were found to be
significant, F (1, 38) = 11.40, p < .01;
F (5, 190) = 22.10, p < .01; and F (5,
190) = 5.14, p < .01, respectively.
The pretraining effect appeared on the
first training trial, since a significant
difference in the mean RTs was ob-
tained between this group and the con-
trol group, t(38) = 2.95, p < .01.

Part of Table 1 shows the results of
trend analyses applied to the com-
parisons involving Groups E-(1-p)-3
and E-(1-p)-10. In both cases, dif-
ferences in the linear and quartic com-
ponents were significant. The quad-
ratic trends were not significantly dif-
ferent in either case. This is because
on the second training trial both ex-
perimental groups improved as much as
the control group, resulting in similar
quadratic curvatures. In the main, the
statistical analyses show that both
groups started with short mean RTs
(significantly so for Group E-(1-p)-10) and
improved further on the second train-
ing trial in such a way that the per-
formance asymptote was reached on
the second trial.


Discussion 


According to the present results, when 
S had a moderate (3 trials) or intensive 
(10 trials) pretraining of hearing the 
tone alone, its main effect was on the first 
training trial. There are several possibil- 
ities to account for this effect. 

First, since the E-t groups received the 
additional pretraining prior to the train- 
ing trials in comparison with the control 
group, the pretraining effect of hearing 
the tone alone might be confounded by the 
warm-up effect. This possibility can be 
ruled out because Ss in the E-p groups 
showed no pretraining effect at all, while 
they received another kind of pretraining 
which would have also led to the same 
warm-up effect. 

Second, during the pretraining trials of 
hearing the tone alone, S might get some 
information about the tone onsets, because 
the tone was presented with the same 
average intertrial interval in the pretrain-
ing and training sessions. As a matter of
fact, Klemmer (1957) found that the rela- 
tion between RT and the time uncertainty 
of the stimulus onset is approximately 
linear. The effect of the time uncertainty 
for the E-t groups would be, however, 
relatively small and not sufficient to ac- 
count for the obtained effect on the 
grounds that, though the same average 
intertrial interval was used for the E-p 
groups, no such effect was obtained. 

Third, the pretraining effect obtained
with the E-t groups might be due to some
kind of information-checking mechanism.
This possibility is based on the following
reasoning. After S has heard the tone
several times, it might lose its novelty,
although the stimulus in question is a
very simple one. The simplicity of the
stimulus, indeed, does not imply at all
that it is not new (Berlyne, 1960). If
the stimulus is new to some extent there
should be some information-checking
mechanism operating (Sokolov, 1963)
which may be clarified by investigating
the concurrent orienting reflexes such as
GSR and vascular reactions. The adapta-
tion of this information-checking mech-
anism is probably partly responsible for
the initial improvement in simple RT
training. According to the present ex-
periment, this information checking re-
quires at least a few trials, for no pre-
training effect was obtained with Group
E-t-1.

The second main finding of the present
experiment is the pretraining effect ob-
tained with Groups E-(1-p)-3 and E-
(1-p)-10. One-trial learning is the con-
spicuous feature of this effect. A pos-
sible explanation is as follows: With
several pretraining trials of pressing the
key to the light stimulus, Ss learned to
respond to the general nature of stimulus
onset. This learning then transferred to the train-
ing trials during which Ss had to respond
to the tone onset. In view of the results
obtained with the E-t groups, the reason
why the performance of Groups E-(1-p)-3
and E-(1-p)-10 did not reach the asymp-
tote on the first trial might be
that the stimulus on the training trials
was new to them. Thus, the decrease in
RT obtained on the first trial for either
Group E-(1-p)-3 or E-(1-p)-10 might be
due to the transfer process. The fact
that the decrease in RT was not complete
(i.e., not reaching the asymptote) on the
first trial might be due to the information-
checking mechanism. Presumably, the in-
formation-checking mechanism had been
exercised sufficiently during the pretrain-
ing trials for Groups E-(1-p)-3 and
E-(1-p)-10 (because the light was then
new), resulting in one-trial learning for
these groups in the training session. In
other words, as one stimulus becomes
less new to S, another stimulus similar in
its function becomes less new to S more
quickly.

The above explanation of one-trial
learning in Groups E-(1-p)-3 and E-
(1-p)-10 is in agreement with the results
of the E-t groups. The Ss in the E-t
groups, irrespective of the degree of their
pretraining, still required about the same
number of training trials as Ss in the
control group to reach the performance
asymptote. During these training trials
before reaching the asymptote, Ss in the
E-t groups apparently had to learn to
respond to stimulus onset, and the num-
ber of trials to the asymptote apparently
could not be reduced by the pretraining of
hearing the tone alone.
INVESTIGATION OF GRAMMATICAL CLASS AS AN
ENCODING CATEGORY IN SHORT-TERM MEMORY


DELOS D. WICKENS, SANDRA E. CLARK, FRANCES A. HILL,
AND ROY P. WITTLINGER


Ohio State University


In the Peterson-type STM experiment, proactive inhibition (PI) 
builds up rapidly but will be released if a shift to a new class of 
materials is introduced. This report describes 5 experiments investi- 
gating the possibility that a release will occur when a shift is made 
from adjectives to verbs and vice versa. The experiments used triads
of words and measured retention after 15 sec. In the first
4 experiments, release of inhibition was obtained when the shift 
occurred. However, it seemed possible to account for these effects 
by either phonetic or semantic factors. The final experiment sampled 
a broad class of words, avoiding semantic and phonetic uniqueness 
within a grammatical class. In this experiment, no inhibition release 
was found and it was concluded that grammatical class of verbs or 
adjectives is not used to encode words in STM. 


In the short-term memory situation 
employing the Peterson and Peterson 
(1959) technique, proactive inhibition 
(PI) builds up rapidly (Keppel & 
Underwood, 1962) reaching a maxi- 
mum after three or four trials. It has 
been shown by Wickens, Born, and 
Allen (1963), however, that PI will 
be eliminated if the item presented on 
a given trial is drawn from a different 
class of material. In that experiment, 
one group of Ss was presented with a 
series of consonant trigrams (CCCs) 
and another group with a series of 
three-digit numbers (NNNs); each 
group was then shifted to a test item of 
the other class. Both groups showed a 
marked improvement on the shift trial, 
performing as well on that trial as they 
did on the first one of the experiment. 
Subsequently, Loess (1967) has 
demonstrated the same buildup and
release of PI when Ss shift from a 
homogeneous class of word triads 
(such as species of birds) to a new 
class of word triads (species of trees). 

Together these experiments indicate 
that, in the STM situation, triads or 
trigrams, all elements of which are 
homogeneous with respect to concep- 
tual class, seem to be encoded not only 
as unique items, but also as members 
of the same conceptual class, If an- 
other item is drawn from a different 
conceptual class, then competition no 
longer occurs or, at least, is mini- 
mized. A reduction in retroactive in- 
hibition is also found in long-term 
memory if the originally learned list 
and the interpolated list are composed 
of items drawn from different classes 
of material (Friedman & Reynolds, 
1967; Postman, Keppel, & Stark, 
1965). 

The results of the Wickens, Born, 
and Allen, and of the Loess experi- 
ments suggested to the authors the 
possibility that the shift procedure may 
be used as something of a projective 
technique: a way of asking S what 
response classes are being employed
without requiring him to identify and 
label them. It was the purpose of the 
present study to employ the Wickens, 
Born, and Allen technique with two 
grammatical classes of the English 
language. 

Verbs and adjectives were chosen as 
grammatical classes to investigate for 
several reasons: They are considered 
as different grammatical forms in our 
language; it is not too difficult to 
choose verbs which are seldom used 
as adjectives; and, finally, Rowland 
(1907) has reported that the “state 
of mind” aroused by verbs and adjec-
tives differs.


PRELIMINARY EXPERIMENTATION 


A series of four experiments was 
conducted before the final experiment 
was designed. All five experiments were 
conceptually identical. The basic design 
consisted of presenting a series of four 
triads of verbs (adjectives) to college 
student Ss. An experimental group 
then shifted to a triad of adjectives 
(verbs) on the fifth trial, while a con- 
trol group continued with a triad of 
the original class, verbs or adjectives. 
In addition, on the sixth trial the con- 
trol group was shifted, and the experi- 
mental group continued on its new 
class to be shifted once again on the 
ninth trial. The only uncontaminated 
trial testing the major hypothesis was 
the fifth trial, but, since it took only 
approximately an additional 3 min., 
these other trials were added to the 
procedure. All Ss were drawn from 
the elementary psychology classes at 
Ohio State University. 

Experiment I.—In the first experi-
ment, the triads were presented audi-
torily by E, the triads being selected
from a group of verbs and adjectives 
which had been found by Black (1952) 
to be easily discriminated in a noisy 
environment. 


A Gerbrands tape timer turned on 
and off a signal light, and the onset of 
the light was used to cue E for presen- 
tation and to inform S when to respond 
by repeating the item in the original 
order. The word triad was spoken 
twice by E in rapid succession and re- 
peated twice by S. The S was then 
presented with a two-digit number 
from which he subtracted by 3s to the
1-sec. beat of a metronome. After 15
sec., a recall signal was flashed, and
the next trial began 15 sec. later. The
triads used were: fearful, hungry,
thoughtful; active, costly, better;
ancient, stormy, faithful; kindly, thank-
ful, stubborn; certain, graceful, early;
sharpen, possess, follow; invade, con-
tain, repair; invite, reduce, dazzle;
relate, include, tickle; carry, suspend,
abide.

Half of the Ss started with adjectives 
and shifted to verbs, while the other 
half of the 160 Ss used the reverse of 
that order. As they appeared in the 
laboratory, Ss were assigned alter- 
nately to control or experimental 
groups. Counterbalancing was em- 
ployed so that all triads appeared 
equally in each position. 

The responses were scored by giving 
one point for each item correctly re- 
called and an additional point if given 
in the correct order. The percentage 
correct for the experimental and con- 
trol groups on Trial 1 were 92 and 
90, respectively, and on Trial 4 they 
were 42 and 45. Obviously, PI effects 
had been generated. The critical trial 
was Trial 5, the trial on which the ex- 
perimental group shifted to the other 
grammatical class. On this trial the 
experimental group had a percentage 
of 58, and the control a percentage of 
45. The difference between the
groups was highly significant, t (158)
= 3.59, p < .005. A similar gain on
the shift at Trial 9 was found, with a
gain from 55% correct to 67.5% for
the experimental group. It excelled
the control group by 21.5%, and the
difference was significant at the p <
.005 level.
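
The scoring rule described above can be sketched in Python. The text does not say whether the additional point for correct order was awarded per item or once per triad; the sketch assumes the per-item reading (one extra point for each recalled word that appears in its original position), so the maximum for a triad is 6. The function names are hypothetical.

```python
def score_triad(presented, recalled):
    """Score one recall attempt: 1 point per presented word that is
    recalled, plus 1 point if it is recalled in its original position.
    The per-item order bonus is an assumed reading of the rule."""
    score = 0
    for position, word in enumerate(presented):
        if word in recalled:
            score += 1
            if position < len(recalled) and recalled[position] == word:
                score += 1
    return score


def percent_correct(presented, recalled):
    """Express the score as a percentage of the maximum (2 per word)."""
    return 100.0 * score_triad(presented, recalled) / (2 * len(presented))


triad = ["fearful", "hungry", "thoughtful"]
print(score_triad(triad, ["fearful", "hungry", "thoughtful"]))  # all items, all in order
print(score_triad(triad, ["hungry", "fearful", "thoughtful"]))  # all items, one in order
print(percent_correct(triad, ["fearful"]))                       # one item, in order
```

Under the alternative reading (a single bonus point per triad) the maximum would be 4 and the percentages would change accordingly.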

Experiment II.—Since in Exp. I it 
was possible that, despite E’s efforts 
to prevent its occurrence, biasing could 
have arisen when the material was read 
aloud by E to S, a second experiment 
was conducted in such a manner as to 
eliminate this possible source of con- 
founding. This was done by present- 
ing all of the materials by a Carousel 
projector controlled by a Gerbrands 
timer. Eight of the words in the 
original triads were changed because 
they seemed to be items particularly 
easy to recall; “dazzle” and “tickle” 
are examples. The number of Ss was 
again 160. 

The results of this experiment agreed 
closely with those of Exp. I. Proactive 
inhibition was shown by the drop of 
the experimental and control groups 
from 85% and 93% correct on Trial 1 
to 37% and 39% on Trial 4. On Trial 
5, the experimental group rose to 48% 
while the control was 36%. The dif-
ference between the groups was sig-
nificant at p < .025. A similar gain
on the shift at Trial 9 was found with
a gain from 37% correct to 52% for the
experimental group. It excelled the
control group by 11%, a difference
which was significant at the p < .05
level.

Experiments III and IV.—A scru-
tiny of the words employed in Exp. I
and II will show that a number of the
adjectives contained the sound “ful” or
“y.” Since it has been shown that
acoustic similarity is a cause of inter-
ference in STM (Conrad, 1964;
Wickelgren, 1965), it could be argued 
that the positive results obtained might 
have been produced by the phonetic 
differences between the class of words 
chosen to represent the adjectival class
and those chosen to represent the ver-
bal class. This suggested a need for
a list of adjectives without distinguish- 
ing suffixes. For Exp. III and IV, 
the following high-frequency words 
were chosen: cheap, dark, great; tall, 
rich, hard; fresh, large, soft; bright, 
thin, small; high, true, glad; big, poor, 
straight; speak, try, hang; prove, draw,
teach; hear, fall, sing; come, reach, 
spend; build, learn, get; thank, bring, 
give. Experiments III and IV were 
identical in procedure, except that one 
was conducted in the spring quarter 
and one in the summer. 

The materials were presented vis- 
ually, with the Carousel projector being 
paced by a Gerbrands tape timer. The 
procedures of these two experiments 
differed slightly from the earlier pro- 
cedures in that five trials of the same 
class of material were given before 
shifting, and the second shift followed 
five more trials. There were 122 Ss
in each experiment. In Exp. III, the
experimental-control difference after
the first shift did not quite reach an
acceptable level but was .05 < p < .10.
After the second shift, it was clearly not
significant (t = .9). In Exp. IV, the
difference after the shift was significant
on Trial 6, p < .01, but it was not sig-
nificant on Trial 12.

Discussion of Experiments I-IV.—
Although all four experiments gave 
evidence that PI was reduced following 
the shift in grammatical class, it was 
thought possible that other variables 
associated with list construction might 
be responsible for the results. Specif- 
ically, in the first two experiments 
there was a repetition of the phonemes 
“ful” and “y” in the adjectives. In the 
third and fourth experiments, a within- 
grammatical class semantic similarity 
very clearly existed for the adjectives 
in such antonyms as “rich” and “poor” 
or “hard” and “soft.” Possibly some-
thing of the sort was true for such
paired verbs as “speak” and “sing”
or “learn” and “prove.” It seemed
possible that semantic differences
occurred between grammatical classes.
The reduction in PI might have arisen
from a general semantic context in one
class which was different from that in
another. Experiment V was designed
to sample verbs and adjectives more 
widely so as to make as sure as possible 
that the two grammatical classes were 
alike in both phonetic and semantic 
characteristics. 


EXPERIMENT V 
Method 


Subjects.—The Ss were 400 students from
the elementary psychology classes at Ohio 
State University. 

Materials.—Verbs and adjectives were
drawn from the Thorndike-Lorge word book 
(1944) and were subdivided into high- (A 
and AA) and low-frequency words (10-49). 
Each grammatical and frequency class was 
equally represented by one- and two-syllable 
words. There were 15 words; hence, five 
triads in each class, and each word began 
with a different phoneme. Care was taken 
to avoid intraclass phonetic similarity and 
semantic similarity—in short, to make each 
subset of words heterogeneous with re- 
spect to meaning and sound. Because of 
the large number of words used (120), a list 
is not included. Presentation was again by 
projector. 

Procedure.—The general procedure closely 
paralleled that of previous experiments. 
Each triad was repeated twice; the reten- 
tion interval task was subtraction by 3s with 
a metronome beating at a 1-sec. rate; the
retention interval was 15 sec.; the recall 
interval was 10 sec.; the rest interval 5 sec.; 
and the warning signal (an asterisk), which 
came on before the next triad, lasted for 2 
sec. The duration of the stimulus triad 
was controlled by E, who switched the pro- 
jector to the subtraction procedure as S 
recited the second unit of the triad for the 
second time. The Carousel takes about .4 
sec. to switch to the next slide, so the sub- 
traction number was being presented ap- 
proximately at the time the last word was 
emitted. The total presentation time of the 
stimulus triad was about 1.5 sec. 

Each subgroup of the experiment consisted 
of 25 Ss, and there were 16 such groups, 8
control and 8 experimental. This many sub-
groups was required to use a high- and
a low-frequency list of one-syllable words,
and a high- and a low-frequency list of two-
syllable words for both verbs and adjectives.
Obviously, a group was given a homogeneous
list of high- or low-frequency, one- or two-
syllable words. Control Ss were given five
triads of one grammatical class, verbs or 
adjectives, while the experimental Ss were 
given four triads of one class and then 
shifted to a triad of the same frequency and 
syllable value on the fifth and last trial. 
The shifts were made in both directions, 
from verbs to adjectives and vice versa. 
Counterbalancing was arranged so that all 
triads appeared equally often on each trial. 
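
The counts given in the Method section can be checked by enumerating the design. The sketch below is only an arithmetic illustration; the variable names are hypothetical, and "control" and "experimental" are crossed with the class on which a group began.

```python
from itertools import product

GRAMMATICAL_CLASSES = ("verb", "adjective")
FREQUENCIES = ("high", "low")
SYLLABLES = (1, 2)
CONDITIONS = ("control", "experimental")
WORDS_PER_CLASS = 15      # stated in the Materials section
SS_PER_SUBGROUP = 25      # stated in the Procedure section

# Each word class is one grammatical class at one frequency and syllable value.
word_classes = list(product(GRAMMATICAL_CLASSES, FREQUENCIES, SYLLABLES))

# Each subgroup is a condition crossed with the word class it started on.
subgroups = list(product(CONDITIONS, word_classes))

triads_per_class = WORDS_PER_CLASS // 3
total_words = len(word_classes) * WORDS_PER_CLASS
total_ss = len(subgroups) * SS_PER_SUBGROUP

print(len(word_classes), "word classes,", triads_per_class, "triads each")
print(total_words, "words in all")
print(len(subgroups), "subgroups,", total_ss, "Ss in all")
```

The enumeration reproduces the 120 words, 16 subgroups, and 400 Ss reported in the text.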


Results 


The critical test was, of course, a
comparison between the experimental
and control groups at Trial 5 on which
each group was presented with the
same class of material. The combined
results for all groups are presented in
Fig. 1 which plots the percentage of
items recalled regardless of order of
recall. It is apparent that PI developed
rapidly, and, although the experimental
group slightly exceeded the control on
Trial 5, the t value for the difference
was less than 1.00. A similar lack of
significance was found on Trial 5 for
each of the appropriate subgroups of
the experiment. An analysis of the
effects of the variables of frequency
and number of syllables showed no
significant main effects. An unex-
pected interaction between Number of
Syllables X Frequency did reach sig-
nificance. That is, the low-frequency
two-syllable words were associated
with better performance. No rationale
for this finding was discovered.

Fig. 1. Percentage of recall of items as a
function of trials.


Discussion 


On the basis of Exp. V, it is concluded 
that the first four experiments may have 
given positive results because of a con- 
founding of either within-class phonetic 
characteristics or within-class semantic 
characteristics with the grammatical di-
mension. In Exp. V, a broader sample
of words was employed and special effort 
was made to reduce or eliminate unique 
phonetic or semantic characteristics of 
a particular grammatical class. Under 
these conditions, the positive effect of the 
shift did not occur. 

These data strongly suggest that, when
words are encoded for short-term mem-
ory, they are not classified according to
the grammatical forms of verbs or adjectives.
This conclusion is based upon 400 Ss and
a broad sampling of words. With so
large an N and so wide a sampling of
words, it does not seem that the accept-
ance of the null hypothesis is a radical
act. Whether other grammatical forms
are used in classification in STM is a
question which is not likely to receive a
satisfactory positive answer. In the case
of verbs and nouns, so many words serve
in both grammatical classes that encod-
ing by this method would be most un-
reliable. Articles, conjunctions, and prep-
ositions are too limited in the available
pool to be employed in a convincing ex-
periment. In addition, one would feel
uneasy in employing words with such
high frequency of usage as articles, con-
junctions, and prepositions against words
of considerably lower frequency. The un-
easiness is not completely eliminated by
the negative findings on frequency in the
present experiment.

The negative finding with respect to
encoding by grammatical class in this
short-term memory experiment is con-
sonant with a failure to obtain gram-
matical class clustering in free recall.
Cofer and Bruce (1965) presented a 36-
item list containing 12 adjectives, 12
nouns, and 12 verbs and obtained no
evidence of clustering in free recall over
four trials. This was true for a group for
whom the three form classes were pre-
sented in a block as well as for a group
with random presentation. A subsequent
sorting task indicated that other Ss from
the same student population could make
differentiations among the form classes.
The results of the Cofer and Bruce experi-
ment could be explained if it is assumed
that the nature of the clustering in free re- 
call is greatly if not completely determined 
by the nature of the initial encoding cate- 
gories. If a category is not used in the 
encoding stage, then it is not likely to be 
used in free recall output; if it is used in 
one, it will be in the other as Loess 
(1967) has shown for the Bousfield, 
Cohen, and Whitmarsh (1958) taxonomic 
classes. 

The failure to find a difference in reten- 
tion between high- and low-frequency 
words would seem to be contrary to the 
findings of Turnage (1967) who reported 
a performance difference in favor of 
high-frequency words at short intervals, 
but not at a longer interval of 30 sec. In 
his study only a single word was em- 
ployed rather than three as in the present 
experiment and it has been shown that 
there is a more rapid decline for a three- 
unit item than for an item composed of 
a single unit (Melton, 1963; Murdock, 
1961). Thus, it is possible that the three- 
item group at 15 sec. of the present ex- 
periment is comparable to his 30-sec., 
one-item group. The failure to find a 
difference in retention between one- and 
two-syllable words is quite comprehensible 
if a word is considered to constitute a 
chunk regardless of the number of 
syllables. 

In summary, it seems unlikely that 
grammatical classes of verbs and adjec- 
tives are used by the individual as en- 
coding classes in STM, other than through 
their unique phonetic suffixes. This ex-
periment further seems to demonstrate an
important methodological point: that, 
when one works with words in STM, the 
variable being studied can easily be con- 
fused by phonetic and semantic charac- 
teristics of the words used. 
REPETITION EFFECTS IN IMMEDIATE MEMORY WHEN
THERE ARE NO REPEATED ELEMENTS
IN THE STIMULI 1


ROBERT G. CROWDER 


Yale University 


2 experiments are reported in which Ss recalled series of 8 consonants 
immediately following auditory presentation. S was required always 
to include 1 prearranged, redundant consonant in his recall, as a prefix 
before the series in Exp. I and between the 2nd and 3rd elements in 
Exp. II. In both studies there was a comparison of inclusive condi- 
tions, where the redundant letter also occurred as part of the memory 
series, with exclusive conditions, where the redundant element was 
never repeated in the memory series. The main finding was that 
patterns of output repetition in the inclusive conditions affected the 
form of the serial position curve, simulating, in critical cases, the effects 
of intraserial repetition (a repeated letter in the series as presented). 
These results were interpreted as evidence that the effects of distributed 
intraserial repetition are attributable chiefly to output rather than input
processes.


Including a repeated element (RE) 
in a memory-span item affects both the 
frequencies and distributions of errors 
obtained (Crowder, in press; Crowder 
& Melton, 1965; Wickelgren, 1965, 
1966; Wolf & Jahnke, 1968). Such 
intraserial repetition effects are now 
understood to include two distinct 
empirical findings, at least for the case 
in which a single RE occurs in a series 
of eight consonant letters (Crowder, in 
press). With massed repetition, i.e., 
zero or one nonrepeated element 
(NRE) between occurrences of the 
RE, there is improved recall at all serial 
positions, most of all at the positions 
of the RE. With distributed repeti- 
tion, i.e., two or more NREs between 
occurrences of the RE, there is re- 
duced recall for the RE itself but not 
elsewhere (this inhibition is known as 
the Ranschburg Phenomenon). 


1 This research was supported by Grant
GB 4066 from the National Science Founda-
tion. Susan E. Bender deserves thanks for
her careful collection and tabulation of the
data.


A pair of conditions employed in an 
experiment designed for other purposes 
(Crowder, 1967) has considerable rele- 
vance for the Ranschburg Phenomenon. 
These conditions both involved the use 
of a redundant prefix (Conrad, 1958). 
On each trial S had to emit some pre- 
arranged letter between the time he 
heard the stimulus series and the time 
he recalled it. The comparison of inter- 
est originally showed that recall was 
better if the redundant element was 
excluded from the memory series than 
if it was allowed to occur later in the 
series. It is, however, the inclusive 
condition that is relevant here because 
although the eight letters S heard con- 
tained no repetition, the nine he re- 
called did. If repetition effects were a 
consequence of the way S encoded the 
series, then they should not occur with- 
out repetition in the stimulus. If the 
effects were due to output processes, 
however, they should appear in the in- 
clusive condition. 

Although the previous experiment 
was adequate to establish the overall 
superiority of the exclusive condition
to the inclusive condition, there were 
not sufficient observations to trace re- 
liable serial position functions for the 
eight subconditions of the inclusive 
condition (those subconditions being 
defined by where in the series the pre- 
fix letter was repeated). The purpose 
of Exp. I was to replicate the condi- 
tions described so as to provide ade- 
quate data to analyze separately for 
serial position the conditions of “simu- 
lated repetition.” 

In Exp. II the redundant element 
was placed between the third and fourth 
elements of the to-be-recalled string, 
rather than before the first, to deter- 
mine whether different patterns of 
simulated repetition would be found. 


EXPERIMENT I 


Method.—Sixteen Yale undergraduates re- 
called 128 eight-consonant series. On each 
trial S was required to emit a prefix before 
reproducing the series; however, the prefix 
was not presented with the series. All Ss 
received the stimuli in the same order and 
served under both conditions, half working 
under the inclusive condition first and half 
under the exclusive condition first. Thus 
the two main conditions were balanced 
against order effects and against individual 
stimuli. 

The stimuli were all permutations of the
same eight consonant letters (BFHKNRTX)
taken from successive rows of randomized
Latin squares. Under the exclusive condi-
tion the prefix was always the letter b;
under the inclusive condition exactly two 
Ss used each of the eight letters from the 
vocabulary. The latter precaution was ex- 
tremely important for it meant that each of 
the 128 stimuli in the study contributed 
equally often to each subcondition and that 
the serial position data for any given sub- 
condition (e.g., Cond. P5, where the prefix 
letter was repeated in the fifth serial posi- 
tion of the memory series) were absolutely 
unconfounded with stimulus series, with 
order of testing, and with the actual letter 
used as prefix. 
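
The counterbalancing just described can be illustrated with a short sketch. It is offered only as an illustration under stated assumptions: the function names, the cyclic construction of the square, the arbitrary seed, and the choice of K as the prefix letter are all hypothetical and are not taken from the procedure actually used to prepare the stimuli.

```python
import random

LETTERS = "BFHKNRTX"  # the eight-consonant vocabulary named in the text


def randomized_latin_square(symbols, rng):
    """Build a Latin square by shuffling the symbols, rows, and columns
    of a cyclic square (an assumed construction, for illustration)."""
    n = len(symbols)
    shuffled = list(symbols)
    rng.shuffle(shuffled)
    # Cyclic square: every symbol occurs once per row and once per column.
    rows = [[shuffled[(r + c) % n] for c in range(n)] for r in range(n)]
    rng.shuffle(rows)
    col_order = list(range(n))
    rng.shuffle(col_order)
    return [[row[c] for c in col_order] for row in rows]


def subcondition(series, prefix):
    """Label a trial P1..P8 by the serial position where the prefix recurs."""
    return "P%d" % (series.index(prefix) + 1)


rng = random.Random(0)  # arbitrary seed, used only for reproducibility
square = randomized_latin_square(LETTERS, rng)
labels = [subcondition(row, "K") for row in square]  # "K" is an arbitrary prefix
```

Because each letter occupies each column of a Latin square exactly once, the eight rows of one square supply one series for each of Subcond. P1 through P8, whichever letter serves as the prefix.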

The series were recorded at a rate of 2 
consonants/sec on a Wollensak No. 1500 tape 
recorder. The Ss recalled the series out 
loud, with the aid of a position grid (see 
Crowder, 1967). About 15 sec. were allowed 
for recall. 


Results and discussion.—There was 
no appreciable difference in recall per- 
formance between the 16 Ss of the 
present replication and the 16 from the 
earlier replication (t = .52) and there- 
fore all 32 will hereafter be considered 
together. As before, the difference in 
errors between the inclusive and exclu- 
sive conditions strongly favored the ex- 
clusive condition, t (31) = 4.03, p < 
.01. 

A two-way repeated-measures ana- 
lysis of variance based on the eight 
subconditions of the inclusive condi- 
tion showed statistically significant 
main effects of Subcondition, F (7, 
217) = 3.73, p < .01, of Serial Posi- 
tion, F (7, 217) = 126.82, p < .01, and 
a significant interaction, F (49, 
1519) = 1.58, p < .01. The interac- 
tion indicated that the location of pre- 
fix repetition affected the form of the 
serial position curve. To identify 
which particular subconditions were 
responsible for the interaction, simple 
main effects (Winer, 1962) were cal- 
culated to show variation due to sub- 
condition at each serial position 
separately. (The other way of analyz- 
ing simple main effects would have 
made no sense since there were strong 
serial position effects with all subcon- 
ditions.) None of these comparisons 
approached statistical significance ex- 
cept that associated with the third serial 
position, F (7, 217) = 2.54, p < .025. 
Certain features of the serial position 
data are presented in Fig. 1, which 
makes it clear that the significant varia- 
tion in errors at the third serial posi- 
tion was attributable to Subcond. P3, 
where two NREs intervened between 
the prefix and its reoccurrence in the 
series. The other seven inclusive sub- 
conditions are represented by bars 
showing the range from most to least 
errors. In all cases there were gen- 
erally more frequent errors than in the 
exclusive condition (solid line) but 
nowhere the localized inhibition ob- 
tained at the third position of Subcond. 
P3 except for a somewhat similar 
tendency at the second serial position of 
Subcond. P2; however, this effect was 
not statistically significant, F (7, 217) 
= 1.36, p > .10. 

Fig. 1. Results of Exp. I. (The exclu- 
sive, or control, condition and Cond. P3 are 
shown in the solid and dashed lines, respec- 
tively. The other inclusive conditions are 
represented by the range of scores for each 
serial position.) 
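
The degrees of freedom reported for Exp. I can be checked by simple arithmetic. The sketch below assumes a fully within-S design with 32 Ss, eight subconditions, and eight serial positions, in which the error term for each effect is its interaction with Ss; the function name is hypothetical.

```python
def within_error_df(effect_df, n_subjects, n_groups=1):
    """Error df for a within-S effect: effect df x (subjects - groups)."""
    return effect_df * (n_subjects - n_groups)


n_subjects = 32          # 16 Ss from each of the two replications
df_subcondition = 8 - 1  # eight subconditions, P1 through P8
df_position = 8 - 1      # eight serial positions
df_interaction = df_subcondition * df_position

error_subcondition = within_error_df(df_subcondition, n_subjects)
error_position = within_error_df(df_position, n_subjects)
error_interaction = within_error_df(df_interaction, n_subjects)
```

Under these assumptions the computed values are 7 and 217 for each main effect and 49 and 1519 for the interaction, which agree with the F ratios quoted in the text.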

Parametric data on recall for eight- 
consonant series with all possible loca- 
tions of a single RE (Crowder, in 
press) revealed that among cases of 
distributed repetition the conditions 
where exactly two NREs came be- 
tween locations of the RE were un- 
usually difficult; four of the five most 
difficult conditions in that experiment 
(in terms of correct recall) were cases 
of that sort. Since strong localized 
inhibition was found in Exp. I precisely 
where two letters occurred between the 
prefix and its repetition in the memory 
series, it was tentatively concluded that 
Cond. P3 simulated with some fidelity 
the Ranschburg Phenomenon. 


EXPERIMENT II 


Method.—The conclusion that Cond. P3 
simulated the Ranschburg Phenomenon re- 
mained tentative because the other inclusive 
subconditions were not significantly affected 
in Exp. I. There is, however, no particular 
reason why the redundant element in this 
type of experiment has to be employed as a 
prefix. The logic behind Exp. II was that 
if the redundant letter were inserted else- 
where in the series then other conditions of 
intraserial repetition would be set up. The 
conduct of Exp. II was identical with that of 
Exp. I in every detail except for two: (a) 
the 16 new Ss were students of both sexes 
from several colleges, all living in New Ha- 
ven during the summer, and (b) the redun- 
dant element was emitted by S between his 
recall of the second and third element of the 
to-be-recalled string. 


Results and discussion.—As in Exp. 
I the inclusive condition was more dif- 
ficult for Ss than the exclusive condi- 
tion, although the difference was statis- 
tically significant only in the case of 
correct recall for series, t (15) = 2.02, 
p <.05, and not in total errors (t = 
1.04). 

The four panels of Fig. 2 give serial 
position data for all inclusive condi- 
tions, compared two at a time with the 
exclusive control data. For Cond. 4 
through 6 or possibly 7 there can be no 
question that patterns of output repeti- 
tion affected the forms of the serial 
position functions. Analysis of vari- 
ance of the inclusive conditions showed 
significant main effects of Condition, 
F (7, 98) = 3.53, p < .01, of Serial 
Position, F (7, 98) = 57.16, p < .01, 
and a significant interaction, F (49, 
686) = 1.34, p < .05. (Stage of prac- 
tice was included in this analysis as a 
between-group comparison to reduce 
error variance; it was not a significant 
source of variation and did not interact 
with the other factors.) The signif- 
icant interaction confirmed that the 
shapes of the serial position curves 
were affected and not just their eleva- 
tion. The striking inhibition observed 
in the fourth, fifth, and sixth positions 
of Cond. 4, 5, and 6 lent considerable 
support to the interpretation given 
Exp. I. 

Fig. 2. Results of Exp. II. (The exclusive, or control, condition is represented in each 
panel by the filled circles. The inclusive conditions are labelled by the position in which 
the redundant element was repeated and are compared, two at a time, to the control 
condition.) 


GENERAL DISCUSSION 


The present results suggest that the in- 
hibitory effects of distributed intraserial 
repetition occur while S is recalling the 
series. The implication for the facilita- 
tive effects of massed repetition is just the 
opposite; in none of the conditions studied 
where zero or one NRE intervened be- 
tween the redundant element and its repe- 
tition (there were six such conditions— 
P1 and P2 in Exp. I and Cond. 1, 2, 3, 
and 4 in Exp. II) was there any ap- 
preciable reduction in errors. This fail- 
ure to simulate effects of massed repe- 
tition is consistent with the position of 
Wickelgren (1965) and others (Crowder, 
in press; Tatuno, 1961) that massed repe- 
tition makes it easier to encode series and 
possibly reduces intraserial interference. 
One testable implication of the output- 
interference interpretation of distributed 
repetition effects is that with partial recall 
techniques the Ranschburg Phenomenon 
should be reduced in magnitude. A re- 
cent study by Wolf and Jahnke (1968) 
using the probe method confirms this 
deduction. 

Although the present results tell one 
where to look for a theory of intraserial 
repetition effects, and thus impose limits 
on such a theory, they do not identify 
the nature of the relevant processes. An 
interesting feature of the present design 
is that since each of the memory series 
used a permutation of the eight letters 
in the vocabulary there was, logically 
speaking, only order information to re- 
member and no “item information” (Con- 
rad, 1965). It could be speculated, along 
the lines of Conrad’s argument about 
serial order errors (Conrad, 1965), that 
Ss assumed nonrepetition of letters in the 
stimuli and when they came to an ele- 
ment they previously used (albeit as the 
redundant letter) they rejected it. If 
this were so, then the tendency to reject 
the redundant element as a series mem- 
ber should decrease with practice on the 
inclusive task, for, particularly as S ex- 
perienced conditions of massed repetition, 
he would learn that the characteristics of 
the stimulus set permitted repetition of 
the redundant element. To test this pos- 
sibility the data of Cond. 5 in Exp. II 
were analyzed by stage of practice; each 
S had eight trials under each inclusive 
condition, so the first four were com- 
pared with the second four. The results 
of this analysis are shown in Fig. 3. Ob- 
viously, the magnitude of the simulated 
repetition effect did not decrease with 
practice on the inclusive task. 

Fig. 3. Analysis of Cond. 5 in Exp. II 
by stage of practice. (Solid line is the ex- 
clusive condition.) 

Insofar as can be determined the asso- 
ciative structure of immediate memory 
tasks involves both serial position 
cues and forward chained associations 
(Wickelgren, 1966). With either type of 
association governing the output of a 
given element, the conditions are set for 
an A-B, C-B interference paradigm with 
respect to recall for the RE. (With the 
NREs following the RE, the A-B, A-C 
paradigm is of course satisfied and 
Wickelgren, 1965, 1966, has offered con- 
vincing evidence that such associations 
predict the identity of certain errors.) 
One line of future research would profit- 
ably be an exploration of conditions where 
the same response is associated with dif- 
ferent stimuli in short-term memory 
situations where stimulus and response 
functions are more clearly delineated than 
in memory-span studies (e.g., the tech- 
nique of Tulving & Arbuckle, 1963). 


Habituation of the orienting reflex (OR) is typically treated as a 
kind of stimulus learning. The results of 3 experiments employing 
the GSR indicate, however, that (a) stimuli which have behavioral 
associates produce greater ORs than those which do not, (b) “new” 
behavioral associations produce greater ORs than “old” associations, 
i.e., selective habituation occurs to response characteristics, and (c) 
level of ORs across habituation trials is a direct function of the 
amount of response information to be encoded. It appears, therefore, 
that the “neuronal model” of a stimulus includes the characteristics 
of associated responses. 


The orienting reflex (OR) consti- 
tutes a diffuse complex of somatomotor 
(e.g., postural) and autonomic (e.g., 
cardiovascular, sense organ) responses 
with three defining characteristics: 
(a) The adequate stimulus for the OR 
is one which is, to some extent, “new” 
or “novel,” i.e., the OR occurs in re- 
sponse to stimuli which have not been 
repeatedly presented to the organism. 
(b) The OR is a nonspecific reaction, 
i.e., it occurs in response to changes, of 
at least “just noticeable” value, in any 
parameter (frequency, amplitude, dura- 
tion, rate of presentation) of a stim- 
ulus belonging to any sense modality. 
(c) The OR selectively habituates or 
extinguishes to a stimulus which is re- 
peatedly presented. Selective habitua- 
tion is demonstrated by the reemerg- 
ence of the OR which occurs when one 
of the parameters of a previously habit- 
uated stimulus is changed. 


1 The research reported here was sup- 
ported by a research grant, Project No. 
31687, from the National Aeronautics and 
Space Administration. The author wishes to 
thank G. Chernault and S. Klein for their 
assistance in conducting the experiments. 
Major portions of this paper were presented 
at the meeting of the Pavlovian Society of 
North America, Baltimore, September 6, 
1968. 


Pavlov (1927) noted the biological 
significance of the OR and further ob- 
served that this reaction plays an im- 
portant role in the initial formation of 
conditioned responses, and that the OR 
to extraneous stimuli is capable of 
temporarily inhibiting a well-established 
conditioned response and disinhibiting 
one which is in the process of extinc- 
tion. 

In addition to the importance which 
results from its complex interactions 
with conditioned responses, the OR 
possesses independent significance since 
habituation may, in itself, be viewed 
as a simple form of learning (Hernan- 
dez-Peon & Brust-Carmona, 1961; 
Thorpe, 1963). The response changes 
(progressive decrements) which take 
place as a function of repeated expo- 
sure to the stimulus cannot be ascribed 
to fatigue or receptor adaptation. 
Highly selective or specific habituation 
argues against both possibilities. The 
operations and consequent response 
changes which define OR habituation 
and reemergence would seem to neces- 
sitate at least the temporary storage of 
relevant information within the central 
nervous system. 

These last considerations have led 
several investigators (cf. Lynn, 1966) 
to speculate on the central neural sys- 
tems which are responsible for the 
production and habituation of the OR. 
Because of their circumscribed experi- 
mental operations and probably because 
Pavlov originally described the OR 
as the “what is it?” reflex, these in- 
vestigators have limited their attempts 
to neural schemata which are more or 
less capable of handling the stimulus 
learning features of OR habituation. 
Sokolov (1960, 1963a, 1963b), e.g., 
has suggested a central “modeling sys- 
tem” which acts in functional parallel 
with both the classical afferent system 
and the amplifying system responsible 
for the production of the OR. Most or 
all of the discriminable aspects of the 
stimulus are encoded within this sys- 
tem, i.e., a “neuronal model” of the 
stimulus is constructed on the basis of 
repeated presentation of the stimulus 
during the habituation series. When 
a stimulus coincides with a previously 
established model, inhibitory impulses 
are sent from the modeling to the 
amplifying system, attenuating the re- 
sponse of the last system to the extent 
that concordance has been achieved 
between model and stimulus. 

Although Sokolov’s conceptualiza- 
tion is capable of interpreting those 
features of the extensive OR data 
which demonstrate that a kind of stim- 
ulus learning takes place during habit- 
uation, his concept of the “neuronal 
model” is limited to this extent. Other 
investigators (Berlyne, 1960; Kvasov 
& Korovina, 1965) have suggested, 
however, that at least some of the OR 
components (increments in general 
muscle tension, postural responses) 
should properly be viewed as constitut- 
ing the question “what’s to be done?” 
This view suggests that those central 
structures which are directly or indi- 
rectly responsible for the production 
of the OR may be as concerned with 
possible behavioral requirements as 
they are with purely sensory events. 
In addition, Anokhin (1961, 1965) has 
proposed that a model of specific be- 
haviors and associated reinforcement 
outcomes is constructed on the basis 
of the afferent feedback which is pro- 
duced by behavior and the results of 
behavior. 

It is interesting to note that although 
Sokolov (1963b) has provided a sub- 
stantial amount of data concerning the 
effects of endowing stimuli with “sig- 
nal significance,” i.e., pairing them with 
behavioral responses, he has not in- 
cluded response characteristics in his 
conception of the “neuronal model” of 
the stimulus.² 

The research to be reported was de- 
signed to systematically explore the 
possibility that the “neuronal model” 
of a stimulus includes not only the 
parameters of the stimulus, but also 
the characteristics of associated re- 
sponses. 

An important methodological feature 
of these studies should be noted here. 
In all three studies, Ss were instructed 
not to respond behaviorally during the 
presentation of the experimental stimuli 
but were provided with a subsequent 
blank or “response” interval during 
which they performed any instructed 
behavior. Both the stimulus and re- 
sponse periods were of 10-sec. dura- 
tion. This procedure was employed to 
insure that the GSRs to the experi- 
mental stimuli would clearly antecede 
the behavioral responses and would, 
therefore, be unconfounded with the 
autonomic activity that directly accom- 
panies overt behavioral responding. 


2 Sokolov (1963b, p. 287) defines the neu- 
ronal model as “a certain cell system whereby 
the information is stored concerning the 
properties of a stimulus which has been ap- 
plied many times.” 


EXPERIMENT I 


The first study in the series explored 
the effects of a behavioral association 
on the ORs to stimuli and, in addition, 
generally tested for selective habitua- 
tion to response characteristics. 


Method 


Subjects—Thirty-two male and female 
undergraduates enrolled in an introductory 
psychology course volunteered to participate 
in the study. 

Apparatus—A Grass Model 7 polygraph, 
equipped with 7P5 preamplifier, was used 
along with Ag/AgCl electrodes and EC-2 
electrode cream to record GSRs. 

The experimental stimuli consisted of 
35-mm slides containing any single number 
from one to eight in written form. The stim- 
uli were presented by a Kodak Carousel pro- 
jector and the duration of stimulus and re- 
sponse intervals were determined at 10 sec. 
each by a Lafayette timer. Impulses from 
the timer were automatically recorded by the 
polygraph. 

Procedure—One-half of the Ss was re- 
quired to respond, in Stage I, by simply 
saying the odd numbers after they had been 
presented. They were instructed not to 
repeat the even numbers. The remaining 16 
Ss were required to respond to the even 
but not to the odd numbers. Again, Ss 
were instructed not to respond while the 
stimuli were being presented. Four odd and 
four even slides were presented. 

In Stage II, a different random order of 
the eight stimuli was used. One-half of each 
Stage I group (N=8) was instructed to 
respond to the class of stimuli (odd or even 
numbers) to which they had responded in 
Stage I, whereas the remaining Ss were in- 
structed to respond to the class of stimuli 
to which they had not been responding. 
These constituted the unswitched (U) and 
switched (S) groups, respectively. 


Results and Discussion 


The GSRs to the 16 experimental 
stimuli were measured in change in 
conductance units. Change scores 
were computed as the difference be- 
tween the conductance level imme- 
diately preceding the GSR and the 
peak amplitude value of the GSR. 
Only resistance changes 500 ohms or 
greater were considered and this same 
criterion was used to separate multiple 
GSRs, i.e., if the first GSR was sepa- 
rated from a second response by an 
increase in resistance greater than 500 
ohms, the two responses were con- 
sidered as different and only the mag- 
nitude of the first response was com- 
puted. Only GSRs with an onset 
latency of 1-5 sec. were used. This 
last procedure, combined with the 
multiple-response cutoff criterion, pro- 
vided some assurance that only initial 
responses to the stimuli were investi- 
gated. These might reasonably be 
identified as ORs. 

The square root transformation was 
applied to the change in conductance 
scores. 
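
The scoring rules described above can be summarized in a brief sketch. Several points are assumptions made for illustration only: the function name and its arguments are hypothetical, conductance is expressed in micromhos, the 500-ohm criterion is read as a drop in resistance from the pre-response level to the peak of the response, and the rule for separating multiple responses is not modeled.

```python
import math


def score_gsr(base_ohms, peak_ohms, latency_sec,
              min_change_ohms=500.0, latency_window=(1.0, 5.0)):
    """Return the square root of the conductance change (micromhos),
    or None when the response fails the amplitude or latency criterion."""
    resistance_drop = base_ohms - peak_ohms
    if resistance_drop < min_change_ohms:
        return None  # resistance change smaller than 500 ohms
    earliest, latest = latency_window
    if not (earliest <= latency_sec <= latest):
        return None  # onset latency outside the 1-5 sec. window
    # Conductance is the reciprocal of resistance; 1e6 / ohms gives micromhos.
    delta_conductance = 1e6 / peak_ohms - 1e6 / base_ohms
    return math.sqrt(delta_conductance)


too_small = score_gsr(100000.0, 99600.0, 2.0)
too_late = score_gsr(100000.0, 90000.0, 6.0)
scored = score_gsr(100000.0, 80000.0, 2.0)
```

In this illustration a fall from 100,000 to 80,000 ohms corresponds to a change of 2.5 micromhos, so the transformed score is the square root of 2.5.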

The results of this study are pre- 
sented in Fig. 1. It is apparent that 
the respond (R) stimuli, those which 
were followed by simple behavioral re- 
sponses, consistently produced greater 
GSRs than the nonrespond (NR) 
stimuli. In addition to demonstrating 
the R-NR and habituation or trials 
effects, the Stage II data indicated that 
the GSRs to R and NR stimuli across 
trials were, in part, determined by the 
Group conditions, i.e., whether Ss were 
producing “new” or “old” responses. 

A Groups (S-US) X Response 
Condition (R-NR) X Trials analysis 
of the Stage I data revealed significant 
R-NR, F (1, 30) = .451/.077 = 5.86, 
p<.05, and Trials, F (3, 90) = 
2.493/.063 = 39.57, p < .001, effects. 

The same analysis applied to the 
Stage II data yielded significant R- 
NR, F (1, 30) = .310/.050 = 6.20, 
p < .05, Trials, F (3, 90) = .353/.071 
= 4.97, p <.01, and Groups X Re- 
sponse Condition X Trials, F (3, 90) 
= .414/.080 = 5.18, p < .01, effects. 
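
The F ratios above are printed together with the two quantities from which they were formed. Reading each pair as the mean square for the effect over the mean square for error (an interpretation, since the text does not label the terms), the arithmetic can be rechecked as follows; the variable names and the rounding tolerance are illustrative choices.

```python
# (label, assumed MS effect, assumed MS error, F as printed in the text)
reported = [
    ("Stage I, R-NR", 0.451, 0.077, 5.86),
    ("Stage I, Trials", 2.493, 0.063, 39.57),
    ("Stage II, R-NR", 0.310, 0.050, 6.20),
    ("Stage II, Trials", 0.353, 0.071, 4.97),
    ("Stage II, Groups X Response Condition X Trials", 0.414, 0.080, 5.18),
]


def f_ratio(ms_effect, ms_error):
    """F as the ratio of the effect mean square to the error mean square."""
    return ms_effect / ms_error


# Keep any entry whose recomputed ratio differs from the printed value
# by more than two-decimal rounding would allow.
discrepancies = [
    (label, round(f_ratio(ms_effect, ms_error), 3), printed)
    for label, ms_effect, ms_error, printed in reported
    if abs(f_ratio(ms_effect, ms_error) - printed) > 0.006
]
```

With the values as printed, no entry exceeds the tolerance, so the reported ratios are internally consistent to two decimal places.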

To further test the hypothesis that 
the Stage II, Trial 1 R-GSRs of the 
S group constituted a significant in- 
crement over their terminal R-GSRs 
of Stage I, a t test was applied to the 
difference between these means, yield- 
ing a paired t (15) = .259/.094 = 2.76, 
p < .01. The same test carried out on 
the US means resulted in a t (15) = 
.044/.094 = .47. 

Fig. 1. Mean GSRs to respond (R) and 
nonrespond (NR) stimuli across habituation 
trials. (In Stage II, switched Ss performed 
responses different from those of Stage I, 
whereas unswitched Ss performed the same 
responses in both stages.) 

The latter finding suggests, in a gen- 
eral way, that selective habituation 
occurs to response characteristics. 


EXPERIMENT II 


The second experiment investigated 
the relationship between degree of 
Stage II response change and the ex- 
tent of reemergence of the OR. It 
constituted, therefore, a further test of 
selective habituation to response char- 
acteristics. 


Method 


Subjects—Thirty male and female under- 
graduates from an introductory psychology 
course volunteered to participate. 

Apparatus—The same apparatus was used 
to measure GSRs, to present experimental 
stimuli, and to time the stimulus and re- 
sponse intervals. 

An additional polygraph channel, equipped 
with the Grass 7P1 preamplifier, was used 
along with the SP1 accelerometer to measure 
the finger movement responses which were 
the instructed behavior in this study. 

Alternating light and dark intervals of 10 
sec. each constituted the stimulus and re- 
sponse intervals, respectively. 

Procedure—In Stage I, one-half of the Ss 
was instructed to respond to each light stim- 
ulus by making one up-down movement of 
the left index finger during the subsequent 
dark or response interval. The remaining 
15 Ss were instructed to respond by pro- 
ducing 10 such movements. The accelerom- 
eter, taped to the left index finger, recorded 
these finger movements. 

Subsequent to eight Stage I trials, Ss were 
instructed to respond with either 1, 5, or 10 
movements. The instructions differed only 
in terms of the number of requested re- 
sponses. One-third (N = 5) of each Stage I 
group was instructed to make 1 response, 
one-third was told to produce 5 movements, 
and one-third was instructed to make 10 
responses. Thus, three Stage II conditions 
were established: 10 Ss continued to respond 
with the Stage I response (1 or 10 move- 
ments); 10 Ss were requested to make a 
mean response change of 4.5 movements 
(changes from 1 to 5 or 10 to 5 responses) ; 
and 10 Ss were instructed to make a re- 
sponse change of 9 movements (1 to 10 or 10 
to 1). In this way, Stage I response fre- 
quency was counterbalanced and a direct 
test of the effects of response change on 
the reemergence of the OR on two Stage II 
trials was afforded. 
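
The design of Exp. II can be laid out explicitly. The following sketch enumerates the six cells formed by crossing Stage I and Stage II response frequency and groups them by response change; the variable names and condition labels are hypothetical, and only the cell sizes and frequencies come from the text.

```python
from itertools import product
from statistics import mean

STAGE1 = (1, 10)     # movements requested in Stage I
STAGE2 = (1, 5, 10)  # movements requested in Stage II
SS_PER_CELL = 5      # one-third of each 15-S Stage I group

cells = [
    {"stage1": s1, "stage2": s2, "change": abs(s2 - s1), "n": SS_PER_CELL}
    for s1, s2 in product(STAGE1, STAGE2)
]


def condition(cell):
    """Assign a cell to one of the three Stage II response-change conditions."""
    if cell["stage2"] == cell["stage1"]:
        return "no change"
    if cell["stage2"] == 5:
        return "intermediate"
    return "maximal"


summary = {}
for cell in cells:
    entry = summary.setdefault(condition(cell), {"n": 0, "changes": []})
    entry["n"] += cell["n"]
    entry["changes"].append(cell["change"])

mean_change = {name: mean(entry["changes"]) for name, entry in summary.items()}
```

The intermediate condition pools changes of 4 movements (1 to 5) and 5 movements (10 to 5), which is why its mean response change is 4.5; each of the three conditions contains 10 Ss, drawn equally from the two Stage I groups.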


Results and Discussion 


The results of this study are pre- 
sented in Fig. 2. It is apparent that 
Stage I response frequency did not 
affect GSR magnitude in any consistent 
fashion and that habituation took place 
across trials. A Groups X Trials 
analysis revealed only a significant 
Trials or habituation effect, F (7, 196) 
= .349/.087 = 4.01, p < .001. 

The data for the two Stage II trials 
demonstrated graded reemergence of 
the OR on the basis of degree of Re- 
sponse Change. A Stage I Frequency 
X Response Change X Trials analysis 
of the Stage II data yielded significant 
Response Change, F (2, 24) = .920/ 
.262 = 3.51, p < .05, and Trials, F (1, 
24) = .173/.054 = 3.20, p < .10, 
effects. 

These findings suggest that highly 
selective habituation occurs to response 
characteristics and that the central 
structures which are responsible for the 
production and habituation of the OR 
are sensitive to response as well as 
stimulus parameters. 


EXPERIMENT III 


If habituation may be regarded as 
the result of progressive encoding of 
both stimulus and response character- 
istics, then it would follow that habitua- 
tion can be affected by the amount of 
stimulus and response information that 
will be encoded. The last experiment 
attempted to manipulate the overall 
level of ORs across habituation trials 
by varying the amount of response in- 
formation to be encoded. 

Fig. 2. Mean GSRs across habituation 
trials of Ss performing either 1 or 10 finger 
movements in Stage I and requested to make 
a mean response change of 0, 4.5, or 9.0 
movements in Stage II. 

JOSEPH GERMANA 


Method 


Subjects—Forty male and female students 
volunteered. 

Procedure—The Ss were randomly as- 
signed to four groups of 10 Ss each: Group 
0 Ss were not requested to make any re- 
sponse to the light stimulus; Group 5 Ss 
were instructed to respond by saying any 
number from 1 to 5 (one-half was requested 
to say any number from 6 to 10), but 
were not to repeat the same number on two 
successive trials; Group 10 Ss were required 
to respond by saying any number from 1 to 
10, also with the provision that they were not 
to make the same response on any two suc- 
cessive trials. A last 10M group was added 
to investigate the effects of mixing response 
classes. These Ss were required to respond 
by saying any number from 1 to 5 (or 6 to 
10) or the letters A to E, with the same 
restriction on successive repetition. Again, 
all Ss were told not to make the instructed 
response during the 10-sec. stimulus light 
period but were to withhold their responses 
for the 10-sec. dark response interval. The 
GSRs of all Ss to the signal light stimuli 
were measured on 10 habituation trials. 


Results and Discussion 


Figure 3 shows that although habi- 
tuation took place for all groups, the 
overall level of GSRs across the habi- 
tuation series was a positive function 
of the amount of response information. 

A Groups X Trials analysis of vari- 
ance was applied to these data. Sig- 
nificant Groups, F (3, 36) = 1.777/ 
.579 = 3.07, p < .05, and Trials, F (9, 
324) = .895/.092 = 9.73, p < .001, ef- 
fects were obtained. 

A Duncan multiple-range test (Ed- 
wards, 1960) applied to the group 
means revealed that only the 10-10M 
difference was not significant beyond 
the .05 level. 

Fig. 3. Mean GSRs across habituation 
trials of Ss performing no responses (0), 
1 of 5 possible responses (5), 1 of 10 possible 
responses (10), or 1 of 10 possible mixed- 
class responses (10M), on each trial. 

It should be noted that an interpre- 
tation other than that based on re- 
sponse encoding may be applied to 
these results. It may be suggested, 
e.g., that level of GSRs across habitua- 
tion trials primarily reflects the gener- 
ality of the preparation to respond 
behaviorally. The Ss in Group 0 were 
not required to make any preparation 
for overt behavior. Group 5 Ss, on the 
other hand, were required to prepare 
for one of five possible responses, while 
both the 10 and 10M Ss were required 
to make a preparation for 1 of 10 
possible responses. The hypothesis 
that magnitude of GSR in the present 
situation may have been affected by the 
degree of experienced “responsibility,” 
i.e., by the degree of generality of be- 
havioral preparations, receives support 
from the fact that Trial 1 GSRs show 
apparent group differences in Exp. III 
but not in Exp. II. 

Additional support for such an inter- 
pretation may be found in the ancillary 
observation that “response” require- 
ments typically resulted in postinstruc- 
tional shifts in base levels toward in- 
creased skin conductance whereas the 
“no response” condition did so to a 
lesser extent. It was primarily for this 
reason that the square root transforma- 
tion was performed, since this treat- 
ment operates better than any other 
nonempirical transformation to free 
GSRs from base-line influences. 


GENERAL DISCUSSION 


Whenever it can be demonstrated that 
a system is capable of detecting differ- 
ences between past and present events, 
then it would seem necessary to conclude: 
(a) that the system is receiving informa- 
tion concerning the present event; (b) 
that it already possesses information, in 
some form, of the past event; and (c) 
that it possesses some mode of com- 
parison. If it is necessary, then, to postu- 
late at least temporary storage of stimulus 
parameters in order to explain the demon- 
stration of habituation which is selective 
to those characteristics, then it is equally 
necessary, on the basis of the data re- 
ported here, to postulate the encoding of 
response characteristics. It appears, 
therefore, that the “neuronal model” of 
a stimulus must include the characteristics 
of associated responses. 

A peculiar feature of this last statement 
is the implication that progressive S-R 
learning (habituation) involving molar 
behavior may be shown to have occurred 
in the absence of any modification of that 
behavior beyond its original production. 
The results of Exp. II demonstrated, e.g., 
that Ss progressively associated a particu- 
lar response to the experimental stimulus 
in Stage I. That such learning had, in 
fact, taken place was demonstrated by 
Stage I habituation and by the graded 
reemergence of the OR in Stage II when 
Ss were asked to produce a different be- 
havior to the stimulus. The behavioral 
responses of Stage I, however, were in- 
variant in their “correctness” from Trial 
1 to Trial 8, providing no indication of 
a trial-by-trial learning. 

Returning to the suggestion that many 
of the autonomic-somatic components of 
the OR may, either directly or indirectly, 
represent the question “what’s to be 
done?” it may be stated that the central 
structures responsible for the production 
of the OR are as concerned with re- 
sponse characteristics as they are with 
stimulus parameters. This rephrasing of 
the “what is it?” question is reminiscent 
of Sperry’s (1952) treatment of percep- 
tion and his encouragement to view the 
brain as a mechanism for governing 
motor activity. 


RETROACTIVE INHIBITION IN FREE-RECALL LEARNING 
WITH ALPHABETICAL CUES ¹ 


BONNIE ZAVORTINK ² AND GEOFFREY KEPPEL 
University of California, Berkeley 


It was hypothesized that the repetition of alphabetical coding cues 
between lists would increase retroactive inhibition (RI) in free- 
recall learning (FRL). The amount of alphabetical coding was 
varied by instructions to some Ss to use alphabetical cues in learning 
and instructions to others for free recall; overlap in cues was 
manipulated by the assignment of some Ss to learn materials with 
the same 12 1st letters in the 2 lists (2 words/cue), and some to 
learn lists with 12 different 1st letters. List 1 was learned to a 
criterion of 16/24 correct responses and List 2 was presented for 
6 trials. Control Ss learned a single list and worked on arithmetic 
problems for the duration of List 2 learning. Following the inter- 
polated activity, Ss recalled, in any order, the words of the list(s) 
they had learned. The results supported the hypothesis, with a large 
increase in RI for Ss who were required to use the same alphabetical 
cues in both lists. 


A number of accounts of the occur- 
rence of retroactive inhibition (RI) in 
free-recall learning (FRL) have re- 
cently appeared in the literature (Asch 
& Ebenholtz, 1962; Postman & Keppel, 
1967; Shuell & Keppel, 1967; Tulving 
& Thornton, 1959). However, analyti- 
cal investigations of the mechanisms of 
RI in FRL of the sort available for 
paired-associate learning (McGovern, 
1964) and serial learning (Keppel, 
1966) have not as yet appeared. Post- 
man and Keppel (1967) suggested that 
the explanation of RI in the more 
traditional learning tasks by reference 
to contextual and interitem effects 
might be extended to FRL; the present 
study was an attempt to test experi- 
mentally the implication that the RI 

2 Now at Georgetown University, Wash- 

in FRL might be due in part to over- 
lap in organizational cues for succes- 
sively learned lists. Specifically, it was 
suggested that if S used the same cues 
to learn the items of the two lists, the 
classical negative transfer paradigm of 
A-B, A-C would be formed so that 
unlearning might occur. One type of 
organizational cue which has received 
some attention in FRL is that of alpha- 
betization. Tulving (1962), e.g., found 
that the requirement to recall words in 
alphabetical order facilitated learning, 
even though alphabetization occurred 
spontaneously at a low rate among Ss. 
Similar results have been reported by 
Earhard (1967a, 1967b). In the pres- 
ent experiment the amount of alpha- 
betical organization was varied by in- 
structing some Ss to use alphabetical 
cues in learning, and instructing others 
for free recall. Manipulation of over- 
lap in cues was effected by the assign- 
ment of some Ss to learn materials with 
the same first letters in the two lists, 
and some to learn lists with different 
first letters. It was predicted that the 
repetition of letters would increase the 
RI for the group instructed to use the 
alphabetical cues, but would not affect 
Ss working under free-recall instruc- 
tions. 


METHOD 


Design—The experiment included two 
control (C) and six experimental (E) condi- 
tions. The C groups learned one list of 
words with instructions for free (F) or 
alphabetical (A) recall, and then worked 
arithmetic problems. Two of the E groups 
learned List 1 under A conditions, and four 
learned it under F conditions. Of these, the 
two A groups then learned a second list 
under A instructions, while the F groups 
transferred to a new list with either F or A 
instructions. (Due to the possibility that Ss 
given A instructions for List 1 would con- 
tinue to alphabetize on List 2, the AF condi- 
tion was not included in the experiment.) 
Within each instruction sequence for the E 
conditions (AA, FF, and FA), one group 
had words of the same (S) first letters in 
the two lists, and one received different (D) 
first letters. The groups will be denoted by 
the appropriate letter combinations, e.g., 
EFAD for the experimental group with free- 
recall instructions on List 1, alphabetization 
instructions on List 2, and different letters 
in the two lists. 

Materials—Each list consisted of 24 words, 
with two words each of 12 different first let- 
ters. The 12-letter sets were chosen by 
omitting “x” and “z,” and then randomly 
splitting the remaining letters into two 
groups, with the restriction that three letters 
from each quarter of the alphabet appear in 
each set. There were two different lists for 
each letter set. 
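
The construction of the two letter sets can be sketched in a few lines. This is only an illustration under stated assumptions: the function name `split_letter_sets` is hypothetical, and a "quarter of the alphabet" is taken to mean six consecutive letters of the 24 that remain after "x" and "z" are dropped, which the text does not spell out.

```python
import random
import string


def split_letter_sets(seed=None):
    """Split the 24 usable letters into two 12-letter sets.

    Assumption: each quarter is six consecutive letters of the 24 letters
    left after removing x and z.
    """
    rng = random.Random(seed)
    letters = [c for c in string.ascii_lowercase if c not in "xz"]
    quarters = [letters[i:i + 6] for i in range(0, 24, 6)]
    set_one, set_two = [], []
    for quarter in quarters:
        chosen = rng.sample(quarter, 3)  # three letters per quarter go to set one
        set_one.extend(chosen)
        set_two.extend(c for c in quarter if c not in chosen)
    return sorted(set_one), sorted(set_two)


if __name__ == "__main__":
    first, second = split_letter_sets(seed=1)
    print("".join(first), "".join(second))
```

Sampling within each quarter, rather than shuffling all 24 letters and checking the restriction afterward, guarantees that every split satisfies the three-per-quarter rule.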

The words making up the lists were all 
5-7 letter, dissyllabic nouns with 1-25 fre- 
quency in the G count of the Thorndike and 
Lorge (1944) list. In order to obtain four 
words beginning with “y,” two words with 
the same length and syllable restrictions were 
chosen from a dictionary. All four words of 
a given first letter had different second let- 
ters except those beginning with “q,” which 
differed in their third letters. Words were 
randomly assigned to particular lists, with 
the restrictions that the lists be equated for 
word length and frequency. Each list was 
used an equal number of times as List 1 in 
each group, and also was used equally often 
as List 2 for the S and D conditions. 

Six orders of items which did not con- 
tain sequence or position repetitions nor 
repetitions of the same first letter within a 
sequence were used. Each order was se- 
lected about equally often as a starting point 
in each condition. 

Procedure—The method of learning for 
both lists consisted of alternating study 
trials, during which the words were shown 
on a memory drum, and test trials, during 
which S wrote his recall. Items were shown 
at a 1.5-sec. rate, with 60 sec. for recall 
and a 30-sec. pause before the start of the 
next study trial. 

For the F conditions, Ss were instructed 
to recall the words in any order. The Ss of 
the A conditions were told to organize their 
recall alphabetically by noting the first letters 
of the words on the study trials and going 
through the alphabet at recall. 

List 1 was learned to a criterion of 16/24 
correct responses, and List 2 was presented 
for six trials. The C groups worked arith- 
metic problems for the duration of List 2 
learning (14 min.). Subsequent to List 2 
learning or the C task, Ss were instructed to 
recall, in any order, the words of the list(s) 
they had learned. The C groups were al- 
lowed 2.5 min. for this; the E groups, 5 min. 
This was followed by a cued-recall test in 
which S wrote the words opposite the alpha- 
betically ordered first letters of the items he 
had learned. Again 2.5 and 5 min. were al- 
lowed for the C and E groups, respectively. 

Subjects—There were 16 Ss in each condi- 
tion, 14 of whom were students fulfilling re- 
quirements for undergraduate psychology 
courses and 2 of whom were paid student 
volunteers. They were assigned to condi- 
tions in order of arrival at the experiment, 
according to a roster containing 16 blocks 
of independently randomized conditions. 
There were 3 Ss in the A conditions who 
did not recall the words in the A to Z order; 
each was replaced by the next S to appear. 
No S in the F groups adopted the strategy of 
alphabetical recall on his own accord. 
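
A roster of the kind described, 16 blocks each holding the eight conditions in an independent random order, might be generated as follows. The sketch is illustrative only; the function name `build_roster` is hypothetical, and the condition labels are those defined under Design.

```python
import random

# The two control and six experimental conditions named under Design.
CONDITIONS = ["CF", "CA", "EFFD", "EFFS", "EFAD", "EFAS", "EAAD", "EAAS"]


def build_roster(n_blocks=16, seed=None):
    """Return a flat assignment order made of independently shuffled blocks."""
    rng = random.Random(seed)
    roster = []
    for _ in range(n_blocks):
        block = CONDITIONS[:]   # every condition appears once per block
        rng.shuffle(block)      # each block is randomized independently
        roster.extend(block)
    return roster


if __name__ == "__main__":
    roster = build_roster(seed=7)
    print(roster[:8])  # conditions for the first eight Ss to arrive
```

Assigning Ss down such a roster in order of arrival keeps the conditions balanced at the end of every block of eight Ss.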


RESULTS 


List 1 learning—The mean trials to 
criterion for the A groups was 4.33 
and for the F groups, 5.25, a difference 
which is significant at p < .05, F (1, 
120) = 6.54. Variation among the A 
groups and among the F groups was 
not reliable. Contrary to the findings 
of Tulving (1962) and Earhard 
(1967a, 1967b), the overall facilitation 
due to A instructions was slight. In 
fact, the F groups were actually supe- 
rior early in learning. An analysis of 
Trials 1 and 2, the only trials on which 
all Ss are represented, revealed a sig- 
nificant F-A difference, F (1, 120) = 
6.44, p < .05, with means for the two 
trials combined of 19.61 and 17.62 for 
the F and A groups, respectively. The 
interaction of Instructions × Trials was 
not reliable (F < 1). 

It was noted earlier that none of the 
Ss in the F groups and all of those who 
were included in the A groups recalled 
the words in a gross alphabetical order. 
For this experiment, however, the criti- 
cal question is the use of the first letters 
as a recall cue. Thus, it is conceivable 
that Ss in the F groups might occasion- 
ally use some of the first letters as an 
aid in recall. In order to evaluate this 
possibility, an analysis was performed 
on the repetitions, on the last recall 
trial, of words beginning with the same 
letter. For this test the expected num- 
ber of repetitions, E(R), was sub- 
tracted from the obtained number, 
O(R), according to the method out- 
lined by Bousfield and Puff (1964). 
The average difference was 6.31 for 
the A groups and .70 for the F groups, 
with F (1, 120) = 439.51. This clus- 
tering was reliably greater than zero 
even for the F groups, t (79) = 5.90. 

List 2 learning—The total number 
of correct responses for each group ap- 
pears in Table 1 along with the number 


TABLE 1 
LIST 2 LEARNING SCORES 

Condition   Trial 1   Trial 6   Total Correct in 6 Trials 
EFFD          8.19     17.38     82.50 
EFFS          8.19     17.56     81.80 
EFAD          6.25     19.69     87.06 
EFAS          6.12     19.81     86.38 
EAAD          6.81     19.25     84.88 


correct on the first and last trials. 
While the groups did not differ in total 
scores, there was a significant differ- 
ence in the slopes of the learning curves 
for the groups receiving F or A in- 
structions during second-list learning, 
F (5, 450) = 14.78. This interaction 
was produced by a superiority of the 
F groups on Trial 1 (8.18 vs. 6.31) 
changing to an inferiority on the later 
trials (e.g., 17.47 vs. 19.06 on Trial 6). 
Separate analyses on the first and last 
trials indicated that both differences 
were reliable, F (1, 90) = 16.75 and 
4.43, p < .01 and .05, for Trials 1 and 
6, respectively. Although the inter- 
actions did not approach significance, 
it may be noted that of the four groups 
learning List 2 under A instructions, 
Cond. EAAS is clearly lower than the 
other three conditions in the total score, 
suggesting the presence of negative 
transfer when there is an appreciable 
overlap in the coding cues employed in 
the learning of the two lists. 

Clustering scores for words of the 
same first letter were computed for 
each trial using O(R) minus E(R) 
differences. As an adjustment for dif- 
ferences in level of learning, these devi- 
ations were expressed as percentages 
of the maximum possible number minus 
the expected number of repetitions. 
For Cond. EFAD, EFAS, and EAAD, 
the mean percentages on Trials 1 and 
2 were all 100% with averages for the 
six trials of 99%, 98%, and 96%, re- 
spectively. For Cond. EAAS the Trial 
1 clustering percentage was only 48% ; 
however, on Trial 2 it had climbed to 
96% and was about equal to the other 
three groups for the remainder of 
learning (97% average for Trials 2- 
6). This initial deficit in clustering 
might be indicative of the differential 
negative transfer being experienced by 
Ss in this particular condition. 
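
The clustering measure can be made concrete with a short sketch. Two points are assumptions rather than statements of the source: the expected number of repetitions is computed with a commonly used formula, E(R) = (sum of squared category counts) / n - 1, and the maximum is taken as n minus the number of categories represented. The function name `clustering_scores` is hypothetical. Under these assumptions the percentage is undefined whenever every recalled word comes from a different first-letter category, because maximum and expected values are then both zero.

```python
from collections import Counter


def clustering_scores(recall, key=lambda word: word[0].lower()):
    """Return (O(R), E(R), Max(R), percentage) for one recall protocol.

    recall is the list of words in output order; key maps a word to its
    category (here, its first letter). The percentage is None if undefined.
    """
    cats = [key(word) for word in recall]
    n = len(cats)
    if n == 0:
        return 0, 0.0, 0, None
    # O(R): adjacent pairs of words that share a category.
    observed = sum(1 for a, b in zip(cats, cats[1:]) if a == b)
    counts = Counter(cats)
    # Assumed formula for E(R); see the lead-in.
    expected = sum(m * m for m in counts.values()) / n - 1
    maximum = n - len(counts)
    denom = maximum - expected
    pct = None if denom == 0 else 100 * (observed - expected) / denom
    return observed, expected, maximum, pct


if __name__ == "__main__":
    print(clustering_scores(["apple", "anchor", "bottle", "basket"]))
    print(clustering_scores(["apple", "bottle", "anchor", "basket"]))
```

Dividing by the maximum minus the expected value, rather than reporting the raw deviation, keeps the score comparable across Ss who recalled different numbers of words.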

List 1 recall—Table 2 contains a 

TABLE 2 
SCORING OF LIST 1 RECALL 

Condition | Total Recall | Category Recall | MWRPC a | Percentage of Clustering b | Cued Recall 
CF        | 16.50        | 11.00           | 1.49    | 13                         | 16.81 
EFFD      |  9.56*       |  7.50*          | 1.26*   | 14                         | 10.19* 
EFFS      |  8.25*       |  6.56*          | 1.22*   | 26                         |  9.50* 
EFAD      | 14.81        | 10.00           | 1.47    | 17                         | 15.69 
EFAS      | 13.88        |  9.75           | 1.40    | 15                         | 13.38* 
CA        | 16.81        | 11.06           | 1.51    | 76                         | 17.19 
EAAD      | 13.12*       |  9.31*          | 1.38    | 87                         | 15.12 
EAAS      |  8.38*       |  6.56*          | 1.24*   | 53                         |  9.75* 

a Mean word recall per category. 
b [O(R) − E(R)] / [Max(R) − E(R)]. 
* Significant C-E difference, p < .05. 


summary of the various indices of List 
1 recall. The first column gives the 
total number of words recalled (TR) 
by each of the eight groups. The TR 
scores were separated into two com- 
ponents, the number of different first- 
letter categories represented in recall 
(CR) and the mean word recall per 
category (MWRPC). (A TR score 
is the product of the two component 
scores.) For each measure an analysis 
of variance was performed, followed by 
Dunnett’s tests for the C-E differences. 
The significant C-E differences (p < 
.05) are noted in the table. Conditions 
EFFS, EFFD, and EAAS have simi- 
lar patterns of results, all showing sig- 
nificant differences from their appro- 
priate C groups in TR, CR, and 
MWRPC. Condition EAAD differed 
from its control only on the TR and 
CR measures, while Cond. EFAD and 
EFAS did not differ from Cond. CF 
on any measure. 
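
The separation of a TR score into its CR and MWRPC components can be illustrated for a single recall protocol. The sketch below is only an illustration; the function name `recall_components` is hypothetical, and it assumes that a word written twice is counted once.

```python
def recall_components(recalled_words):
    """Return (TR, CR, MWRPC) for one S's written recall.

    TR    = number of different words recalled
    CR    = number of first-letter categories represented
    MWRPC = mean word recall per represented category (TR / CR)
    """
    by_letter = {}
    for word in recalled_words:
        w = word.lower()
        by_letter.setdefault(w[0], set()).add(w)  # a repeated word counts once
    tr = sum(len(words) for words in by_letter.values())
    cr = len(by_letter)
    mwrpc = tr / cr if cr else 0.0
    return tr, cr, mwrpc


if __name__ == "__main__":
    tr, cr, mwrpc = recall_components(["fjord", "forest", "isthmus"])
    print(tr, cr, mwrpc)
```

The identity TR = CR x MWRPC holds exactly for each S. It need not hold for the group means reported in Table 2, since the product of two means is not in general the mean of the products.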

Among the E groups, it may be seen 
that the change of instructions, i.e., the 
introduction of alphabetization on List 
2 (the FA groups), resulted in a siz- 
able reduction in the interfering effects 
of interpolated learning. For all three 
measures this difference was significant 
(p < .01). Further analyses were per- 
formed on the four E groups receiving 
the same instructions on both lists (the 
FF and AA groups). An inspection 
of Table 2 indicates that a greater 
deficit in recall is associated with the 
duplication of first letters in the two 
lists. These differences were reliable 
for TR and CR measures (p < .01), 
but not for the MWRPC measure, F 
(1, 120) = 3.44, p > .05. While the 
main comparison of interest, the inter- 
action of Instructions × Cue Repetition 
(FF/AA vs. S/D), was actually in the 
predicted direction for all three mea- 
sures, in no case were these differences 
significant, F (1, 120) = 3.41, 3.32, and 
1.17, for TR, CR, and MWRPC, re- 
spectively, p > .05. (As will be dis- 
cussed later, however, the magnitude 
of this interaction may have been re- 
duced if some of the Ss in Cond. EFFS 
occasionally used alphabetical cues in 
learning.) 

The percentage of clustering scores 
of words of the same first letter (CL) 
is presented in the fourth column of 
Table 2. As was the case at the end 
of List 1 learning, higher CL scores 
were obtained for those conditions in 
which the first list was acquired under 
A instructions. This was true both 
for the C and E groups considered 
separately, F (1, 109) = 22.51 and 
40.36.³ In other comparisons among 
the E groups, the only reliable differ- 
ence was the marked reduction in the 
CL score for Cond. EAAS relative to 
Cond. EAAD, F (1, 109) = 6.59, p < 
.05. 

Following the completion of the final 
recall, Ss were given a cued-recall test. 
The results of this test can be found 
in the final column of Table 2. A com- 
parison of these scores with respective 
TR scores (Column 1) indicates that 
recall was facilitated on the cued test 
for all groups except Cond. EFAS. 
An analysis of the groups receiving 
homogeneous instructions on Lists 1 
and 2 showed that this general improve- 
ment was significant, F (1, 90) = 
29.01, and that the gain was reliably 
greater for the E groups than for the 
C groups, F (1, 90) = 6.18, p < .05. 
A similar improvement resulting from 
the cued recall of categorized words has 
been reported by Tulving and Pearl- 
stone (1966). As far as the pattern 
of results is concerned, the only changes 
from the TR of uncued recall are those 
of Cond. EAAD, which is no longer 
significantly different from Cond. CA, 
and of Cond. EFAS, which now shows 
a significant loss on this test. 

List 2 recall—Analyses on free and 
cued recall for List 2 both indicated 
that the only significant source of vari- 
ance was the List 2 instruction vari- 
able, with means of 18.50 for the com- 
bined F groups and 20.53 for the com- 
bined A groups in free recall, F (1, 
90) = 8.43, p < .01; and 18.09 vs. 
21.12 in cued recall, F (1, 90) = 16.33, 
p < .01. This F-A difference mirrors 
the terminal difference between the 
groups in List 2 learning. 

3 There were 11 Ss for whom the percent- 
age measure was undefined due to maximum 
and expected values of zero. These scores 
were estimated by the group means, with a 
corresponding reduction of 11 in df. 


Order of recall—In order to deter- 
mine a dominance index in recall order 
for the two lists which is uncontami- 
nated by different levels of recall, a 
mean z score was computed for the 
rank in recall of each S’s List 1 items. 
With this measure, increasing positive 
numbers, i.e., later recall, represent de- 
creased List 1 dominance. In general, 
List 2 had priority in recall. This 
was shown by the significant deviation 
from zero of the mean overall z score 
for first-list responses (.62), F (1, 90) 
= 155.54. The lone significant source 
of variance in the analysis was that of 
List 1 instructions for the groups re- 
ceiving A instructions on List 2. Spe- 
cifically, the combined mean for the 
FA groups was .78 as opposed to the 
.52 of the AA groups, indicating higher 
List 2 dominance for the groups with 
changed instructions, F (1, 90) = 4.76, 
p < .05. 
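
One way to form such a dominance index is sketched below. The text does not describe the computation, so the details are assumptions: each recalled word receives its rank in the output order, the ranks are standardized within the S's own protocol, and the z scores of the List 1 items are averaged. The function name `list1_mean_z` is hypothetical.

```python
from statistics import mean, pstdev


def list1_mean_z(list_of_origin):
    """Mean standardized output rank of one S's List 1 items.

    list_of_origin gives, in output order, the list (1 or 2) from which
    each recalled word came. Positive values mean that List 1 items
    tended to be written late. Returns None when the index is undefined.
    """
    n = len(list_of_origin)
    if n == 0:
        return None
    ranks = list(range(1, n + 1))
    mu, sigma = mean(ranks), pstdev(ranks)
    if sigma == 0:  # a single recalled word has no spread in rank
        return None
    z_list1 = [(rank - mu) / sigma
               for rank, origin in zip(ranks, list_of_origin) if origin == 1]
    return mean(z_list1) if z_list1 else None


if __name__ == "__main__":
    print(list1_mean_z([2, 2, 1, 1]))  # List 1 items recalled last: positive
    print(list1_mean_z([1, 1, 2, 2]))  # List 1 items recalled first: negative
```

Standardizing within each protocol is what removes the dependence on the number of words recalled, so that Ss with long and short protocols can be averaged together.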


DISCUSSION 


The results of the present experiment 
offer strong support to an analysis of RI 
in FRL which is set within the same con- 
ceptual framework as the explanations of- 
fered for RI in paired-associate and serial 
learning. With a change in the paradigm 
of cue usage from A-B, D-C (Cond. 
EAAD) to A-B, A-C (Cond. EAAS), 
a large increase in RI occurred, as has 
been found with comparable manipula- 
tions in the classical learning tasks. In 
addition, List 2 performance, i.e., CL 
scores and rate of learning, reflects the 
increased interference to which Cond. 
EAAS was presumably subjected. Sim- 
ilar results have also been reported by 
Shuell (in press) and by Winograd 
(1968) in analogous variations with 
semantic categories. In these two cases, 
greater RI was produced when the words 
in the second list were drawn from the 
same conceptual categories as those pre- 
sented in the first list. 

The locus at which RI was found is 
also consistent with an analysis in terms 
of paradigmatic relationships; i.e., Cond. 
EAAD has an A-B, D-C paradigm for 
words within categories, and showed no 
RI on the MWRPC measure, nor any re- 
duction in CL for List 1. At the same 
time, this group had an A-B, A-C par- 
adigm for specific letter categories, and 
did show RI on the CR score. More- 
over, RI was largely dissipated for this 
group when the categories were provided 
on the cued-recall test. In contrast, Cond. 
EAAS, with an A-B, A-C paradigm for 
words of a given category, showed RI 
on the MWRPC measure and also a re- 
duction, although not statistically signif- 
icant, in CL. The large reduction in RI 
found for the FA groups relative to the 
FF groups also can be interpreted in 
terms of the essential nonoverlap of the 
two sets of stimulus cues used by the Ss 
in learning List 1 under F instructions 
and List 2 under A instructions (cf. 
Shuell & Keppel, 1967); alternatively, 
this difference may have been due to the 
change in the recall instructions per se 
(cf. Postman, Keppel, & Stark, 1965). 
One discrepant finding with regard to 
paradigmatic relationships is the RI for 
Cond. EAAS on the CR score, since the 
paradigm for categories is in effect A-B, 
A-B. This outcome is probably due to 
the fact that the loss of both items of a 
category also must be reflected in scoring 
as the loss of a category. Support that 
this loss is artifactual is the finding that 
RI was not eliminated on the cued-recall 
test for Cond. EAAS as it was for Cond. 
EAAD. 

In the preceding section, the suggestion 
was made that the expected increase in 
RI for Cond. EAAS, relative to the com- 
parable groups in the other two instruc- 
tion conditions (Cond. EFFS and EFAS), 
may have been underestimated by the oc- 
currence of some alphabetical coding by 
Ss in these latter groups. That is, if the 
Ss who gave evidence of coding in List 
1 also used alphabetical cues during the 
learning of List 2, one would predict 
greater RI for the alphabetical coders in 
the S groups than in the D groups. In 
order to test this possibility, half of the 
Ss in each of the groups receiving F in- 
structions during List 1 learning were 
designated alphabetical coders and the 
other half nonalphabetical coders. This 
classification was based on the percentage 
of CL scores obtained on the last trial of 
original learning. The results of this 
analysis appear in Table 3. In general, 
although alphabetical coders tended to 
recall more first-list words in the two 
D conditions (and Cond. CF), they also 
tended to recall fewer words under the 
two S conditions. This interaction is 
clearly seen in the RI scores for the two 
sets of E groups (FF and FA). This 
finding indicates, then, that alphabetical 
coding may have been responsible for the 
greater deficit in first-list recall observed 
for the S groups in the FF and FA con- 
ditions. Moreover, this finding provides 
additional evidence that overlap in or- 
ganizational cues contributes to the RI 
of “normal,” i.e., uninstructed, free-recall 
learning. 

One question arising from this study is 
why Cond. EAAS did not suffer more RI 
than Cond. EFFS, for it seems that the 
instructions to alphabetize should have 
assured more cue overlap than had oc- 
curred “naturally” in Cond. EFFS. One 
possibility has already been mentioned, 
namely, that alphabetical coders con- 
tributed to an increase in RI for Cond. 
EFFS. A second possibility is that items 
learned in clusters with clearly available 
cues (the AA groups) may actually be 
less susceptible to interference. Support 
for this notion appears in Table 3, where 
it can be seen that the coders of Cond. 
CF recalled two more words than did the 


TABLE 3 
LIST 1 RECALL OF ALPHABETICAL CODERS AND NONCODERS FOR GROUPS 
RECEIVING F INSTRUCTIONS DURING ORIGINAL LEARNING 

                 Coders            Noncoders 
Condition      x̄      % RI       x̄      % RI 
CF           17.50     -        15.50     - 
EFFD         11.12    36.4       8.00    48.4 
EFFS          7.00    60.0       9.50    38.7 
EFAD         16.25     7.1      13.38    13.7 
EFAS         13.12    25.0      14.62     5.7 

noncoders. This difference is not due 
to a difference in learning ability for these 
two groups, since the mean numbers of 
trials to reach criterion were identical for 
the two subgroups of Ss. 
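
The % RI entries of Table 3 are not defined in the text. They appear to be consistent with the conventional relative measure, 100 x (C - E) / C, where C is the mean recall of the matching Cond. CF subgroup and E is the mean recall of the experimental subgroup. The sketch below checks that reading against the printed means; the formula and the function name `percent_ri` are assumptions, and the small discrepancies reflect rounding of the printed means.

```python
def percent_ri(control_mean, experimental_mean):
    """Relative retroactive inhibition, in percent (assumed formula)."""
    return 100.0 * (control_mean - experimental_mean) / control_mean


# Cond. CF subgroup means from Table 3, used as the control baselines.
CONTROL = {"coders": 17.50, "noncoders": 15.50}

# (subgroup, condition) -> (printed mean recall, printed % RI)
REPORTED = {
    ("coders", "EFFD"): (11.12, 36.4),
    ("coders", "EFFS"): (7.00, 60.0),
    ("coders", "EFAD"): (16.25, 7.1),
    ("coders", "EFAS"): (13.12, 25.0),
    ("noncoders", "EFFD"): (8.00, 48.4),
    ("noncoders", "EFFS"): (9.50, 38.7),
    ("noncoders", "EFAD"): (13.38, 13.7),
    ("noncoders", "EFAS"): (14.62, 5.7),
}

if __name__ == "__main__":
    for (subgroup, condition), (recall, printed) in REPORTED.items():
        computed = percent_ri(CONTROL[subgroup], recall)
        print(f"{condition:5s} {subgroup:10s} computed {computed:5.1f}  printed {printed:5.1f}")
```

Expressing the loss relative to each subgroup's own control is what allows coders and noncoders to be compared despite their different baseline recall.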

The application of analyses which were 
developed to account for the RI produced 
with paired-associate and serial lists to 
RI phenomena in FRL does not imply 
that all of the component processes of the 
tasks are identical; certainly there are 
differences in the requirements of S in 
the tasks. This fact notwithstanding, it 
is of theoretical interest that what appear 
to be analogous experimental operations 
seem to produce comparable results. A 
comprehensive theory of free recall must 
be able to handle the occurrence of RI. 
The results of the present experiment 
offer evidence that very simple limited 
capacity notions of RI are not adequate. 
That is, RI cannot be due merely to the 
“bumping out” of one item by newly 
learned ones since the combined List 1 
plus List 2 recall score for the E groups 
was significantly higher than the List 1 
recall for the C groups, F (1, 120)= 
222.67. On the other hand, it is still pos- 
sible to account for these results in terms 
of a limited retrieval system, if it is as- 
sumed that a duplication of coding cues 
in the two lists (i.e., Cond. EAAS) es- 
sentially reduces their efficiency. Clearly, 
further experiments are necessary to 
decide between these alternative ex- 
planations. 

The final interpretive point to be con- 
sidered is why the overall amount of 
facilitation afforded by the use of alpha- 
betical cues in learning is relatively small 
in the present experiment and occurs for 
both lists only after retardation early in 
acquisition. Both Tulving (1962) and 
Earhard (1967b) found superiority of A 
over F instructions on the order of three 
or four items within the first five trials. 
The discrepancies between the sets of ex- 
periments may be explained, however, by 
the presence of several factors in this 
experiment which may have reduced the 
F-A difference. For instance, the To- 
ronto list contained what appeared to be 
extremely unfamiliar words, such as 
gorget, natron, and windrow. Although 
the words of the present experiment were 
of low frequency, very unusual words 
were not used, and there was some inci- 
dence of related words, e.g., fjord and 
isthmus, which often appeared together 
in recall protocols. It seems probable that 
alphabetization would be of most help to 
S when the list to be learned has few 
related words. If this is the case, a com- 
parison of F and A instructions with a 
list of very strongly associated items, 
such as a categorized list, should show 
no facilitation. In fact, A instructions 
might even produce interference—alpha- 
betical cues might reduce the effectiveness 
of strong category cues. Another factor 
contributing to the lack of a large F-A 
difference might have been the use of 
two words per cue. Earhard (1967a) 
has shown the speed of learning for lists 
with A instructions is inversely related 
to the number of words per cue, even 
when recall is cued by appropriate first 
letters. In the present study, S was addi- 
tionally confronted with the necessity to 
learn which particular 12 letters appeared 
in the list, a factor which might be ex- 
pected to reduce even more the effective- 
ness of alphabetical coding cues. 


CHILDREN’S CHOICE BEHAVIOR AS A FUNCTION OF 
STIMULUS CHANGE, COMPLEXITY, RELATIVE 
NOVELTY, SURPRISE, AND UNCERTAINTY 


F. MICHAEL RABINOWITZ AND CHARLOTTE V. ROBE 
University of Washington 


The effects of stimulus change, complexity, relative novelty, surprise, 
and uncertainty on choice behavior were of interest in the present 
experiment. Ss were 120 4th-grade children. Their task was to 
press 1 of 6 buttons on each of 149 trials. 5 of the buttons activated 
the same single light, while the 6th button activated another stimulus 
which was different for each of the 10 groups. Ss exhibited prefer- 
ence for relative novelty (on both position and color dimensions) and 
complexity, but not for stimulus change, surprise, or uncertainty. 
The failure to obtain significant effects of surprise and uncertainty 
was discussed. The data indicated that a variable labelled “outcome 
change” was important in controlling children’s stimulus selection 


behavior. 


In the present experiment, the effects 
of five of the “collative” variables— 
change, complexity, relative novelty, 
surprise, and uncertainty—which Ber- 
lyne (1960) suggested might control 
stimulus selection behavior were in- 
vestigated. This was accomplished by 
presenting children with a choice task. 
The apparatus used consisted of six 
buttons and six lights each arranged in 
a circle with the lights mounted on a 
separate panel above the buttons. 
Pressing any of the buttons on the re- 
sponse panel activated one or more of 
the lights on the stimulus panel. The 
response panel was arranged so that 
five of the buttons always activated an 
identical single light (common stim- 
ulus) for all Ss. The sixth button (odd 
button) activated various patterns of 
light which differed in one or more of 
the collative variables of interest for 
each of the 10 groups. The S was 
allowed to press any one of the six but- 
tons on each of 149 trials. It was ex- 
pected that the number of times the odd 
button was pressed would be related to 
the collative variable manipulated. 

The relationship of the collative 
variables to odd button depressions 
was assessed via a series of compari- 
sons. Each of the variables was relevant 
to some, but not all, of the comparisons. 
For this reason, a discussion of all 
of the variables except relative 
novelty will be postponed until the 
comparisons relevant to each are re- 
ported in the results section. 

Berlyne suggested several definitions 
for the concept of novelty, e.g., in terms 
of: (a) how different a given stimulus 
is from the stimuli previously encoun- 
tered by the individual (relative nov- 
elty dimension, investigated in the 
present experiment) ; (b) how recently 
a given stimulus has been encountered 
by the individual (short-term novelty 
dimension) ; and (c) how frequently 
a given stimulus has been encountered 
in the individual’s past (familiarity di- 
mension). Contemporary experi- 
mentation with children has concen- 
trated on the latter two of Berlyne’s 
definitions. Typically the child was 
given an initial familiarization period, 
and then experienced a test session 
in which the familiarization stimuli 
and new (short-term novel) stimuli 
were available. Children have shown 
a preference for the novel stimuli in the 
test phase by choosing them more often 
than the familiar stimuli (Harris, 
1965; Mendel, 1965), by responding to 
them faster than the familiar stimuli 
(Bogartz & Witte, 1966; G. N. Cantor 
& Cantor, 1964), and by looking at them 
longer than the familiar stimuli (ARER 
Cantor & Cantor, 1964, 1966). 


METHOD 


Subjects—The Ss were 60 male and 60 fe- 
male fourth-grade children. Their mean age 
was 10 yr., 0 mo., with a range of 8 yr., 11 mo. to 
11 yr., 2 mo. 

Apparatus—The S’s box was painted flat 
black and included a stimulus and a re- 
sponse panel. The response panel, mounted 
45° from the horizontal, contained six but- 
tons placed in a circular pattern (2-in. 
radius) on a 6 × 6 in. board. The positions 
of the buttons corresponded to the odd hours 
on a clock. A dim yellow light was located 
at the center of the circle of buttons. It 
served as the start position. The stimulus 
panel contained six 6-w. white bulbs, each 
covered by a green (except where noted 
below) reflector mounted on a second 6 × 6 
in. board. The latter board was mounted 2 
in. from the top of the response panel, per- 
pendicular to the horizontal. The reflectors 
were arranged in a circular pattern having 
a 2-in. radius. The positions of the reflectors 
also corresponded to the odd hours on a clock. 
A button and a light will be said to be con- 
gruent when they occupy the same hour on a 
clock. When any one button on the response 
panel was depressed, it activated one or more 
lights on the stimulus panel. 

The E’s box contained a control panel that 
allowed E to select the light or lights 
activated when S depressed any given button 
on the response panel, a Hunter timer which 
controlled the duration of the stimulus 
lights on each trial (2.5 sec.), an Industrial 
timer which controlled the interval from 
stimulus light offset to the starting signal 
(buzzer) onset (.5 sec.), a color-coded series 
of six lights, each activated by the depression 
of a different response, and a series of 
switches which controlled whether one or 
more stimulus lights flickered at a rate of 
2.5/sec. 

Procedure—Six boys and six girls were 
randomly assigned to each of 10 groups. For 
all of the groups, five of the buttons each 
activated the same single light stimulus 
(common stimulus, Light D) while the sixth 
(odd button) activated a different stimulus. 
The stimulus activated by the odd button 
was different in each group. The position of 
the odd button was counterbalanced, so that 
one boy and one girl in each group found 
the odd button in the 1 (3, 5, etc.) o'clock 
position. In the following description of the 
groups, the odd button, regardless of its 
position on the response panel, is referred 
to as Button A and its congruent light as 
Light A. The rest of the buttons (lights), 
in a clockwise sequence, are referred to as 
Buttons (Lights) B, C, D, E, and F. 

The lights activated by Button A for the 
10 groups were: 

Group 1. Light A (control condition). 

Group 2. Same as Group 1 except Light A 
also flickered (stimulus change). 

Group 3. Light A, which was covered by 
a green reflector for half of the Ss of each 
sex and an orange reflector for the remain- 
ing Ss. Buttons B, C, D, E, and F turned on 
Light D as usual, but the color of the re- 
flectors covering Lights B-F was orange 
when the reflector covering Light A was 
green and vice versa (relative novelty). 

Group 4. Same as Group 3 except Light 
A also flickered (relative novelty and stimu- 
lus change). 

Group 5. Lights E, F, A, B, and C 
(simultaneous complexity). 

Group 6. Light F or Light B, which 
alternated on successive depressions of But- 
ton A (complexity). 

Group 7. Light F or Light B, which were 
randomly activated on successive depressions 
of Button A with the restriction that each 
light appeared twice in blocks of four suc- 
cessive depressions (complexity and uncer- 
tainty). 

Group 8. Light E, F, A, B, or C in that 
order on successive depressions of Button A 
(complexity). 

Group 9. Light E, F, A, B, or C in a
random order on successive depressions of 
Button A, with the restriction that each light 
appeared twice in blocks of 10 successive 
depressions (complexity and uncertainty). 

Group 10. Light E, F, A, or B, in that 
order, on four successive depressions of 
Button A, followed randomly by either Light 
F, A, B, or D on the next depression of 
Button A. The sequence (beginning with 
Light E) repeated itself in successive five- 
trial blocks (surprise). 
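
The restricted randomization described for Groups 7 and 9 can be sketched in code. This is only an illustration under stated assumptions, not the authors' apparatus logic: it assumes that each block is an independent random ordering of a pool in which every light occurs twice, and the function name, block counts, and seed are hypothetical.

```python
import random
from collections import Counter

def restricted_random_sequence(lights, per_block, n_blocks, seed=None):
    """Return a light sequence in which every light occurs `per_block`
    times within each successive block of len(lights) * per_block trials.

    Assumption: each block is an independent shuffle of the same pool.
    """
    rng = random.Random(seed)
    sequence = []
    for _ in range(n_blocks):
        block = list(lights) * per_block  # e.g. F, B, F, B for Group 7
        rng.shuffle(block)
        sequence.extend(block)
    return sequence

# Group 7: Lights F and B, each twice per block of four depressions.
group7 = restricted_random_sequence(["F", "B"], per_block=2, n_blocks=5, seed=1)

# Group 9: Lights E, F, A, B, and C, each twice per block of 10 depressions.
group9 = restricted_random_sequence(["E", "F", "A", "B", "C"],
                                    per_block=2, n_blocks=3, seed=1)

# Check the restriction block by block.
for start in range(0, len(group7), 4):
    assert Counter(group7[start:start + 4]) == Counter({"F": 2, "B": 2})
```

Under this scheme the identity of the next light is uncertain on most depressions, yet each light is equally frequent within every block.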

Each child was brought into a semidark- 
ened room and seated before S’s box. In- 
structions read to S indicated E was inter- 
ested in whether boys or girls worked new 
machines better, and that each time the 
buzzer sounded he was to press any one of 
the buttons with the first finger of his pre- 
ferred hand. He was to look at the lights 
when they were activated, and between trials 
he was to place his finger on the start posi- 
tion. If S did not have his finger on the 
start position at the onset of the buzzer, 
pressing the response buttons would neither
activate any lights nor turn off the buzzer. 
In order to activate the lights and turn off 
the buzzer, S had to touch the starting posi- 
tion and then press a button. This feature 
was included to insure that S initiated his 
response at a point equidistant from all the 
buttons. 

Each S pressed the buttons 149 times 
(trials). The E recorded which button S 
pressed on each trial. If S pressed the 
buttons in a clockwise or counterclockwise 
direction on 6 successive trials during the 
first 10 trials, he was told: “You may keep 
pressing the buttons around in a circle if
you want to, but you don’t have to press
the buttons around in a circle. You may
jump from button to button in any way you
please.” The purpose of including these 
additional instructions was to reduce the 
amount of clockwise or counterclockwise re- 
sponding expected on the basis of earlier 
studies.


RESULTS AND DISCUSSION


Comparison 1 (relative novelty position).—Comparison
1 was conducted
to determine if Ss would press the odd
button more often than expected by
chance under a condition which, a
priori, seemed least conducive to such
behavior. Therefore, the comparison
involved only Group 1. For this
group, the stimulus associated with the
odd button differed from the common
stimulus only in spatial location, and
was, therefore, the stimulus least differentiated
from the common stimulus
of all the groups. In terms of Berlyne’s
collative variables, it can be
argued that Ss should press the odd
button more than chance only if it is
assumed that a stimulus can acquire
relative novelty in relation to an identical
stimulus simply by appearing in a
different spatial location.

In order to conduct a binomial anal- 
ysis, each S in Group 1 was assigned to 
one of two categories: (a) pressed the 
odd button more frequently than any 
one of the other five buttons, or (b) 
did not press the odd button more fre- 
quently. The total number of times 
Ss pressed each of the six buttons over 
the last 140 trials was used to deter- 
mine category assignment. (The first 
9 trials were considered practice.) 
The a priori probability (based on the 
assumption that Ss would press each 
button an equal number of times) of 
assignment to Category a was approximately
1/6 and to Category b approximately
5/6. Ties were assigned to Category
b to make the null hypothesis
more difficult to reject. Seven Ss were 
assigned to Category a. The probabil- 
ity that 7 or more Ss out of 12 would 
be assigned to this category was cal- 
culated (binomial analysis, p < .005). 
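
The binomial probability can be checked directly. The sketch below assumes a chance probability of 1/6 for Category a (one odd button among six) and independent Ss; the function name is illustrative.

```python
from math import comb

def binomial_upper_tail(k, n, p):
    """P(X >= k) for X distributed as Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

# Seven or more of 12 Ss in Category a, with a chance probability of 1/6.
p_value = binomial_upper_tail(7, 12, 1 / 6)
print(f"P(X >= 7) = {p_value:.5f}")
```

Under these assumptions the tail probability is roughly .0013, which is consistent with the reported p < .005.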

Thus, a tendency to press the odd 
button more often than chance was 
found when the stimulus associated 
with the odd button differed from the 
common stimulus only in spatial loca- 
tion. This suggests that it would be 
useful in a theory such as Berlyne’s 
to include position as a dimension on 
which a stimulus can acquire relative
novelty. It might be noted that the 
a priori assumption that Ss in Group 1 
would engage in less familiarity equali- 
zation than the rest of the Ss was cor- 
rect (see Table 1). 

Comparison 2 (stimulus change and 
relative novelty color).—The collative 
variables of stimulus change and rela- 
tive novelty were of interest in Com- 
parison 2. Berlyne defined stimulus 
change as the extent and rate of any 
perceived change or movement of a 
stimulus. Stimulus change was studied 
by comparing the odd button choice 
behavior of Groups 1 and 3 vs. Groups 
2 and 4. The odd button activated a 
single stationary light in Groups 1 and 
3, while it activated a flickering light 
in Groups 2 and 4. Relative novelty 
was manipulated by varying color con- 
trast. For Groups 1 and 2 the single 
light stimulus associated with the odd 
button was the same color as the com- 
mon stimulus, while for Groups 3 and 
4 it was a different color. It was predicted
that Ss would prefer the greater
amount of relative novelty and stim- 
ulus change. 

TABLE 1

THE MEAN AND STANDARD DEVIATION OF THE NUMBER OF ODD BUTTON
DEPRESSIONS OVER THE LAST 140 TRIALS FOR EACH GROUP

Group                                           M        SD
1 (Control)                                     26.2
2 (Stimulus Change)                             32.08    15.94
3 (Relative Novelty)                            36.37    10.51
4 (Relative Novelty and Stimulus Change)        37.75    16.38
5 (Simultaneous Complexity)                     34.58    13.24
6 (Complexity)                                  39.08
7 (Complexity and Uncertainty)                  41.66    18.39
8 (Complexity)                                  46.16    21.
9 (Complexity and Uncertainty)                  41.25    11.42
10 (Surprise)                                   48.75    19.88

In order to test the prediction, an
analysis of variance was performed on
the number of times each S pressed the 
odd button in each of seven successive 
20-trial blocks (TB) over the last 140 
trials. (This dependent variable was 
used in all of the remaining compari- 
sons.) Thus, TB 1-7 consisted of 
Trials 10-29, 30-49, 50-69, 70-89, 
90-109, 110-129, and 130-149, re- 
spectively. The between-Ss factors in 
the analysis of variance were stim- 
ulus change, relative novelty, and sex, 
while the within-Ss variable was TB. 
A significant Sex × TB interaction
effect was obtained, F (6, 240) =
2.19, p < .05, reflecting the fact that
there was a greater increase in the use
of the odd button across TB by males
than by females. The main effect of
relative novelty was nearly significant,
F (1, 40) = 4.06, p ≈ .05. As predicted,
the groups (3 and 4) that could
activate different color lights pressed 
the odd button more often than the 
groups (1 and 2) that activated lights 
of one color (see Table 1). No other 
contrast effect approached significance, 
p > .15 in all cases. 
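
The partition of trials into trial blocks can be written out explicitly. The sketch below follows the block boundaries given in the text (Trials 1-9 as practice, then seven 20-trial blocks); the data format and function names are hypothetical.

```python
def trial_block(trial):
    """Map a trial number (1-149) to its trial block (1-7).

    Trials 1-9 are practice and belong to no block, so None is returned.
    """
    if not 1 <= trial <= 149:
        raise ValueError("trial must be between 1 and 149")
    if trial < 10:
        return None
    return (trial - 10) // 20 + 1

def odd_button_counts(presses, odd_button="A"):
    """Count odd-button presses in each of the seven 20-trial blocks.

    `presses` is a list of 149 button labels, one per trial
    (a hypothetical record format).
    """
    counts = [0] * 7
    for trial, button in enumerate(presses, start=1):
        block = trial_block(trial)
        if block is not None and button == odd_button:
            counts[block - 1] += 1
    return counts
```

The seven counts returned for one S correspond to the seven levels of the within-Ss factor TB.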

Comparison 3 (stimulus complexity).—Berlyne
(1960) defined the collative
variable of complexity as the
degree of diversity in the stimulus pat- 
tern. Such diversity was assumed to 
be directly related to the number of 
elements in the pattern and their de- 
gree of dissimilarity, and inversely re- 
lated to the degree to which the pattern 
was responded to as a unit. The 
comparisons of Groups 1, 5, and 8 
allowed analysis of odd button choice 
when: (a) a single predictable stimulus 
composed of one element was available, 
(b) a single predictable stimulus com- 
posed of five elements was available, 
and (c) five predictable stimuli each 
composed of one element were avail- 
able. 

According to Berlyne’s definition the 
stimuli of Groups 5 and 8 are clearly 
more complex than that of Group 1 
since they contain more elements. The
difference between Groups 5 and 8 
is more difficult to delineate, but the 
five stimulus elements of the latter 
might be considered temporally more 
dissimilar or less proximal than those 
of the former. Thus, the increasing 
complexity across Groups 1, 5, and 
8 was expected to result in increasing 
choice of the odd button. 

The predictions derived from Ber- 
lyne’s theory were tested using an 
analysis of variance in which groups 
(1 vs. 5 vs. 8) and sex were the between-Ss
variables, while TB was the
within-Ss variable. The analysis
yielded significant main effects of TB,
F (6, 180) = 2.67, p < .025, and
groups, F (2, 30) = 5.31, p < .025.
No other contrast effect approached 
significance, p > .15 in all cases. The 
TB effect resulted from increasing 
choice of the odd button across train- 
ing. The significant groups effect was 
followed up in a manner suggested by 
Scheffé (1959). Group 8 chose the 
odd button significantly more than did 
Group 1, S² (2, 30) = 10.29, p < .025.
The differences between Groups 1 and
5 and Groups 5 and 8 were not reliable.
However, Groups 1 and 5 combined
did differ reliably from Group 8,
S² = 8.58, p < .025, and Group 1 differed
reliably from Groups 5 and 8
combined, S² = 6.87, p < .05. Although
reliable differences in the performances
of Groups 1 and 5 and
Groups 5 and 8 were not obtained, the 
agreement between prediction and data 
was good. The relative number of 
choices of the odd button across TB 
was ordered correctly by the prediction 
(see Table 1). 

The results obtained also are con- 
sistent with the results of earlier studies 
involving children in which complexity 
was manipulated by varying the num- 
ber of elements (May, 1963; Thomas, 
1966) or the number of elements in 
conjunction with the temporal proximity
of the elements (Duke, 1965).

Comparison 4 (stimulus complexity
and stimulus uncertainty).—Berlyne
defined stimulus uncertainty in terms 
of information theory. He assumed 
that uncertainty is related to the range 
of values a stimulus may take, and the 
extent to which these values are equally 
likely. In Comparison 4, uncertainty 
was manipulated by varying both the 
number of different lights that could 
be activated (one at a time) by the odd 
button and the predictability of activat- 
ing any particular light. Since the 
lights that could be activated by the 
odd button were completely predictable 
in Groups 6 and 8, the uncertainty was 
zero for these groups. The lights that 
could be activated by the odd button 
were not completely predictable in 
Groups 7 and 9; thus, the uncertainty 
was greater than zero for these groups. 
Furthermore, since the number of out- 
comes was greater in Group 9 than in 
Group 7 and each of the possible out- 
comes was equally likely to occur 
within the groups, uncertainty was 
greater in Group 9 than Group 7. In 
order to test predictions derived from 
Berlyne’s theory, an analysis of vari- 
ance was conducted on the odd button 
data of Ss in Groups 6-9. The be- 
tween-Ss variables were complexity 
(two vs. five temporally proximal 
lights; Groups 6 and 7 vs. Groups 8 
and 9), uncertainty (Groups 6 and 8 
vs. Groups 7 and 9), and sex. The
within-Ss variable was TB. Significant
main effects of complexity and
uncertainty were predicted. It was
expected that Ss would prefer the more
complex and more uncertain stimuli.
A significant Complexity × Uncertainty
interaction was also predicted on
the basis of the greater uncertainty in 
Group 9 than in Group 7. 

The analysis of variance yielded only 
a significant TB effect, F (6, 240) = 
6.32, p < .001, reflecting the increase
in odd button choice across TB. No 
other contrast effect approached sig- 
nificance, p > .20 in all cases, indicat- 
ing a failure to support the predictions 
derived from Berlyne’s theory. 

The failure to find a significant com- 
plexity effect in Comparison 4 sug- 
gests the interpretation offered for 
part of the results obtained in Com- 
parison 3 may be incorrect. In Com- 
parison 3, Group 8 pressed the odd 
button significantly more often than 
Groups 1 and 5 (combined). In the 
discussion following Comparison 3, it 
was suggested that this finding resulted 
from the increasing effectiveness of 
stimulus complexity as a function of 
both increasing number of elements and 
decreasing temporal proximity of ele- 
ments. An alternative explanation of 
this result, based on and consistent 
with the results in Comparison 4, is 
that the important variable is one vs. 
more than one outcome associated with 
the odd button. Group 8 had five out- 
comes associated with the odd button, 
while Groups 1 and 5 had only one. 
All the groups in Comparison 4 had 
more than one outcome associated with 
the odd button. 

Berlyne suggests that the collative 
variables affect behavior through a con- 
flict mechanism. Briefly, when an S 
looks at a stimulus containing some 
collative property, a state of conflict is
induced. Associated with the conflict 
is an increment in drive level. If S 
remains in contact with the stimulus, 
the conflict is reduced which, in turn, 
reduces the drive level and is rein- 
forcing. The collative variable of un- 
certainty seems most closely related to 
the conflict formulation, and for this 
reason the failure to obtain a significant 
effect of uncertainty seems theoretically 
important. 

It is possible that significant findings
involving stimulus uncertainty and/or
complexity were not obtained in Comparison
4 because: (a) Ss treated the
stimuli associated with the odd button 
as a class of events rather than as two 
or five unique outcomes; (b) a suf- 
ficient range in stimulus uncertainty 
was not used; (c) forgetting occurred 
between successive depressions of the 
odd button which tended to equalize 
the complexity and uncertainty be- 
tween the groups; and (d) one vs. 
more than one outcome associated with 
a response is the important variable. 
No evidence regarding Alternatives a 
or b is presently available. However, 
Alternative a is worthy of further 
comment. Berlyne emphasized that 
it was S’s uncertainty and not O’s 
that was important. Although it does 
not seem likely, it is possible that Ss 
treated the series of outcomes asso- 
ciated with the odd button as a class 
of events and that no subjective uncer- 
tainty was involved. If this was the 
case, a more appropriate test of the 
effect of stimulus uncertainty would 
involve a dimension scaled by Ss. 

Alternative c (forgetting) seems the 
least plausible of the above explanations.
Inspection of the data reveals
the median and mode of the maximum 
number of times an S pressed the odd 
button in a block of 20 trials (same 
TB as used in the analysis of variance) 
in Groups 6-9 was 9, while the mean 
was 8.74. Thus, at some point in 
training 50% of the Ss pressed the odd 
button about half the time over a se- 
quence of 20 trials. This frequency 
of odd button use would seem to pre- 
clude forgetting. 

Using third- and fourth-grade child- 
ren as Ss, Duke (1965) investigated 
the effects of stimulus complexity and 
uncertainty in a multiple-response sit- 
uation. Complexity and uncertainty 
were manipulated in a manner similar 
to that used in Comparison 4. Two 
different response speed measures were 
the dependent variables. As in Comparison
4, significant effects of complexity
and uncertainty were not obtained.
Since Duke designed her task
to minimize forgetting, additional evi- 
dence inconsistent with Alternative c 
is available. On the other hand, 
Duke’s data are consistent with the one 
vs. more than one outcome, Alterna- 
tive d. Her control group (each re- 
sponse produced only one outcome) 
performed differently from her other 
groups. 

A similar conclusion can be drawn 
from Duke’s findings and the results of 
Comparisons 3 and 4. Children be- 
tween 8 and 10 yr. of age respond dif- 
ferently in a multiple-response situation 
in which all but one of the responses 
are associated with one stimulus 
outcome and the remaining re- 
sponse is associated with more than 
one outcome from the way they re- 
spond in a multiple-response situation 
in which every response is associated 
with one stimulus outcome. The dif- 
ference is manifested in choice behavior 
and speed of responding, and is inde- 
pendent of the number or uncertainty 
of the stimulus outcomes associated 
with the multiple-outcome response 
within the range studied. 

Whether the above conclusion gen- 
eralizes to Ss of other ages, single re- 
sponse situations and greater amounts 
of complexity and uncertainty (Duke 
used the same range of values as did 
the present investigators) are empirical 
questions. At any rate, the possible 
importance of more than one vs. one
outcome on human stimulus selection 
behavior will be emphasized by sug- 
gesting that “outcome change” (to dis- 
tinguish it from stimulus change) be 
added to Berlyne’s list of potentially 
important collative variables. 

Comparison 5 (surprise).—Berlyne
defined the collative variable of surprise
as a stimulus which does not conform
to the established expectations of
O. For Ss in Group 10, successive de- 
pressions of the odd button activated 
four lights in a clockwise order and a 
fifth light in an “unexpected” random 
location (surprise condition). The 
Ss in Group 8, for whom successive 
depression of the odd button activated 
five lights in a clockwise order, served 
as the control for outcome change. It 
was predicted that Group 10 would 
use the odd button more frequently 
than Group 8. To test this prediction, 
an analysis of variance was conducted 
in which groups (8 vs. 10) and sex 
were the between-Ss variables and TB 
was the within-Ss variable. 

The analysis yielded only a signif- 
icant effect of TB, F (6, 120) = 3.90, 
p<.005, reflecting an increasing 
choice of the odd button across the 
first four trial blocks and a decreasing 
choice thereafter. No other contrast 
effect approached significance, p > .20 
in all cases. 

Although a significant effect of sur- 
prise as compared to outcome change in 
a multiple-choice situation has not been 
obtained elsewhere, effects of surprise 
on human stimulus selection behavior 
have been obtained under other cir- 
cumstances (Charlesworth, 1964; 
Charlesworth & Zahn, 1966; Grim & 
White, 1965). Therefore, the failure 
to obtain a significant effect of sur- 
prise in Comparison 5 was probably a 
result of the manner in which this vari- 
able was manipulated. The Ss in 
Group 10 might have learned to ex- 
pect a “surprising” outcome after they 
pressed the odd button a sufficient 
number of times. In other words, as 
training progressed, the surprise ma- 
nipulation probably seemed an “un- 
certainty” manipulation to Ss. The 
fact that Ss in Group 10 pressed the 
odd button more often than Ss in any 
of the other nine groups (see Table 1) 
might reflect the effect of the “surprise” 
manipulation early in training and the
effect of outcome change as training 
progressed. 

Concluding remarks and additional 
analyses.—The result obtained in Com- 
parison 1 suggests that stimuli can be 
novel on a position dimension. In 
Comparison 2, relative novelty (same 
vs. different color lights) had a greater 
effect on odd button behavior than did 
stimulus change. In Comparison 3, 
odd button choice was positively cor- 
related with increasing complexity. 
The results obtained in these three 
comparisons also can be viewed as re- 
flecting only the effect of relative 
novelty. From this point of view, 
Comparison 1 reflects the effect of 
“position” relative novelty, and the 
comparison of Group 1 with Groups 
2-5 would assess the effect of “position”
vs. “position plus stimulus dimension”
relative novelty. Group 1
was compared with Groups 2-5, using
the number of times each S pressed
the odd button over the last 140 trials
as the dependent variable. The resulting
statistic was significant, t (58) =
2.03, p < .05, two-tailed test. Table 1 
shows that the addition of “stimulus 
dimension” relative novelty increased 
the use of the odd button. 

The results of Comparisons 3, 4, and 
5 appear to reflect the potency of the 
outcome change variable. In order to 
assess the effect of this variable more 
directly, the outcome change groups 
(6-10) were compared with the “position
and stimulus dimension” relative
novelty groups (2-5) on the number
of odd button choices over the last 140
trials. The resulting statistic was significant,
t (106) = 2.67, p < .01, two-tailed
test, reaffirming the effectiveness
of the outcome change variable (see
Table 1). 

F. MICHAEL RABINOWITZ AND CHARLOTTE V. ROBE

Finally, it should be mentioned that
the main effect of TB was significant
in Comparisons 3, 4, and 5, but not
in Comparison 2. Inspection of the 
data reveals that the outcome change 
groups increased in their use of the 
odd button across TB more than did 
the non-outcome change groups. In 
order to assess the statistical reliability 
of this trend, an analysis of variance 
was conducted in which outcome vs. 
non-outcome change (Groups 6-10 vs. 
Groups 1-5) and sex were the between- 
Ss factors, while TB was the within- 
Ss factor. The Outcome vs. Non-outcome
Change × TB interaction was
significant, F (6, 696) = 3.49, p <
.005, as were the main effects of outcome
vs. non-outcome change, F (1,
116) = 12.52, p < .001, and TB, F (6, 
696) = 7.95, p < .001. No other con- 
trast effect approached significance, p 
> .20 in all cases. Thus, the stimulus 
condition that was more reinforcing in 
terms of the absolute choice of the odd 
button also led to a greater increase in 
choice of the odd button across training. 
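
The error degrees of freedom reported for the analyses above can be cross-checked with the usual bookkeeping for a design with between-Ss factors and one repeated factor. The sketch assumes equal cell sizes (12 Ss per group, as stated in the Procedure) and no missing data; the function name and the dictionary of designs are illustrative.

```python
def mixed_design_error_df(n_subjects, n_between_cells, n_blocks=7):
    """Error df for a between-within design with TB as the repeated factor.

    Returns (between-Ss error df, within-Ss error df).
    """
    between_error = n_subjects - n_between_cells
    within_error = between_error * (n_blocks - 1)
    return between_error, within_error

SS_PER_GROUP = 12  # six boys and six girls per group

# analysis: (number of groups entered, number of between-Ss cells including sex)
designs = {
    "Comparison 2": (4, 8),     # stimulus change x relative novelty x sex
    "Comparison 3": (3, 6),     # groups (1, 5, 8) x sex
    "Comparison 4": (4, 8),     # complexity x uncertainty x sex
    "Comparison 5": (2, 4),     # groups (8, 10) x sex
    "Outcome change": (10, 4),  # outcome vs. non-outcome change x sex
}

for name, (n_groups, n_cells) in designs.items():
    print(name, mixed_design_error_df(n_groups * SS_PER_GROUP, n_cells))
```

The resulting pairs, (40, 240), (30, 180), (40, 240), (20, 120), and (116, 696), agree with the denominators of the F ratios reported in the text.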
University of Alberta
3 experiments investigated several aspects of the relationship between
performance in paired-associate (PA) learning and formal intralist
stimulus similarity. Similarity was defined in terms of number and
position of identical letters among all possible pairs of stimuli comprising
the list. In Exp. I, similarity was varied in lists using only
a single letter-position identity “rule.” In Exp. II, lists were of
moderate similarity, but varied in terms of number of rules and
variability of the similarity among stimuli. In Exp. III, the sets
contained 4 rules, but varied in similarity. Similarity produced some
interference in homogeneous lists, but the amount of interference
was greater when the stimuli had multiple rules and was less when
the variability was high. The results were discussed in terms of
coding by pronunciation vs. stimulus selection.


1 Supported by Grant AP-B-88 from the
National Research Council of Canada. Lillian
Bowes supervised the collection of the
data which were collected by Allan Jamison,
Frank Phillette, Dennis Bobiy, and Brian
Johnson.


When similarity among a set of verbal
units used as stimuli in a paired-
associate (PA) learning task is defined
in terms of shared letters, the results
are remarkably consistent. Most studies
(e.g., Runquist, 1968; Underwood,
1953) show poorer performance
as similarity increases.

The experiments reported in this
paper investigate several aspects of
this relationship. It should be clear
that similarity is a variable defined on
a set of stimuli. One way of conceptualizing
the similarity of a set would
be to consider the pairwise similarities,
i.e., the similarity between every item
in the set and every other item in the
set. Ratings of similarity of pairs of
stimuli obtained from published norms
(Runquist & Joinson, 1968) may
be used to compute the mean similarity
among all of the items in the list.
There are, however, other parameters
of lists so defined. One such characteristic
of the list is its variability. In
some lists, all items are virtually equally
similar to one another, while in other
lists some items may be very similar,
while others are dissimilar. The latter
would be considered to have high variability.
In addition, there is a property
which will be called heterogeneity. The
relationship between any two items in
the list may be defined in terms of
number and position of shared letters.
With CVCs there are 11 possible relationships
or “rules” including the condition
where no letters are shared.
Thus, lists may vary in the number of
different rules involved in all comparisons
of items. Although these
three factors are not entirely independent,
they can be manipulated separately.
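
The notion of a letter-position identity rule can be made concrete with a short sketch. It is a simplification under stated assumptions: only letters shared in the same position are considered (so it does not enumerate all 11 relationships mentioned above), and the trigram list and function names are hypothetical.

```python
from collections import Counter
from itertools import combinations

def identity_rule(a, b):
    """Return the positions (0 = first letter, 1 = vowel, 2 = third letter)
    at which two trigrams have the same letter."""
    return tuple(i for i, (x, y) in enumerate(zip(a, b)) if x == y)

def rule_counts(trigrams):
    """Tally the identity rule for every pairwise comparison in a list."""
    return Counter(identity_rule(a, b) for a, b in combinations(trigrams, 2))

# Hypothetical six-item list: three pairs share first and third letters.
example = ["XUL", "XIL", "QET", "QOT", "ZAV", "ZYV"]
counts = rule_counts(example)
heterogeneity = len(counts)  # number of different rules in the list
```

For this list the 15 pairwise comparisons fall under two rules: 12 comparisons with no shared letters and 3 with identical first and third letters.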

Although these variables are of in- 
terest from a purely empirical point of 
view, recent discussions concerning the 
coding of both stimulus and response 
terms in PA learning are also of some 
relevance. Underwood (1963) has
pointed out that S may encode the 
stimulus term in such a way that only 
a single component of an analyzable 
unit, such as the first letter of a trigram,
serves as a discriminative cue
for recalling a particular response term. 
Thus, the amount of interference pro- 
duced when similarity is high should 
depend on the opportunity for coding 
by stimulus selection. Heterogeneity
should have its effects through this 
process. Recent evidence concerning 
organizational processes in PA learn- 
ing (Battig, 1966; Runquist, 1966) 
suggests that opportunity to group 
items, as in the case of high-variability 
lists, will attenuate interference. 

Thus, the lists used in these experi- 
ments varied in their structural char- 
acteristics as well as in overall stimulus 
similarity. The study consisted of 
three subexperiments involving differ- 
ent factors. In Exp. I, similarity was 
varied in homogeneous sets of stimuli 
involving only a single rule. In Exp. 
II, sets of moderate similarity but 
varying in heterogeneity (number of 
rules) and variability were used. In
Exp. III, the sets contained four rules, 
but varied in similarity. 


EXPERIMENT I 


In this experiment, the rule which 
determined sharing of letters was iden- 
tical for all items in a set. This system 
should enhance coding by stimulus se- 
lection and thus reduce or eliminate 
the effects of similarity. 


Method 


Apparatus.—The PA task was identical in 
all experiments and required S to press one 
of six buttons arranged in a linear array 
when one of six trigrams was presented. 
Data were simultaneously collected from two 
Ss who sat in adjacent booths. Neither S 
could see the other. Stimuli appeared on a 
3 × 5 in. screen located in the wall a few
inches above a sloping wooden panel contain- 
ing six momentary contact button switches 
approximately 2 in. apart. Directly above 
each switch was a small lamp to provide 
feedback on S’s responses. Details of the
apparatus are given in a previous publication
(Runquist, 1967).

Procedure.—Learning was by the anticipation
method. Each stimulus was exposed
for 3 sec. After 1.5 sec. had elapsed, the 
lamp above the correct button was lighted for 
1 sec., and there was a .5-sec. interitem interval.
Only responses occurring within the initial 
1.5-sec. period were recorded. Each of six 
stimuli was paired with one of the six lamps. 
The sequence of stimuli was arranged in
trials consisting of a single presentation of 
each stimulus, but no intertrial interval or 
other indication was given to S that a new 
trial had begun. Five separate orders of pre- 
sentation were used. All Ss were given 10 
trials on a practice set of six nonsense shapes 
prior to 20 trials on the test set. 

Materials.—The stimuli for all three experiments
were CVC trigrams selected from
Archer’s (1960) norms, and were of less 
than 25% association value. The sets varied 
in mean similarity as determined from an 
earlier calibration (Runquist & Joinson, 
1968), but in this experiment all items in a 
given set were related by a single rule. 
These rules were no repeated letters (List 
1), all first letters identical (List 2), all 
third letters identical (List 3), all vowels 
identical (List 4), all first letters and vowels 
identical (List 5), all vowels and third let- 
ters identical (List 6), and all first and third 
letters identical (List 7). The mean simi- 
larity values for these sets computed from 
the norms were 5.0, 31.9, 25.8, 26.5, 66.3, 68.4, 
and 68.8. The variance of each list was zero.
Two separate sets of six trigrams were made 
for each rule. The correct button for each 
stimulus was determined haphazardly with 
the restriction that the stimuli not be as- 
signed to buttons in alphabetical order. 
Three separate assignments of stimuli and 
buttons were used with each list. 

Subjects.—The Ss were 189 introductory
psychology students serving as a course re- 
quirement. Some Ss had previously served 
in verbal PA experiments although not 
with these materials. Most Ss were run in 
pairs. The experiment was run as two rep- 
lications using the alternate sets of stimuli. 
Order of assignment to list condition (1-7) 
was random in order of appearance, but the 
two replications were run consecutively;
within each replication the different S-R
pairings were also run consecutively. Little
bias should result from these procedures as 
the first replication was run in 1 calendar 
week, and the second replication in 2 wk. 
Twelve Ss per list were run in the first 
replication and 15 Ss per list in the second
replication. 


Results 


Since all Ss learned the practice task 
under identical conditions, performance 
on this task may be used to determine 
equality of the various groups. There 
were no differences among the groups 
within either replication or overall, F 
< 1 for Replication 1 and F = 1.13 for 
Replication 2. The correlations between
practice task and test task performance
in the various groups ranged
.13-.72 with a median of .48. The results
from the two replications are thus
combined in all analyses. The per- 
centage of correct responses for each 
block of four trials is shown in Fig. 1. 
Descriptively, the functions fall into 
three categories: (a) the group in 
which sets contained no duplicated let- 
ters had the best acquisition perform- 
ance; (b) lists with one shared letter 
produced somewhat poorer perform- 
ance; and (c) generally, stimuli shar- 
ing two letters produced the poorest 
performance. The exception is List 5 
where the stimuli share first letter and 
vowel. This group performed as well 
as the groups where only one letter 
was shared. 

Statistical analyses on total correct
responses by covariance analysis
yielded an overall F (6, 175) = 4.19,
p < .001. Comparisons between individual
groups and combinations of
groups using Dunnett’s and Scheffé’s 
tests (Edwards, 1960) generally sup- 
ported the picture shown in Fig. 1. 
Performance on Lists 2 and 3 just 
failed to show significant depression, 
t = 1.83 and 1.74, respectively, while
the performance on all other lists dif- 
fered from that of List 1. Further- 
more, performance on Lists 6 and 7 
together was inferior to that on Lists 
2 and 3 together, but the difference be- 
tween these groups and the combined 
performance on Lists 2, 3, 4, and 5 
barely failed to reach significance. Dif- 
ferences among groups in overt errors 
were small and not significant. 


EXPERIMENT II 


In this experiment sets of stimuli 
of moderate similarity but varying in 
variability (variance of individual item 
similarities) and heterogeneity (num- 
ber of different letter-position identity 
rules) were used. Theoretically, one 
might expect increasing variability to 
reduce the effects of similarity, par- 
ticularly when there are few rules, since 
it would give Ss the opportunity to 
organize the stimuli into groups (Bat- 
tig, 1966). On the other hand, in- 
creasing the heterogeneity of the set 
should make the list more difficult, as 
it would require the acquisition of a 
more complicated set of rules for dis- 
criminating among the stimuli. 


Method 


The apparatus, task, and procedure were
identical to those in Exp. I. Four groups
were run, each group receiving 10 trials on 
the practice list and 20 trials on one of four 
test lists. The stimulus sets for these lists 
were arranged to combine two values of vari- 
ance of rated similarities (high and low 
variance) with two degrees of heterogeneity
(2 vs. 5-6 rules).

TABLE 1
EXAMPLE STIMULUS SETS FOR EXP. II

LV-HO    HV-HO    LV-HE    HV-HE
XUL      XOL      SUJ      HUJ
XIL      XEL      SUV      HUQ
QET      XIL      XIJ      QUH
QOT      GAC      XYP      XEL
ZAV      WIH      XEL      XIS
ZYV      PYD      LYG      GAC

Note.—M for LV-HO and HV-HO = 14.8; LV-HE = 15.2; HV-HE = 15.4. Variance for LV-HO = 0; HV-HO = 63.8; LV-HE = 11.7; HV-HE = 49.7.

These sets are designated LV-HO (low variance-homogeneous),
LV-HE (low variance-heterogeneous), HV- 
HO (high variance-homogeneous), and 
HV-HE (high variance-heterogeneous). In 
computing the variance of a list, the rated 
similarity of an item to each other item in the 
list was obtained from the norms, and the 
sum of these ratings was computed, result-
ing in a similarity score for that item. The
variance was then computed among the sums
for the six items. It was not possible to
equate the two low-variance sets or the two
high-variance sets exactly, but the values were
approximated. The rule structure of these 
sets is somewhat complex; therefore, exam- 
ples are provided in Table 1 along with the 
mean rated similarity and variance of each 
list. Since the sets contain six items, rule 
structure may be described in terms of the 
relationships between the 15 pairwise com- 
binations of items. For the LV-HO and 
HV-HO sets, 12 comparisons contain no
common letters, and 3 comparisons have the 
first and third letters identical, with the dif- 
ference in variability being produced by the 
different arrangement of letter identities. 
The LV-HE set contained 8 comparisons 
with no letters in common, 3 comparisons 
with identical first letter, 1 comparison with 
vowel identical, 1 with last letter identical, 1 
with first two letters identical, and 1 with 
first letter in third position. The HV-HE 
set had only five rules with identical first 
letter, one having first two letters identical, 
one having first letter in third position with 
the vowel identical, and one having first and 
third letters identical but reversed. The 
study was run in two replications using dif-
ferent items in each replication.
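The variance computation described above can be illustrated with a minimal sketch. The rating function below is purely hypothetical (70 for a pair sharing first and third letters, 5 otherwise); it stands in for the published norms, which are not reproduced here. Whether the authors used a population or a sample variance is not stated, so the population form is an assumption.

```python
from statistics import pvariance

def similarity_sums(items, rating):
    """Sum each item's rated similarity to every other item in the list."""
    return [sum(rating(a, b) for b in items if b != a) for a in items]

def toy_rating(a, b):
    # Hypothetical stand-in for the norms: 70 if first and third
    # letters match, 5 otherwise. Not the authors' actual ratings.
    return 70 if (a[0] == b[0] and a[2] == b[2]) else 5

lv_ho = ["XUL", "XIL", "QET", "QOT", "ZAV", "ZYV"]  # three similar pairs
hv_ho = ["XOL", "XEL", "XIL", "GAC", "WIH", "PYD"]  # one similar triple

for name, items in (("LV-HO", lv_ho), ("HV-HO", hv_ho)):
    sums = similarity_sums(items, toy_rating)
    # Variance is taken among the six per-item sums (population form assumed).
    print(name, sums, pvariance(sums))
```

With identical similarities arranged as three pairs, every item has the same sum and the variance is zero; grouping the same relation into one triple makes the sums unequal, which is how the two homogeneous sets can share a rule structure yet differ in variability.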

The Ss were 108 introductory psychology 
students who served as part of a course re- 
quirement. They were assigned to condi- 
tions according to a prearranged nonsys-
tematic order as they arrived at the labora-
tory. Replications and three different S-R
pairings were run consecutively. There 
were 12 Ss in each group of the first repli- 
cation and 15 in each group in the second 
replication. 


Results 


The design was treated as a 2 × 2
factorial design, with comparisons also
made with the performance of the low-
similarity list (List 1) from Exp. I.
The two replications were combined 
for all analyses. 

There were no differences in prac- 
tice list performance, as the Fs for 
variability, heterogeneity, and the inter- 
action were all less than unity. Analy- 
sis of covariance was used to assess 
the data on the test lists. The ad- 
justed mean total correct responses for 
each group are shown in Table 2 along 
with the mean for the low-similarity 
list from Exp. I. High variance and 
few rules aid performance, supporting 
both expectations. The covariance 
analysis resulted in significance for both 
main effects, F (1, 100) = 4.37, p< 
.05, for both variables. The interaction 
was not significant. Moreover, specific 
comparisons with the low-similarity 
group showed that only in the LV-HE 
group was there significant depression 
in performance, t (50) = 2.49, p < .02.
Although LV-HO and HV-HE showed
depression in both replications, the t
values were only 1.42 (p < .15) for
these means compared with the low- 
similarity condition. The HV-HO 
group not only showed no depression 
but performed slightly better than the 
low-similarity group. The overt errors 


TABLE 2 


ADJUSTED MEAN TOTAL CORRECT
RESPONSES IN EXP. II


Group M | Combined 
-HE 63.20 65.81 
$789 HVE 68.41 73.16 
ined | 73.15 i 
fike 1) | 75.41 Estimated Sm | 3.47 
TABLE 3
LEARNING SCORE DIFFERENCES FOR INDIVIDUAL ITEMS IN EXP. II

Sets                            Items

LV-HO          XUL     XIL     QET     QOT     ZAV     ZYV
  Similarity   17.8    17.8    17.8    17.8    17.8    17.8
  Learning     [illegible]  37  65      16      52      32

HV-HO          XOL     XEL     XIL     GAC     WIH     PYD
  Similarity   31.0    31.0    31.0    5.0     5.0     5.0
  Learning     76      45      −4      −47     −37     [illegible]

LV-HE          SUJ     SUV     XIJ     XYP     XEL     LYG
  Similarity   21.4    10.4    14.9    11.8    18.7    12.2
  Learning     50      52      97      59      97      28

HV-HE          HUJ     HUQ     QUH     XEL     XIS     GAC
  Similarity   25.7    30.3    29.4    10.4    10.4    5.0
  Learning     60      61      88      15      0       −23


showed the same pattern, but the dif- 
ferences were small and not significant. 

The structure of the lists makes it 
possible to analyze both correct re- 
sponses and errors for individual items 
as a function of the similarity of the 
item to all other items in the list. To 
obtain the similarity score for an item, 
the mean of the rated similarities of 
that item to the other five items in the 
list was obtained. Since only three 
different S-R pairings were used, how- 
ever, the learning scores are not en- 
tirely independent of response button 
position effects. Consequently, total
correct responses for each item were
compared with this value for the com-
parable item in the low-similarity group
from Exp. I, and are expressed as dif- 
ference scores. Positive scores repre- 
sent fewer correct responses than com- 
parable low-similarity items. The 
results of this analysis are presented 
in Table 3. 

The LV-HO stimuli are arranged
into pairs of similar items; thus there
are no differences in similarity score 
among items and the items show rela- 
tively small differences in interference. 
The range of differences among items 
is 49, while in the other lists, the 
ranges are 123, 69, and 111. In the 
HV-HO stimuli where three items are 
similar to one another and three items 
are dissimilar to these items and to 
each other, the difference scores are 
larger in the similar set. The dis- 
similar items had large negative scores 
indicating that the similarity among the 
first three items actually facilitated per- 
formance on the remaining low-simi-
larity items.

In the two HE lists the correlation
between mean similarity and learning 
was positive. For the LV-HE set the 
rank correlation was .24 and for the 
HV-HE set it was .93. The lower 
correlation may be the result of the 
smaller range of similarity values neces- 
sary to produce a low-variability list. 
The single isolated low-similarity item
in the HV-HE list showed a negative 
score again indicating some facilitation. 
The overall rank correlation between 
learning score and similarity rating 
across all lists was .67. 
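As a check on the arithmetic, the rank correlation for the HV-HE set can be recomputed from the Table 3 entries. The sketch below assumes the classic difference-of-ranks formula with tied values given their mean rank; the authors do not say which variant they used. The last learning score is read as −23, a value the scan renders unclearly but one that agrees with the reported range of 111 for this list.

```python
def average_ranks(values):
    """Rank values from low to high, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j share one rank
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Rank correlation via 1 - 6 * sum(d^2) / (n * (n^2 - 1))."""
    n = len(x)
    d2 = sum((a - b) ** 2 for a, b in zip(average_ranks(x), average_ranks(y)))
    return 1 - 6 * d2 / (n * (n * n - 1))

# HV-HE rows of Table 3 (HUJ, HUQ, QUH, XEL, XIS, GAC).
similarity = [25.7, 30.3, 29.4, 10.4, 10.4, 5.0]
learning = [60, 61, 88, 15, 0, -23]  # last value assumed to be -23

print(round(spearman_rho(similarity, learning), 2))  # 0.93
```

Under these assumptions the result rounds to the .93 reported for the HV-HE set.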

The distribution of overt errors over 
various items in the lists produced dif- 
ferences. For this analysis, the fre- 
quency of particular errors was tabu- 
lated and compared with the frequency 
of the same error in the low-similarity 
condition. Considering first the two 
HO sets, the frequency of “generaliza- 
tion” errors, i.e., errors which would 
have been correct for one of the similar 
items, was higher in both LV-HO and 
HV-HO groups. With LV-HO, 40% 
of the overt errors were generalized er- 
rors compared with 19% in the low- 
similarity condition, while in the HV- 
HO group 63% were generalized 
errors compared with 41% in the low- 
similarity set. The three low-similarity 
stimuli in the HV-HO condition pro- 
duced 186 overt errors compared with 
290 errors on comparable items in the 
low-similarity condition. 

In the LV-HE set, generalized er- 
rors among items sharing the first two 
letters accounted for 46% of the total 
overt errors while the proportion in 
the low-similarity group was 14%. 
The items sharing only the first letter 
produced 56% generalized errors com- 
pared with 38% in the low-similarity 
condition. Each of these items also 
was similar in some way to another 
item in the list, but in no case were 
confusions among these items more 
frequent than those among comparable 
items in the low-similarity set. 

In the HV-HE group, pairs sharing 
only an initial letter produced 35% 
generalized errors compared with 18% 
errors in the low-similarity set. The 
two items sharing the first two letters 
made 39% generalized errors while the 
low-similarity set produced only 24%.
The “anagram” relation generalized 
48% with the comparable control items 
producing 26% errors. No other 
source of similarity resulted in an error 
differential. 


EXPERIMENT III 


Experiment II showed that a greater 
complexity of the stimulus set in terms 
of number of different letter-position 
identity rules resulted in depressed 
performance. The effect, however, was 
small and, compared with a group 
which had no similarities, was barely 
significant. Since the mean similarity 
in these lists was quite low (15%), 
it may be that rule complexity would 
have a much greater effect when simi- 
larity is high. 

In Exp. III, performance is com-
pared among lists in which mean simi-
larity and number of rules are varied.


Method 


Only two additional conditions labelled 
HS-1 and HS-2 were run with comparisons 
being made with selected groups from Exp. 
I and II. The procedure was identical to 
that in the other experiments. 

Materials—Each of the new sets contained 
four letter-position identity rules making up 
the 15 possible comparisons. The rules 
comprising HS-1 were 6 comparisons with
no common letters, 3 comparisons with first
letter identical, 3 comparisons with vowel
identical, and 3 comparisons with last
letter identical. The set may be exempli-
fied by XIK, XUH, VEK, VUB, QIH,
QEB. The HS-2 list consisted of 6 com- 
parisons with first letter in third position 
and identical vowel, 3 comparisons with first
two letters identical, 3 comparisons with 
all three identical but consonants reversed, 
and 3 comparisons with second and third let- 
ters identical. An example of this set would 
be XUQ, XUZ, QUX, QUZ, ZUQ, ZUX. 
This set comprises permutations of X, Q, 
and Z around a common vowel. Two sets
were constructed for each condition. Mean 
rated similarity for HS-1 was 18.8 and for 
HS-2, 59.8. 
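The rule structure of the two example sets can be tallied mechanically. The following sketch classifies each of the 15 pairwise comparisons by the letter positions at which the two trigrams match, with a separate label for pairs that are exact reversals of each other. The labels and function names are illustrative only and are not the authors' terminology.

```python
from collections import Counter
from itertools import combinations

def relation(a, b):
    """Label a pair of trigrams by the positions at which they match."""
    if a == b[::-1]:
        return "reversed"  # same three letters, consonants swapped
    # Tuple of 0-based positions holding the same letter in both items.
    return tuple(i for i in range(3) if a[i] == b[i])

def rule_counts(items):
    """Count how many of the pairwise comparisons fall under each label."""
    return Counter(relation(a, b) for a, b in combinations(items, 2))

hs1 = ["XIK", "XUH", "VEK", "VUB", "QIH", "QEB"]
hs2 = ["XUQ", "XUZ", "QUX", "QUZ", "ZUQ", "ZUX"]

print("HS-1:", dict(rule_counts(hs1)))
print("HS-2:", dict(rule_counts(hs2)))
```

For HS-1 the tally gives 6 pairs with no match and 3 pairs each for a shared first letter, vowel, or last letter. For HS-2 it gives 3 pairs sharing the first two letters, 3 sharing the last two, 3 reversals, and 6 pairs sharing only the vowel, the last group being the "first letter in third position and identical vowel" comparisons.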

Subjects—The Ss were 54 introductory 
psychology students who served as part of a
course requirement. They were assigned
to conditions in order of appearance accord-
ing to a prearranged nonsystematic order.
Within each condition, 12 Ss learned one set 
and 15 Ss the replication set. Three S-R 
pairings were used and were run consecu- 
tively. 


Results 


Practice task performance did not 
differ between the HS-1 and HS-2 
groups nor among any of the other 
groups with which comparisons will be 
made. The mean percentage of cor- 
rect responses on each block of four 
trials on the test lists is shown in Fig. 2. 
Also shown in the graph are: the re- 
sults from the low-similarity set of 
Exp. I (List 1); List 6, a set which 
had approximately the same mean simi- 
larity (67.0) as HS-2 but only a single 
rule; List 2, a set using only a single
“one shared letter” rule (mean simi- 
larity 32.9); and the HV-HE group 
from Exp. II which had approximately 
the same similarity rating (15.4) and 
the same number of rules (5), but only 
one case of each “shared letter” rule 
compared with three cases in HS-1. 
The results clearly indicate the greater 
difficulty of HS-1 and HS-2. In fact, 
performance on HS-2 actually shows 
little improvement over the 20 acquisi- 
tion trials.

Fig. 2. Percentage of correct responses on blocks of four trials for conditions compared in Exp. III.

Both HS-1 and HS-2 dif-
fered from the low-similarity condi-
tion, t (50) = 5.80 and 11.58, respec-
tively, and from each other, t (50) =
5.78. Both also differed from HV-HE,
t (50) = 3.68 and 9.47, respectively.
Performance of List 6 was superior to
that of HS-2, t (50) = 6.41, while it
was virtually identical to that of HS-1,
t (50) = .64. All significant ts had
p < .001. List 2 from Exp. I had a 
single rule for sharing a single letter, 
while HS-1 had many rules, but all 
involved sharing a single letter. De- 
spite the lower mean similarity in the 
HS-1 group, performance of this group 
was inferior to that of the group learn-
ing List 2, t (50) = 3.41, p < .001.

Mean number of overt errors was
also higher in HS-1 and HS-2 (32.7 
and 44.7) than in the low-similarity 
group (21.0); however, the errors to 
opportunities ratio was slightly smaller 
for these groups (48.1% and 47.8% 
as opposed to 49.6%).


Discussion 


The experiments indicate that the rela- 
tionship between formal intralist stimulus 
similarity and performance cannot be 
characterized in terms of a simple func- 
tion. Considerable modulation of per- 
formance has been obtained among sets 
of stimuli in which overall similarity was 
virtually the same. The results suggest
that the amount of interference might be 
predicted best from the number of dif- 
ferent letter-position identity rules in- 
volved in the set with this variable be- 
coming increasingly important as the 
number of stimuli sharing letters increases 
and less important when the stimuli form 
obvious groups. 

A basic theoretical question is whether 
the stimuli are coded in terms of analyzed 
letter sequences in which S learns the
rule structure of the set, or whether they 
are coded by pronunciation or some other 
means. Although an explanation in terms 
of stimulus selection is not denied by these 
data, the fact that Ss were not able to 
entirely avoid interference in the single-
rule lists of Exp. I argues that the rule-
making proclivities of Ss may not be as 
extensive as some theorists imply (Mand- 
ler, 1967). Rather, it seems that even 
with low association value stimuli, Ss 
code the stimuli by pronunciation rather 
than attending to the individual letters. 
This type of coding would interfere with 
the abstraction of the rule and be subject 
to “acoustic” similarity in pronunciation. 
Postman and Greenbloom’s (1967) con- 
clusion that pronounceable items are less 
susceptible to stimulus selection and the 
results of studies suggesting an auditory 
representation of units in memory (Me- 
chanic, 1964; Murray, 1966) provide sup- 
port for this notion. 

On the other hand, the data from Exp. 
II and III seem to show a closer relation 
between interference and rule relation- 
ships. Certainly, lists in which items are 
related by multiple rules are more difficult 
than single-rule lists. Furthermore, in- 
dividual similarity relations in hetero- 
geneous lists predict rate of learning of 
individual items and overt errors to a 
considerable degree. Generally, however, 
even these findings are not incompatible 
with the notion of coding by pronuncia- 
tion since differential interference could 
result from interference in grouping or 
organizing items, a process postulated as 
being basic in interference reduction 
(Battig, 1966; Runquist, 1966). The 
effects of rule structure might be due to 
the fact that it is correlated highly with 
similarity of pronunciation. 

The data do not permit a definite an- 
swer to this issue. Either transfer stud- 
ies or special techniques will be necessary 
to determine when and if Ss actually learn 
and use rule relationships among items. 
One thing is apparent: Stimulus selection 
coding is not used fully by Ss even when 
it is to their marked advantage to do so. 
WILLIAM E. ROWETON AND GARY A. DAVIS
Wisconsin Research and Development Center for Cognitive Learning


In a concept identification study, 2 temporal task variables, the pre-
response interval (PRI) and the postinformative feedback (PIF)
interval, and 2 levels of problem difficulty were factorially combined.
Increases in the duration of the PRI, the period between stimulus offset
and S’s overt response, facilitated performance, particularly with more
difficult problems. Similar, though less striking, results were found
for the PIF interval.


In several recent experiments, 
Bourne and his associates (Bourne & 
Bunderson, 1963; Bourne, Guy, Dodd, 
& Justesen, 1965) have studied the ef- 
fects of variations in temporal intervals 
on performance in concept identification 
problems. Bourne and Bunderson or- 
thogonally combined several lengths of 
feedback delay, the interval between S’s 
response and the informative feedback 
signal, with several lengths of the post- 
informative feedback (PIF) or inter- 
trial interval, the interval between the 
feedback signal and the onset of the 
next stimulus. They found, first, that 
variations between 0 and 8 sec. in the 
feedback delay interval did not sys- 
tematically influence performance. Sec- 
ondly, performance was facilitated as 
the PIF interval increased from 1 to 
9 sec. Also, the latter effect was re-
lated to task complexity since greater 
facilitation was found with more com- 
plex problems. The importance of the 
PIF interval was later substantiated by 
Bourne et al. (1965), who found that 
performance improved then worsened 
with longer PIF intervals, the optimal
length being greater with more complex
problems. Bourne (1966, pp. 10, 72)
suggested that Ss may use the PIF in-
terval to “mull over” (rehearse, memo-
rize, assimilate) information provided
by the preceding critical events.

1 This research was performed pursuant to [...] gram.

The purpose of the present study was 
to investigate effects on concept identi- 
fication of variations in another tem- 
poral interval, the interval between the 
stimulus offset and the S’s categorizing 
response. This interval, which will be 
referred to as the preresponse interval 
(PRI), is controlled by requiring S to 
delay his overt response until signaled 
by E. To evaluate any possible inter- 
active effects of the length of the PRI 
with the length of the PIF interval, the 
duration of the PRI was varied orthog- 
onally to the length of the PIF interval. 
Since a lengthened PRI also might al- 
low profitable information processing 
activities, such as relating one or more 
current hypotheses to the available 
stimulus or selecting a response con- 
sistent with both the stimulus and S’s 
hypotheses, it was predicted that in- 
creases in the length of the PRI would 
facilitate concept identification. Based 
on Bourne’s work, it also was predicted 
that increases in the length of the PIF 
interval (over the present 0-20 sec. 
range) also would facilitate perform- 
ance. 
RESPONSE AND FEEDBACK INTERVALS IN CONCEPT LEARNING


METHOD 


Subjects—The Ss were 36 volunteers (4 
males and 32 females) from undergraduate 
educational psychology courses at the Uni- 
versity of Wisconsin. 

Apparatus and stimulus materials—A 
slide projector was used to present a series of 
geometric-patterned stimuli. Each pattern
was a combination of the levels of two rele- 
vant plus two or four irrelevant, binary 
stimulus dimensions: shape (circle or tri- 
angle), color (red or blue), number (1 or 
2), size (large or small), presence or ab- 
sence of a black “X” across the face of the 
pattern, and the presence or absence of a 
notch (wedge) in the side of the pattern. 
All dimensions were highly discriminable. 

Procedure—Each individually run S was
read a standard set of instructions describing 
the task, the stimulus dimensions, the re- 
quired type of two-dimensional conjunctive 
solution, and the criterion of 16 consecutive 
correct responses. The S was instructed to 
respond to each stimulus pattern by writing 
a “plus” or “minus” sign when signaled 
to do so in the appropriate blank of his score 
sheet. The E provided verbal informative 
feedback (“plus” or “minus”) immediately 
after each response. A stopwatch was used 
to control manually (a) the duration of the 
stimulus presentation, which was 5 sec.
throughout the experiment; (b) the length of 
the preresponse interval, the period between 
the offset of the stimulus and the signal for 
S to respond; and (c) the length of the post- 
informative feedback or intertrial interval, 
the period between the feedback signal and 
the onset of the next stimulus. 

Design—A 3 × 2 × 3 repeated measures
design was used. The between-groups fac-
tors were PRI length (0, 10, or 20 sec.) and
problem difficulty (2 or 4 irrelevant stimulus
dimensions). Each of the six Ss in each of
the six groups solved three problems, one problem
at each PIF interval length (0, 10, or 20
sec.). The three problems differed accord-
ing to the particular stimulus dimensions
relevant for solution. For any S, presenta-
tion order of the three PIF intervals and
the three problems was determined by a
row of two 3 × 3 Greco-Latin squares. The
performance measure was trials to criterion.
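To illustrate how a row of a Greco-Latin square fixes both orders at once, the sketch below builds one 3 × 3 square from two orthogonal Latin squares. The particular construction and the problem labels P1 to P3 are assumptions made for illustration; the squares actually used in the study are not given.

```python
PIF_SEC = (0, 10, 20)          # PIF interval lengths named in the Design paragraph
PROBLEMS = ("P1", "P2", "P3")  # hypothetical labels for the three problems

def greco_latin_3x3():
    """Return a 3 x 3 grid of (PIF, problem) pairs from two orthogonal Latin squares."""
    square = []
    for row in range(3):
        cells = []
        for col in range(3):
            pif = PIF_SEC[(row + col) % 3]           # first Latin square
            problem = PROBLEMS[(2 * row + col) % 3]  # second, orthogonal square
            cells.append((pif, problem))
        square.append(cells)
    return square

# Each row is one S's sequence of three (PIF interval, problem) trials blocks.
for subject, row in enumerate(greco_latin_3x3(), start=1):
    print(f"S{subject}:", row)
```

Each row and each column contains every PIF interval and every problem exactly once, and every PIF-problem pairing occurs exactly once in the square, so neither variable is confounded with serial position or with the other.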


RESULTS 


Mean trials to criterion for each level 
of the PRI at each level of problem dif-
ficulty are shown in Fig. 1.

Fig. 1. Mean trials to criterion as a function of PRI length and problem difficulty.

An analy-
sis of variance on these data showed
that performance significantly im- 
proved with longer PRIs, F (2, 30) = 
18.18, p < .001. A trend analysis in- 
dicated that the linear component was 
highly significant, F (1, 30) = 33.61, 
p < .001. The quadratic trend sug-
gested in Fig. 1 did not reach sig-
nificance, F (1, 30) = 2.76, p > .05.
A subsequent Newman-Keuls test
showed that performance in the 0-sec.
PRI condition was significantly worse
than in either the 10-sec. condition,
q = 5.01, df = 30, p < .01, or the 20-
sec. condition, q = 6.69, df = 30, p <
.01. Scores in the 10-sec. and 20-sec.
conditions were not significantly dif- 
ferent. 

The Problem Difficulty main effect 
was significant, F (1, 30) = 16.91, 
p < .001, and the PRI × Problem
Difficulty interaction also reached sig- 
nificance, F (2, 30) = 6.37, p < .005. 
As shown in Fig. 1, the facilitative 
effects of longer PRIs were enhanced 
in the more difficult problems. 

The effects of PIF interval length 
on trials to criterion are shown in Fig. 
2.

Fig. 2. Mean trials to criterion as a function of PIF interval length and problem difficulty.

While performance generally im-
proved with longer PIF intervals, an
analysis of variance showed this effect
to be nonsignificant, F (2, 60) = 2.06,
.25 > p > .10.

The PRI × PIF, the PIF × Prob-
lem Difficulty, and the PRI × PIF ×
Problem Difficulty interactions did not 
approach statistical significance. 

The mean number of trials to cri- 
terion, with standard deviations, for 
the 18 combinations of 3 PRIs, 3 PIF
intervals, and 2 levels of problem com-
plexity are summarized in Table 1. 


Discussion 


The present results are consistent with 
the hypothesis that lengthened PRIs 
facilitate the identification of concepts. 
This effect was greater in solving more 
complex problems. Moreover, the im- 
provement in performance with longer 
PRIs was independent of the length of 
the PIF interval.

The effects of the PIF interval on per-
formance, though less striking, were gen-
erally the same as for the PRI. That is,
performance improved with longer PIF
intervals, especially with more difficult
problems. A comparison of Fig. 1 and 2,
and the F tests for the PRI vs. the PIF
interval main effects, could be taken to
suggest that variations in PRI length
were more effective than variations in
PIF interval length. However, artifacts
of the experimental procedures do not
allow a clear statement regarding the rela-
tive effectiveness of the PRI and the PIF
interval. For example, the fact that each
S solved one problem at each level of
PIF interval, while all three of his prob- 
lems involved the same PRI length, i.e., 
the PIF interval was a within-groups 
variable while the PRI was a between- 
groups variable, conceivably could distort 
a meaningful comparison.

TABLE 1
MEANS AND SDs FOR ALL COMBINATIONS OF PRERESPONSE INTERVAL AND
POSTINFORMATIVE FEEDBACK INTERVAL LENGTHS FOR PROBLEMS WITH TWO AND
FOUR IRRELEVANT STIMULUS DIMENSIONS

PIF Interval Length                       PRI Length (in sec.)
(in sec.)              0                  10                 20                 M

2 Irrelevant Dimensions
0                23.67 (20.03)      16.17 (17.33)      11.17 ( 5.98)      17.00
10               22.50 ([illegible]) 12.17 ( 7.86)     13.17 ([illegible]) 15.95
20               [illegible] ( 4.24) 16.00 ( 8.12)     [illegible] ( 4.23) [illegible]
M                [illegible]        [illegible]        [illegible]

4 Irrelevant Dimensions
0                60.00 (59.76)      29.50 (15.44)      14.50 (12.69)      34.67
10               [illegible]        [illegible]        [illegible]        [illegible]
20               [illegible]        [illegible]        13.00 ( 7.35)      [illegible]
M                42.50              19.56              14.28

Note.—Each figure is based on scores from 6 Ss. SDs are in parentheses.

Also, the 5-
sec. stimulus presentation, which im-
mediately preceded the PRI, would tend
to functionally increase the length of each 
PRI by 5 sec., again complicating a direct 
comparison of the effects of PRI length 
and PIF interval length. Also, it is pos- 
sible that optimal PIF interval lengths 
simply were not included in the present 
design. Perhaps the most reasonable 
conclusion, based on the present results 
and on Bourne and his associates’ (Bourne 
& Bunderson, 1963; Bourne et al., 1965) 
experiments, is that both the PRI and the 
PIF intervals are important periods in 
concept identification during which Ss 
process information. Since part of S’s 
task during the PRI is classifying the im- 
mediately preceding stimulus, and this ac- 
tivity does not occur during the PIF in- 
terval, the precise information processing 
activities during the two intervals prob- 
ably are not identical. Some suggestions 
regarding the nature of these activities 
were mentioned above and, in regard to 
the PIF interval, are further described by 
Bourne (1966). 

As a final note, it is obvious that the 
PRI and the PIF interval together deter- 
mine the total interstimulus interval. It 
is thus conceivable that a massing-spacing 
effect is the important underlying vari- 
able, regardless of where in the sequence 
of events the spacing occurs. However, 


while performance generally did improve 
with increases in total interstimulus in- 
terval length, there were sufficient ir- 
regularities in this relationship to ques- 
tion the generality of the massing-spacing 
suggestion. In addition, the fact that 
Bourne and Bunderson (1963) did not 
find a systematic change in performance 
when the feedback delay interval was 
varied, which also determines the length 
of the interstimulus interval, similarly 
would be inconsistent with an overall 
massing-spacing hypothesis. Rather, to 
facilitate concept identification, it seems 
likely that additional time must occur at 
particular points in the sequence of events, 
such as the PRI and the PIF intervals. 
IMPORTANCE OF VARIANCE PREFERENCES
IN GAMBLING DECISIONS *


PAUL SLOVIC AND SARAH LICHTENSTEIN


Oregon Research Institute, Eugene 


The variance of outcomes in a gamble is presumed by many to be an 
important determinant of the gamble’s attractiveness. However, be-
cause the variance is confounded with the gamble’s probabilities
and payoffs, behaviors that have been interpreted in past studies as 
indicative of variance preferences are subject to alternative interpreta- 
tions. The present study reports 3 experiments in which variance 
was manipulated without changing the particular probabilities and 
payoffs that were explicitly displayed to S. This manipulation was 
made possible by the use of a specially constructed “duplex gamble.” 
Results indicated that variance is at best a minor determinant of 
gambling decisions. Variance preferences observed in previous studies
appear to be artifacts generated by other decision strategies.


Since Bernoulli's (1954) explication 
of the St. Petersburg paradox in 1738, 
it has been evident that the expected 
value (EV) of a risky course of action 
is not the sole determiner of its attrac- 
tiveness for the decision maker. Ber- 
noulli and others proposed that people 
maximize expected utility (EU) where 
utility is the subjective value of money. 
More recently, EU theory has been 
amended to incorporate the notion that 
people make decisions on the basis of 
the probabilities as they perceive them 
rather than the stated probabilities.
This amended version states that be- 
havior is governed by the subjectively 
expected utility (SEU) of a gamble 
(Edwards, 1955). 

One of the most persistent and seri- 
ous criticisms of EU and SEU theories 
has been the argument that persons 
base their risk-taking decisions to a 
considerable extent on the dispersion
of a gamble’s possible outcomes (Allais, 
1953; Fisher, 1906). Any gamble can 
be viewed as a probability distribution 
over the possible monetary outcomes. 
The mean of this distribution is the 
gamble’s EV. The variance of this 
distribution has been the most com- 
monly studied index of dispersion. 
For a gamble that offers probability p
to win A dollars and probability q to
win B dollars,


variance = pq (A − B)².
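The formula follows from the definition of variance when q = 1 − p: with EV = pA + qB, the deviations are A − EV = q(A − B) and B − EV = −p(A − B), so the variance is pq²(A − B)² + qp²(A − B)² = pq(A − B)². A small numerical check is sketched below; the function names and the third test gamble are illustrative assumptions.

```python
def two_outcome_stats(p, a, b):
    """Mean and variance of a bet paying a with probability p and b otherwise."""
    q = 1 - p
    ev = p * a + q * b
    var = p * (a - ev) ** 2 + q * (b - ev) ** 2  # variance from its definition
    return ev, var

def shortcut_variance(p, a, b):
    """Closed form pq(A - B)^2, valid when q = 1 - p."""
    return p * (1 - p) * (a - b) ** 2

# Even-money bets of $1 and $2, plus one arbitrary asymmetric bet.
for p, a, b in [(0.5, 1, -1), (0.5, 2, -2), (0.25, 4, -1)]:
    ev, var = two_outcome_stats(p, a, b)
    print(f"p={p} A={a} B={b}: EV={ev:.2f} var={var:.4f} "
          f"shortcut={shortcut_variance(p, a, b):.4f}")
```

The two even-money cases give variances of 1 and 4, so doubling the stakes quadruples the variance while leaving the expected value at zero.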


A person’s “utility for risk” has often 
been conceptualized in terms of his 
preference for variance (Coombs & 
Pruitt, 1960; Kogan & Wallach, 1967; 
Royden, Suppes, & Walsh, 1959; 
Slovic, 1962; Van der Meer, 1963). 

Although the first experimental 
study of variance preferences (Ed- 
wards, 1954) concluded that they were 
less important than probability prefer- 
ences, studies done since have been 
interpreted as indicating that variance 
has rather strong effects (Coombs & 
Pruitt, 1960; Davidson & Marschak, 
1959; Lichtenstein, 1965; Littig, 1962;
Royden, Suppes, & Walsh, 1959; 
Van der Meer, 1963). Edwards 
(1961) noted that some decision
theorists felt that “the variance of a 
bet is as important as its SEU in 
determining its attractiveness [p. 
485].” Kogan and Wallach (1967) 
concluded their review of relevant 
studies by saying: 


Empirical evidence is strongly suggestive 
of the view that the variance properties of a 
bet (or properties closely akin to the dis- 
persion of potential outcomes) exercise con- 
siderable influence over whether a bet will 
be chosen or rejected [p. 119]. 


Because the variance of a gamble is 
invariably confounded with the gam- 
ble’s probabilities and payoffs, be- 
haviors interpreted by some as indica- 
tive of variance preferences are subject 
to alternative interpretations. For 
example, consider the choice between 
the following two gambles: 


A. .5 win $1              B. .5 win $2
   .5 lose $1                .5 lose $2
   (variance = 1.00)         (variance = 4.00)


While the choice of Gamble A over 
Gamble B could be attributed to a pref- 
erence for minimizing variance, it could 
be explained alternatively as a desire 
to maximize EU or to minimize the 
maximum possible loss. Similarly, 
while the choice of Gamble B could be 
due to a desire to maximize variance, 
it could just as well be interpreted as 
an attempt to maximize the potential 
gain. Since, by definition, a manipula- 
tion of variance must involve a change 
in the underlying distribution of prob- 
abilities and payoffs that characterizes 
a gamble, the task of untangling the 
effects of variance from the effects of 
other aspects of the payoff distribution 
is indeed formidable. However, it is 
possible to manipulate variance without 
changing the particular probabilities 
and payoffs explicitly displayed to 
S. One device for doing this involves 
a stimulus called a “duplex gamble,” 
as illustrated at the top of Fig. 1. A 
duplex gamble is represented by two 
discs, each with a pointer on it. The 
left-hand disc determines winnings. 
The right-hand disc is used to deter- 
mine losses. To play, S must spin 
pointers on both the winning and losing 
discs. This means that S can win and 
not lose, lose and not win, both win 
and lose, or neither win nor lose. 

The lower gamble in Fig. 1 is a 
standard two-outcome bet, represented 
by a single disc with a spinner. If 
the spinner lands in the upper portion 
of the disc, S wins. If it lands within 
the bottom sector, S loses. 

The duplex and standard gambles 
shown in Fig. 1 are termed parallel 
because they have the same stated 
probabilities and payoffs. The duplex 
bet and its parallel standard gamble 
have the same EV—namely, pA + qB. 
However, they differ in variance. If 
S plays the standard bet, he will either 
gain $2 or lose $2. If he plays the 
duplex bet, he has a fairly high prob- 
ability of breaking even. The duplex 
bet obviously has less variance (half 
as much, in this case). 
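
The "half as much" figure can be checked numerically. The sketch below is illustrative only: it assumes that the two discs of a duplex bet are spun independently and that both bets in Fig. 1 use probabilities of .5 with $2 payoffs (the text states the $2 payoffs; the .5 probabilities are an assumption). The function names are hypothetical.

```python
def standard_variance(p_win, win, lose):
    """Variance of a two-outcome bet: win `win` with p_win, else lose `lose`."""
    q = 1.0 - p_win
    # Outcomes are +win and -lose, so (A - B) = win + lose.
    return p_win * q * (win + lose) ** 2


def duplex_variance(p_win, win, p_lose, lose):
    """Variance of a duplex bet whose two discs are spun independently."""
    var_win_disc = p_win * (1.0 - p_win) * win ** 2
    var_lose_disc = p_lose * (1.0 - p_lose) * lose ** 2
    # Variances of independent components add.
    return var_win_disc + var_lose_disc


standard = standard_variance(0.5, 2.0, 2.0)
duplex = duplex_variance(0.5, 2.0, 0.5, 2.0)
print(standard, duplex)
```

Under these assumptions the duplex variance is half the standard variance whenever the amount to win equals the amount to lose; with unequal payoffs the ratio differs.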

It seems reasonable to expect that 
Ss preferring low variance would find 
the duplex bet more attractive than 
its parallel standard bet, while Ss pre- 
ferring high variance would find the 
standard bet more appealing. How- 
ever, the interpretation of an observed 
preference for standard or duplex bets 
still remains equivocal. Rather than 
variance, nonlinear subjective prob- 
abilities and utilities applied to the 
values of the underlying distribution 
could well be the critical determinants. 
Thus, observed variance preferences 
will always represent the upper limit 
of the true influence of dispersion. On 
the other hand, if persons typically 
make decisions about gambles by employing 
strategies that involve only the 
stated probabilities and payoffs, disregarding 
the differences in dispersion 
of the underlying distributions, they 
should have no strong preferences between 
duplex gambles and their 
parallel standard bets. The following 
experiments evaluated the influence of 
variance when manipulated by means 
of parallel standard and duplex bets. 

Fig. 1. Comparison between parallel duplex and standard gambles. 


EXPERIMENT I 
Method 


A set of 36 duplex bets was constructed by 
crossing four combinations of p and q (.2, .8; 
.4, .6; .6, .4; .8, .2) with three amounts to 
win and three amounts to lose. These 
amounts were 10¢, 20¢, and 40¢. For each 
duplex bet, a parallel standard bet also was 
constructed. This parallel bet had the same 
stated levels on the risk dimensions but differed 
in variance as shown in Fig. 1. 

The Ss were 19 male undergraduates from 
the University of Oregon who evaluated the 
attractiveness of each duplex and each stand- 
ard gamble by means of the selling price 
version of the bidding technique. With this 
method, Ss were asked to imagine that they 
own a ticket to play each gamble. They 
were asked to state the least amount of 
money for which they would sell this ticket. 
For unattractive gambles, they stated the 
highest amount they would pay E in order to 
avoid playing the gamble. This procedure 
has been used previously by Becker, DeGroot, 
and Marschak (1964), Lindman (1965), 
Coombs, Bezembinder, and Goode (1967), 
and Tversky (1967a, 1967b). 

The evaluations of the standard bets were 
made twice. The Ss first evaluated 12 of 
the duplex bets followed by the first replica- 
tion of all 36 standard bets. Next came 12 
more duplex bets followed by the second 
replication of the standard bets. Then came 
the last 12 duplex bets. 

The Ss’ salaries for participating in the 
experiment were determined by having them 
actually play a subset of the gambles they 
indicated as most attractive. 


Results 


The data were analyzed by compar- 
ing the absolute difference between the 
bids for the two presentations of a 
particular standard bet with the ab- 
solute difference between the bid for 
each presentation of the standard bet 
and the bid for the corresponding du- 
plex gamble. In other words, the 
unreliability of replication was used as 
a yardstick against which to assess the 
differences between duplex and stand- 
ard gambles. By comparing absolute 
differences, any type of effect, whether 
attraction or aversion to a given vari- 
ance, would contribute toward making 
the difference between standard and 
duplex gambles larger than the differ- 
ence between the two replications of 
the standard gamble. This procedure 
should allow maximum opportunity 
for variance preferences to emerge. 
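
The comparison described above can be sketched in code. This is a minimal illustration, not the original analysis: the bids are invented, the function name is hypothetical, and the signed-rank statistic is computed by the usual textbook procedure (zero differences dropped, tied absolute differences given their average rank).

```python
def signed_rank_statistic(x, y):
    """Return (W_plus, W_minus) for paired samples x and y.

    Zero differences are dropped and tied absolute differences share
    their average rank.
    """
    diffs = [a - b for a, b in zip(x, y) if a != b]
    ordered = sorted(diffs, key=abs)
    ranks = [0.0] * len(ordered)
    i = 0
    while i < len(ordered):
        j = i
        while j + 1 < len(ordered) and abs(ordered[j + 1]) == abs(ordered[i]):
            j += 1
        average_rank = (i + j) / 2.0 + 1.0  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[k] = average_rank
        i = j + 1
    w_plus = sum(r for d, r in zip(ordered, ranks) if d > 0)
    w_minus = sum(r for d, r in zip(ordered, ranks) if d < 0)
    return w_plus, w_minus


# Hypothetical bids (in cents) from one S for four gambles.
standard_rep1 = [10, 25, -5, 40]
standard_rep2 = [12, 20, -5, 35]
duplex_bids = [20, 5, 10, 15]

# Yardstick: unreliability of replication for the same standard bet.
replication_dev = [abs(a - b) for a, b in zip(standard_rep1, standard_rep2)]
# Effect of interest: duplex bid vs. the parallel standard bid.
duplex_dev = [abs(d - s) for d, s in zip(duplex_bids, standard_rep1)]

w_plus, w_minus = signed_rank_statistic(duplex_dev, replication_dev)
```

Because absolute deviations are compared, a large W_plus relative to W_minus indicates that duplex and standard bids differ by more than replication error, whichever direction the preference takes.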

The absolute deviations between the 
bids for the two presentations of each 
standard gamble and the absolute dif- 
ferences between parallel duplex and 
standard gambles were averaged across 
the 36 pairs of gambles. These aver- 
ages are presented in Table 1. The 
distribution of differences between du- 
plex and standard bets was compared 
statistically with that of the differences 
between standard gambles by means of 
the Wilcoxon matched-pairs signed-ranks 
test (Siegel, 1956). Significant 
differences suggestive of variance preferences 
were evident for only three Ss 
(Ss 5, 11, and 18). Of these, only S 11 
had a sizable mean effect. For Ss 
6, 10, and 16, the difference between 
the duplex bet and the first replication 
of the standard gamble was significantly 
greater than the differences between 
standard gambles, but this effect 
failed to replicate when the duplex 
gambles were compared with the second 
presentation of the standard bets. 
Thus, for the great majority of Ss, the 
differences between parallel duplex and 
standard bets were no larger than the 
differences between the replications of 
the same standard bet, indicating the 
lack of any discernible variance preferences. 


TABLE 1 


AVERAGE ABSOLUTE DEVIATIONS BETWEEN 
BIDS FOR TWO REPLICATIONS OF A 
STANDARD GAMBLE AND BETWEEN 
BIDS FOR PARALLEL DUPLEX 
AND STANDARD GAMBLES 


                 Average Absolute Deviation in Cents 

            Standard 
            (Replication 1)    Duplex vs.         Duplex vs. 
Subject     vs. Standard       Standard           Standard 
No.         (Replication 2)    (Replication 1)    (Replication 2) 

 1            2.61               2.61               3.33 
 2            2.86               3.64               2417 [sic] 
 3            3.55               3.70               2.39 
 4            4.56               5.56               5.17 
 5            0.00               4.20**             4.20* 
 6            7.37              10.46*              6.80 
 7            9.81               8.97               4.66 
 8            3.69               4.19               2.67 
 9            0.22               0.44               0.56 
10            9.58              12.20*              6.81 
11            2.83              13.40**            12.83** 
12           16.36              13.52               5.09 
13            4.18               3.97               3.42 
14            3.83               3.14               3.97 
15            3.06               3.83               4.22 
16            3.58               5.08*              4.17 
17            3.86               4.14               3.88 
18            6.45               9.55*              [illegible]* 
19            5.19               5.44               5.06 
Average 
over Ss       4.96               6.21               4.87 


* Significantly greater than Column 1 at p < .05, one-tailed test. 
** Significantly greater than Column 1 at p < .01, one-tailed test. 


EXPERIMENT II 


Supporters of the notion that vari- 
ance preferences are important might 
criticize Exp. I on the grounds that the 
payoffs, and accordingly, the differ- 
ences in variance, were rather small. 
Therefore, comparisons were made, in 
Exp. II, between parallel duplex and 
standard bets that had considerably 
larger differences in variance than 
those studied in the first experiment. 
In addition, each bid was replicated 
six times in an attempt to obtain a 
stable estimate of the attractiveness of 
each bet. 


Method 


Six bets served as the critical stimuli for 
this experiment. These bets are illustrated 
in Fig. 2. 

Each bet had zero expected value. Bet 1 
was a standard gamble with variance equal 
to 13.0. Its parallel duplex bet, having the 
same stated probabilities and payoffs but a 
variance equal to 6.5, is shown as Bet 5. 
A third bet was constructed to state explicitly 
the true probabilities and payoffs under- 
lying this duplex bet. This third bet had, 
of course, the same low variance as the 
duplex bet. It was called a complex bet 
and is shown as Bet 6 in Fig. 2. In addition, 
a second duplex bet having P_W = P_L = .5, yet 
with a range of payoffs increased so that the 
bet’s variance equalled 13.0, was constructed 
(Bet 2) along with a second complex bet 
(Bet 3), whose dimensions were those un- 
derlying this high-variance duplex bet. The 
remaining bet (Bet 4) was a standard bet 
whose range of payoffs was reduced to the 
extent necessary for its variance to equal that 
of the low-variance duplex and complex 
bets. 
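
The stated variances of the six bets can be reproduced from their payoffs. The sketch below is illustrative: the dollar amounts ($3.60, $5.09, and $2.55) are read from Fig. 2, the discs of a duplex bet are assumed to be independent with probabilities of .5, and the helper names are hypothetical. A duplex bet and the complex bet that spells out its underlying distribution share one entry because they have the same distribution.

```python
def variance(outcomes):
    """Variance of a list of (probability, payoff) pairs."""
    mean = sum(p * x for p, x in outcomes)
    return sum(p * (x - mean) ** 2 for p, x in outcomes)


def standard_bet(amount):
    """Even odds to win or lose `amount`."""
    return [(0.5, amount), (0.5, -amount)]


def complex_bet(amount):
    # Distribution underlying a duplex bet with P(win) = P(lose) = .5 and
    # independent discs: win only, lose only, or net zero (both or neither).
    return [(0.25, amount), (0.25, -amount), (0.5, 0.0)]


bets = {
    "Bet 1 (standard, $3.60)": standard_bet(3.60),
    "Bets 2 and 3 (duplex and complex, $5.09)": complex_bet(5.09),
    "Bet 4 (standard, $2.55)": standard_bet(2.55),
    "Bets 5 and 6 (duplex and complex, $3.60)": complex_bet(3.60),
}
variances = {name: round(variance(b), 2) for name, b in bets.items()}
for name, value in variances.items():
    print(name, value)
```

Under these assumptions Bets 1 through 3 come out at roughly 13.0 and Bets 4 through 6 at roughly 6.5, in line with the values given in the text.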

This set of bets permits a comparison be- 
tween two different variance manipulations: 
the first is the traditional method of decreas- 
ing variance by decreasing range of payoffs; 
the second decreases variance by introducing 
intermediate outcomes via the duplex bet. 
An S who prefers high variance should bid 
more for Bet 1 than for Bet 4, more for Bet 
2 than Bet 5, and more for Bet 3 than Bet 6 
because of the greater ranges of payoffs in 
Bets 1, 2, and 3. This same S should also 
bid more for Bet 1 than for Bet 5 or Bet 6 
because of the intermediate payoffs that can 
occur with the latter two bets. Similarly, 
the bids of an S who prefers low variance 
should exhibit the opposite orderings among 
these five pairs of bets. 

These two sets of orderings will be re- 
ferred to later as the criteria defining con- 
sistent preference for high or low variance. 
Comparisons between high-variance bets such 
as Bets 2 and 3 and a low-variance bet such 
as Bet 4 do not provide as good an index of 
variance preference as the aforementioned 
pairings. This is because Bets 2 and 3 have 
intermediate outcomes which may counteract 
their greater range of outcomes and thus 
make them appear, to some people, to have 
variance similar to or even lower than Bet 4. 
In contrast, the relative variance within each 
of the five criterion pairs should almost cer- 
tainly be veridically perceived. 
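
The five criterion pairs described above lend themselves to a mechanical check. The following sketch counts how many pairs are ordered as a high-variance or as a low-variance preference would predict; the function name and the example bids are illustrative and are not part of the original procedure.

```python
# Each criterion pair is (higher-variance bet, lower-variance bet).
CRITERION_PAIRS = [(1, 4), (2, 5), (3, 6), (1, 5), (1, 6)]


def criteria_satisfied(mean_bids):
    """Count criterion pairs ordered as high- and as low-variance preference.

    `mean_bids` maps bet number (1-6) to an S's mean bid in dollars.
    Returns (high_count, low_count); ties count toward neither.
    """
    high = sum(1 for hi, lo in CRITERION_PAIRS if mean_bids[hi] > mean_bids[lo])
    low = sum(1 for hi, lo in CRITERION_PAIRS if mean_bids[hi] < mean_bids[lo])
    return high, low


# Illustrative mean bids for one S.
example = {1: 3.20, 2: 2.50, 3: 2.08, 4: 2.06, 5: 2.25, 6: 2.35}
high, low = criteria_satisfied(example)
print(high, low)
```

An S whose bids satisfy all five criteria in one direction would score (5, 0) or (0, 5); the example S bids more for the higher-variance bet in four pairs and less in one (Bet 3 vs. Bet 6).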

The six bets were randomly intermixed 
with 36 other bets, 12 of them standard bets, 
12 duplex, and 12 complex. They were presented 
to S in a booklet. The set of 42 bets 
was presented six times, producing a total 
of 252 bids from each S. 

Fig. 2. Stimulus bets used in Exp. II. [Figure: Bets 1-3 (standard, duplex, and complex) have variance = 13; Bets 4-6 (standard, duplex, and complex) have variance = 6.5.] 

The Ss were 24 male undergraduates from 
the University of Oregon who were paid for 
their participation. 


Results 


Table 2 presents the mean bids, 
across six replications for each bet, 
for each S. The only S whose mean 
bids satisfied all five criteria for con- 
sistent variance preferences was S1. 
The Ss 2-6 satisfied four of the five 
criteria. An analysis of variance per- 
formed on the data from each of these 
Ss showed that the pair of means 
(underlined in the table) whose order- 
ing differed from the orderings of the 
other criterion pairs differed significantly 
in each instance. The sizable 
magnitude of these violations makes it 
doubtful that the bids of these Ss were 
strongly influenced by variance con- 
siderations. 

The mean bids for Ss 7, 8, and 9 
varied considerably across bets but not 
in any manner related to variance. 
The Ss 10, 11, 12, and 13 showed 
significant differences in bids for gam- 
bles that differed in payoff range, but 
their bids for Bet 5 and its parallel 
standard gamble (Bet 1) did not dif- 
fer significantly. 

The most numerous category in- 
cluded the remaining 11 Ss. The bids 
of these Ss did not differ as a function 
of the variance or format of the bets. 

In summary, at most, 6 out of 24 Ss 
showed orderings of bids that were 


TABLE 2 


MEAN BIDS (IN DOLLARS) AND CATEGORIZATIONS OF Ss WITH RESPECT 
TO VARIANCE PREFERENCES 


Subj. No. | Bet 1 Standard, High Variance | Bet 2 Duplex, High Variance | Bet 3 Complex, High Variance | Bet 4 Standard, Low Variance | Bet 5 Parallel Duplex, Low Variance | Bet 6 Complex, Low Variance | Category | Number of Criteria Satisfied 
s —.50 —.17 | High-Variance 5/5 
1 2.38 «83 2.33 1.69 igh-Variane 
2 3.20 2.50 2.08 2.06 2.25 2.35 | High-Variance 4/5 
3 2.30 1.58 83. 1.04 1.17 10 Preference. 
4 64 1.75 3.07 -60 208 m 
5 245 05 1.55 | —.60 —.99 1.28 
= z 2413 .58 | Low-Variance 4/5 
Tbs 2 oe + — Preference. 
- .00 -.90 2.73 | No Variance Prefer- 
5 07 5 | 180 ‘so | ist | =.67 | ence Pattern. 
9 —.20 aT 2.33 —.10 23 —.24 
1.70 2.12 2.25 | Significant Differences 
it 33a 309 535 2.55 3:50 1.10 between High and 
12 7 1,00 1:00 50 71 “71 | Low Variance:ex: | 
13 ‘00 ‘67 83 100 100 117 | cept for Bets 1 an 
K i 5 Where Displayed 
Dimensions Were 
Equal. 
= .00 | No Significant Differ- 
is Taek ENE Tee 08 ‘oz | ences of Any Sort. 
x 5 i : i 
16 —.03 «00 200 —.27 200 : 
17-24 200 100 100 100 100 ‘00 
58 
M 33 -60 85 39 29 X 
(N = 24) 


Note. Underlined values for Ss 2-6 illustrate pairs of means that violated the predominant preference 
trend (failed to satisfy a criterion for consistent preference). 
consistent with variance preferences 
and continued to be consistent when 
explicitly stated payoffs and probabil- 
ities were made equal. Five of these 
Ss exhibited an ordering on one pair 
of bids that deviated to a sizable extent 
from a consistent variance preference 
policy. 


EXPERIMENT III 


The results of Exp. I and II indi- 
cated that variance per se has little 
systematic influence on bidding re- 
sponses. However, these results do 
not preclude the possibility that vari- 
ance might exert a strong influence on 
choices among gambles. Actually, the 
majority of studies purporting to dem- 
onstrate the influence of variance have 
employed choices rather than bids as 
the dependent variable. 

Experiment III was designed to 
examine the importance of variance in 
a choice situation. Specifically, the 
experiment tested whether Ss would 
exhibit consistent preference for either 
a duplex bet or its parallel standard 
gamble when these were paired, and, 
if so, whether giving the less preferred 
gamble a slightly greater winning pay- 
off and a slightly smaller potential loss 
could reverse the preference. 


Method 


Each S was presented with 88 pairs of 
gambles and asked to select the member of 
each pair he would prefer most to play. He 
was further instructed to choose as though 
each decision was the only one he would get 
to make. 

Each pair matched a standard two-outcome 
gamble against a duplex gamble. All stand- 
ard bets had a .5 probability of winning and a 
.5 probability of losing. The duplex bets 
offered a .5 probability to win a stated 
amount on the left-hand disc and a .5 chance 
to lose a different amount on the right-hand 
disc. 

For 20 pairs (Set 0), both standard and 
duplex bets had exactly the same payoffs 
(and, of course, the same EV). For 18 
pairs (Set 5S) the standard gamble offered 
5¢ more to win and 5¢ less to lose (5¢ more 
EV) than the duplex gamble. For example, 
one of these pairs pitted a standard bet offer- 
ing even odds to win $1.60 or lose $1.40 
against a duplex bet whose winning payoff 
was $1.55 and whose potential loss was $1.45. 
A second pairing in this set consisted of a 
standard gamble with winning and losing 
payoffs of $1.65 and $1.35, respectively, vs. a 
duplex bet whose payoffs were $1.60 and 
$1.40. The duplex bet was given this same 
type of 5¢ advantage in 18 other pairs (Set 
5D). For 16 pairs, the standard gamble 
offered 10¢ more to win and 10¢ less to lose 
(10¢ more EV) than the duplex gamble (Set 
10S). Similarly, the duplex gamble had 
this 10¢ advantage in the remaining 16 pairs 
(Set 10D). 

Within each of these five sets, the pairs of 
gambles were evenly divided into those in 
which the amount to win exceeded the 
amount to lose (positive EV) for both gam- 
bles and pairs in which both bets had nega- 
tive EV. There were 10 combinations of 
winning and losing payoffs for positive EV 
gambles. These varied in increments of 5¢ 
starting from a win of $1.55 and a loss of 
$1.45, going next to a win of $1.60 and a loss 
of $1.40, etc., and ending with a bet offering 
a win of $2.00 and a loss of $1.00. The win- 
ning and losing payoffs were simply reversed 
when constructing the negative EV gambles. 
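
The design described above can be reconstructed to confirm that the five sets add up to 88 pairs. The sketch is illustrative and rests on one assumption not stated in the text: that both members of every pair draw their payoffs from the same 10 positive-EV (or 10 negative-EV) combinations. Amounts are in cents, and the function names are hypothetical.

```python
def payoff_combinations():
    """Return (positive_ev, negative_ev) lists of (win, lose) in cents."""
    # Wins of $1.55 to $2.00 in 5-cent steps; win + lose is always $3.00.
    positive = [(win, 300 - win) for win in range(155, 205, 5)]
    negative = [(lose, win) for win, lose in positive]  # payoffs reversed
    return positive, negative


def build_set(advantage):
    """Pairs (favored, other) in which `favored` wins `advantage` cents more
    and loses `advantage` cents less; advantage 0 gives identical payoffs."""
    pairs = []
    for gambles in payoff_combinations():
        available = set(gambles)  # keeps both members at the same EV sign
        for win, lose in gambles:
            favored = (win + advantage, lose - advantage)
            if favored in available:
                pairs.append((favored, (win, lose)))
    return pairs


set_sizes = {
    "Set 0": len(build_set(0)),
    "Set 5S": len(build_set(5)),    # standard bet favored
    "Set 5D": len(build_set(5)),    # duplex bet favored
    "Set 10S": len(build_set(10)),  # standard bet favored
    "Set 10D": len(build_set(10)),  # duplex bet favored
}
total_pairs = sum(set_sizes.values())
print(set_sizes, total_pairs)
```

Under this assumption the set sizes match those reported in the text, and the two example pairs given for Set 5S ($1.60/$1.40 vs. $1.55/$1.45, and $1.65/$1.35 vs. $1.60/$1.40) both appear in the generated set.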

The Ss were 19 male and 12 female under- 
graduates from the University of Oregon 
who were paid for participating in the ex- 
periment. 


Results 


The proportion of times in which S 
selected the standard gamble instead of 
the duplex was calculated separately 
for each subset of pairs. The resulting 
sets of proportions enabled Ss to be 
placed into one of the four distinct 
categories indicated in Table 3. The 
preferences of Ss placed in the same 
category were quite homogeneous and 
are thus well described by the means 
given in Table 3. Only 2 of the 31 Ss 
could not be categorized in any mean- 
ingful way. 

Category 1 includes the 3 Ss who 
exhibited a consistent preference for 
duplex gambles across all sets of pairs. 
TABLE 3 

MEAN PERCENTAGE OF PAIRS IN WHICH THE STANDARD GAMBLE WAS PREFERRED 
OVER THE DUPLEX GAMBLE 

                                                             Set 
Category                                            10D   5D    0    5S   10S 

1. Consistent Preference for Duplex Gambles 
   (N = 3)                                           00   [remaining values illegible] 
2. Preferences Moderately Influenced by Changes in 
   Payoffs (N = 7)                                   05   15   28   49   59 
3. Preferences Strongly Influenced by Changes in 
   Payoffs (N = 11)                                  05   04   21   86   93 
4a. Preferences That Interacted with Sign of the 
   Expected Value (N = 5) 
      +EV                                            43   58   78   74   75 
      −EV                                            05   02   02   04   15 
4b. Preferences That Interacted with Sign of the 
   Expected Value (N = 3) 
      +EV                                            00   00   00   18   08 
      −EV                                            75   82   80   96   96 


There were no Ss who preferred stand- 
ard gambles universally. Category 2 
includes 7 Ss whose preferences were 
influenced to a moderate extent by dif- 
ferences in payoffs, and Category 3 
contains the 11 Ss who were very 
strongly influenced by payoffs. The 
preferences of most of these 18 Ss were 
as much influenced by a 5¢ difference 
in payoffs or EV as by a 10¢ difference. 
Category 4 includes Ss who showed 
a preference for either duplex or stand- 
ard bets but whose preferences reversed 
themselves when the signs of the bets’ 
EVs were reversed. The five Ss in 
Category 4a preferred the standard 
gamble when EV was positive and the 
duplex gamble when EV was negative. 
The three Ss in Category 4b showed the 
opposite preference pattern. Five of 
these Ss also seem to have been influenced, 
at times, by the payoff differences 
between members of a pair, but 
this difference was small compared to 
the effect produced by the sign of EV. 
These Ss are reminiscent of the Ss 
in the Coombs and Pruitt (1960) 
study who exhibited variance preferences 
that changed as a function of the 
probabilities involved (see also Pruitt, 
1962). 


In summary, consistent preferences 
among duplex and standard gambles 
were more noticeable in this choice 
situation than in the previous bidding 
experiments. However, the prefer- 
ences of the majority of Ss were not 
strong enough to persist when the less 
preferred gamble was given a 5¢ ad- 
vantage in EV. 


Discussion 


At the very most, only 20% of the Ss 
in the first two experiments bid differen- 
tially for parallel standard and duplex 
gambles in a way that could be interpreted 
as preference for high or low variance. 
In Exp. III, less than one-third of the Ss 
exhibited preferences for standard bets 
over parallel duplex bets or vice versa 
that were strong enough to persist in the 
face of extremely small disadvantages in 
the stated payoffs. Since these percent- 
ages represent an upper limit to the influ- 
ence of variance, it must be concluded that 
dispersion or variance, as manipulated 
here, is at best a minor determinant of 
bids and choices. 

Some persons may wish to argue that 
range of payoffs is the aspect of disper- 
sion that is psychologically relevant and 
the reason these experiments failed to 
demonstrate any effect of dispersion is 
because parallel duplex and standard bets 
have the same range. It appears unlikely 
that range would be psychologically 
meaningful when variance, as manipulated 
here, was not. 

The most parsimonious explanation of 
the present results, as well as behaviors 
previously labeled as variance preferences, 
would seem to be that the decisions of 
most persons are determined by factors 
such as nonlinear subjective probabilities 
and utilities or by strategies that employ 
only the stated probabilities and payoffs. 
An example of the latter would be the 
minimization of maximum loss or the 
maximization of gain. Other such de- 
terminers are described by Slovic and 
Lichtenstein (1968). Variance seems to 
be too subtle a factor to exert significant 
influence. 
LOW-FREQUENCY VIBROTACTILE ADAPTATION¹ 


J. F. HAHN 


University of Virginia 


A comparison was made of the effects of 2 different vibrotactile 
adapting stimuli (10 or 200 Hz. each at 14 db. sensation level) de- 
signed to stimulate, in the index fingerpad, 2 different receptor 
populations. Cross-adaptation measurements indicated that 2 different 
receptor populations were indeed adapted, but there were no marked 
differences in their adaptation or recovery. Vibrotactile adaptation 
probably reflects sensorineural mechanisms rather than an effect of 


mechanical properties of tissue. 


Psychophysical measurements of 
tactile adaptation to prolonged vibra- 
tory stimuli have been made by pre- 
vious investigators both for comparison 
with visual and auditory adaptation 
data and for investigation of touch sen- 
sitivity per se (Békésy, 1959; Cohen 
& Lindley, 1938; Hahn, 1966, 1968; 
Wedell & Cummings, 1938). These 
measurements were all carried out at 
frequencies of 50 Hz. or higher. 

In recent investigations, both psy- 
chophysical (Verrillo, 1963) and elec- 
trophysiological (Lindblom & Lund, 
1966; Mountcastle, Talbot, Darian- 
Smith, & Kornhuber, 1967), it has 
been concluded that vibrotactile ab- 
solute thresholds below 40-50 Hz. 
are mediated by different receptors 
than those involved at higher fre- 
quencies. The purpose of the present 
psychophysical investigation was to 
measure the course of vibrotactile 
adaptation and recovery with stimuli 
designed to affect only the low-fre- 
quency receptors, and to compare these 
data with data obtained at the same 
time for higher frequency receptors. 


1 It is a pleasure to acknowledge partial 
support of this investigation by United States 
Public Health Service Research Grant NB- 
04177 from the National Institute of Neuro- 
logical Diseases and Blindness. 


METHOD AND RESULTS 


At the end of O’s armrest was a flat 
plastic fingerrest with a round hole, 
4 mm. in diameter, on which he rested 
his right index fingerpad. Projecting 
through the hole from beneath was a 
short wooden cylinder with a circular 
tip of 1.5 mm. diameter, driven by a 
vibration generator (Goodmans V-47) 
so mounted and counterweighted that 
it exerted a static pressure of 9 gm. 
The driving current, its wave form 
monitored by an oscilloscope connected 
across the V-47 terminals, was the out- 
put of an amplifier driven by a beat- 
frequency oscillator whose frequency 
was monitored by a crystal-controlled 
frequency meter. During adaptation 
and during threshold measurements, 
auditory cues from the vibration gene- 
rator were masked by white noise pre- 
sented at about 90-db. SPL via ear- 
phones worn by O. Stimulus inten- 
sities were recorded as voltages across 
the generator input, these voltages 
having been calibrated against the 
peak-to-peak vibration amplitude of the 
contactor measured under stroboscopic 
illumination by means of a microscope 
with filar micrometer eyepiece. Damp- 
ing of the contactor motion by the skin 
was about 10%, as measured by a 
photoelectric monitor, and did not 
change during the course of adaptation 
and recovery measurement. The 
damping does not affect the data of 
this experiment because they are re- 
ported as ratios of adapted to un- 
adapted thresholds. 

Observers. The author, a colleague, 
and a graduate student served as Os.² 
Two of the Os had served in a previous 
experiment on vibrotactile adaptation ; 
the other O received several days of 
familiarization before beginning the 
experiment. 

Procedure. If it is assumed that 
the electrophysiological data reported 
by Mountcastle et al. (1967) for mon- 
keys are applicable to man, then vi- 
brotactile stimuli of 10 Hz. and 200 
Hz. at 14 db. above the corresponding 
absolute thresholds would stimulate 
only the low-frequency or the higher 
frequency receptors, respectively. 


2 I am indebted to Charles Fry and 
Thomas Brozoski for their patient careful- 
ness. Brozoski also kindly served as E when 
I was O. 




These stimuli were used in two sets 
of measurements. In the first set, 
adaptation produced by 2, 4, or 8 min. 
of stimulation was measured and the 
course of recovery was followed for 
4 min. In the second set, similar mea- 
surements were made after 8 min. of 
cross adaptation between the two stim- 
uli, thresholds at 10 Hz. being mea- 
sured after 200-Hz. adaptation, and 
vice versa. 

Thresholds were measured by a 
modified method of limits, intensity in- 
creases being continuous rather than 
stepwise, and descending series being 
omitted to avoid the additional adapta- 
tion they would have caused. An 
average of 13 sec. of recovery occurred 
between the termination of adaptation 
and the actual reaching of the first 
threshold; the absolute values of adap- 
tation obtained are therefore somewhat 
too low, but from inspection of the re- 
covery functions obtained it did not 
appear that this source of error had 
differentially affected the 10-Hz. and 
200-Hz. thresholds, whose comparison 
was the point of the experiment. 

Fig. 1. Vibrotactile adaptation and recovery at 10 Hz. and at 200 Hz. on the index 
finger. (Dashed line shows approximate course of adaptation; solid lines show recovery 
after 120, 240, and 480 sec. of adaptation.) [Figure legend: o, 10-Hz. thresholds; 
x, 200-Hz. thresholds. Abscissa: elapsed time in seconds.] 

Fig. 2. Comparison of adaptation at 10 and 200 Hz. with cross adaptation by these two 
frequencies. (Upper two curves: recovery of thresholds for the same frequency as that 
of the adapting stimulus. Lower two curves: recovery of thresholds after the cross- 
adapting stimulus.) 

On the first day, measurements were 
made for each O of the 10-Hz. threshold 
adaptation and recovery produced 
by 2, 4, and 8 min. of the 10-Hz. 
adapting stimulus and by 8 min. of the 
cross-adapting 200-Hz. stimulus. The 
locus of stimulation of the fingerpad 
was changed at the end of each recovery. 
On the second day, analogous 
measurements were made of 200-Hz. 
threshold adaptation and recovery. 
The third and fourth days were repetitions 
of the first and second except 
that the temporal order of adapting- 
stimulus presentations was reversed. 
These daily alternations continued until 
there was a total of 16 observations for 
each data point of the eight adaptation- 
recovery functions. Each threshold 
measurement was expressed as its 
ratio to the unadapted threshold that 
had been measured thrice before each 
presentation of the adapting stimulus; 
the ratios were then averaged and converted 
to decibels with reference to the 
unadapted threshold. 
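
The ratio-to-decibel step can be sketched as follows. The text does not give the conversion formula; the sketch assumes that thresholds are vibration amplitudes (they were recorded as voltages calibrated against peak-to-peak displacement), so that the usual amplitude convention of 20 log10 of the ratio applies. The function name and the sample values are hypothetical.

```python
import math


def adaptation_db(adapted_thresholds, unadapted_threshold):
    """Mean adapted/unadapted amplitude ratio, expressed in decibels."""
    ratios = [t / unadapted_threshold for t in adapted_thresholds]
    mean_ratio = sum(ratios) / len(ratios)
    # Amplitude ratios convert to decibels as 20 * log10(ratio).
    return 20.0 * math.log10(mean_ratio)


# Hypothetical amplitudes in arbitrary units.
shift = adaptation_db([2.0, 2.0], 1.0)
print(round(shift, 2))
```

Averaging the ratios before taking the logarithm follows the order of operations given in the text; averaging decibel values instead would yield a geometric rather than an arithmetic mean of the ratios.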

In order to assess the effects of fac- 
tors such as the continued static pres- 
sure of the contactor and of the fixed 
position maintained by O during adap- 
tation and recovery, control series of 
threshold measurements were made 
under conditions like those of the adap- 
tation measurements except for the ab- 
sence of vibrotactile adapting stimuli. 
In the control series, 10-Hz. thres- 
holds rose at an average rate of 45 
db. per 100 sec., while 200-Hz. thres- 
holds fell at the same average rate. 
The adaptation and recovery data pre- 
sented here have been corrected by this 
factor. 

To facilitate comparison of adapta- 
tion and recovery at 10 Hz. and at 
200 Hz., both sets of data are plotted 
in Fig. 1. No marked differences are 
apparent between the two frequencies 
either in the amount or temporal course 
of adaptation and recovery. 

The data on cross adaptation are 
presented in Fig. 2, in which are plot- 
ted the recovery functions for 10-Hz. 
thresholds and 200-Hz. thresholds after 
each was preceded by the 8-min. 
adapting stimulus at the other fre- 
quency. For comparison, replotted 
from Fig. 1 is the recovery of 10-Hz. 
and 200-Hz. thresholds when each 
was preceded by the 8-min. adapting 
stimulus at the same frequency. No 
significant degree of cross adaptation 
is evident, although possibly some small 
amount may have occurred with the 
10-Hz. thresholds. 


Discussion 


The two adapting stimuli that were 
planned to affect different sets of recep- 
tors produced adaptation and recovery 
that were similar both in magnitude and 
in temporal course. Should this finding 
be taken at face value, or does it mean 
that the two stimuli were in fact affecting 
the same mechanism? 

The latter interpretation appears to be 
ruled out by the results of the cross- 
adaptation measurements. If the two 
stimuli had been producing adaptation by 
the same mechanism, then the cross- 
adaptation functions and direct-adaptation 
functions in Fig. 2 should be similar, and 
they are not. Therefore, it seems reason- 
able that the results mean what they ap- 
pear to, and that adaptation of the dif- 
ferent sets of receptors to vibrotactile 
stimulation, at the same levels above their 
respective thresholds, was similar in rate 
and amount. 

An interesting aspect of this conclusion 
stems from the fact that the psychophys- 
ical and electrophysiological evidence al- 
ready cited indicates that subcutaneous 
Pacinian corpuscles probably are mediat- 
ing threshold at the higher frequency, 
while receptors in the skin are responsible 
for the lower frequency thresholds. The 
simplest interpretation of the present ex- 
periment is that tactile adaptation to 
sinusoidal displacement was fairly inde- 
pendent of the mechanical properties of 
the tissues interposed between stimulus 
and receptors, as one might expect for 
sinusoids, so that the resulting adaptation 
functions primarily reflect sensory mech- 
anisms; and that in spite of the differ- 
ences in location and encapsulation of the 
different types of receptor, their adapta- 
tion and recovery are similar. This con- 
clusion, it may be noted, is not in dis- 
agreement with the quantitative analysis 
by Loewenstein and Skalak (1966) show- 
ing how the mechanical properties of 
the Pacinian corpuscle can account for 
its phasic response; their analysis deals 
with a much shorter time scale and with 
nonsinusoidal stimuli. 
NUMBER OR DURATION DISCRIMINATION FOR
AUDITORY PULSE TRAINS 


IRWIN POLLACK 
Mental Health Research Institute, University of Michigan 


Ss discriminated between pulse trains which differed in terms of the
number of pulses. Pulse rates between 10 and 2,000 pulses/sec, where
perceived number does not follow the presented number, were employed.
Over these rates, the relative discrimination of number was constant at
about 10% and was relatively independent of the internal structure of
the pulse trains. Since the perceived number of pulses is proportional
to the number of elapsed 1/10-sec. periods, or “moments,” and since
number discrimination is possible within 1 such elapsed period, it is
concluded that perceived duration, rather than perceived number, was the
basis of discrimination in the present tests.


The operations for the study of the 
discrimination of number and the dis- 
crimination of duration are well de- 
fined. One manipulates the number of 
discrete events and Ss make judgments 
with respect to the number of events. 
Or, one continuously manipulates the 
duration of a specified interval and Ss 
make judgments with respect to the 
duration of the interval. Perceptually, 
however, number and duration are 
hopelessly intertwined (Boring, 1933,
1942; Fraisse, 1963; Woodrow, 1951).

The present studies examined the 
accuracy of discrimination of pulse 
trains, which differed in their number 
of pulses, at rates between 10 and 2,000 
pulses/sec—at rates where Cheatham 
and White (1952, 1954), Taubman 
(1950a, 1950b), and Forsyth and 
Chapanis (1958) demonstrated that 
perceived number does not follow the 
presented number of flashes. In essence,
the author sought to determine
whether S can discriminate between
two pulse trains which differ in their
number of pulses but which S judges
to contain the same number of pulses.


GENERAL METHOD 


A Digital Equipment Corp. PDP-8 com- 
puter generated trains of pulses, executed 
the experimental tests, and terminated each 
test when an arbitrary criterion of accuracy
had been achieved. The two latter features
were accomplished by an adaptive stimulus 
programming procedure of variable step size 
—PEST or point estimation by sequential 
testing (Taylor & Creelman, 1967). 

A four-interval forced-choice (4IFC) test
was employed. Four pulse trains were presented
to S. Three of the trains, the
“standard” trains, employed the same number
of interpulse intervals, n_s; the other train,
the “variable” train, employed a larger number
of intervals, n_v. The Ss’ task was to
select whether the first, second, third, or
fourth pulse train had the larger n. Immediate
feedback was presented after each
trial. If S selected the correct interval,
PEST decreased the difference between n_v
and n_s; if S selected the incorrect interval,
PEST increased the difference between n_v
and n_s. PEST manipulated the difference
between n_v and n_s to converge on 50%
correct trials within the 4IFC procedure.
Thresholds were estimated to the nearest
.5 interval.
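
The feedback rule just described can be sketched as a simple up-down track. This is a simplified illustration only: it assumes a fixed step of .5 interval and a hypothetical floor of .5, whereas PEST itself varies the step size (Taylor & Creelman, 1967). The names next_difference and run_track are illustrative and are not taken from the original program.

```python
def next_difference(delta, correct, step=0.5, floor=0.5):
    """Return the next n_v - n_s difference after one 4IFC trial.

    A correct choice makes the task harder (smaller difference);
    an incorrect choice makes it easier (larger difference).
    The fixed step and the floor are assumptions of this sketch.
    """
    if correct:
        return max(floor, delta - step)
    return delta + step


def run_track(start, responses, step=0.5, floor=0.5):
    """Apply a sequence of correct/incorrect responses.

    Returns every difference visited, beginning with the starting value.
    """
    track = [start]
    for correct in responses:
        track.append(next_difference(track[-1], correct, step, floor))
    return track


if __name__ == "__main__":
    # Two correct choices, one error, one correct choice.
    print(run_track(2.0, [True, True, False, True]))
```

Because a correct response lowers the difference by the same amount that an error raises it, such a track tends to hover where the two outcomes are equally likely, that is, near 50% correct.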

The Ss were primarily university music 
majors with extensive previous experience in 
pulse-discrimination tests (5-10 hr/wk over 
2-5 semesters). The reported thresholds 
represent the geometric means of four PEST 
thresholds by each of five (Exp. IV) or nine 
(Exp. I, II, III) Ss. Brief pulses (10 μsec.)
were presented over binaural earphones 
(Koss PRO-4) to Ss at a comfortable 
listening level. 
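
As a brief illustration of the averaging step, the sketch below computes a geometric mean of several threshold estimates. The threshold values in the usage example are invented for illustration and are not data from the study; geometric_mean is a hypothetical helper name.

```python
import math


def geometric_mean(values):
    """Geometric mean of positive threshold estimates.

    Computed as exp(mean(log(x))), which averages on a log scale.
    """
    values = list(values)
    if not values or any(v <= 0 for v in values):
        raise ValueError("geometric mean needs at least one positive value")
    return math.exp(sum(math.log(v) for v in values) / len(values))


if __name__ == "__main__":
    # Four made-up PEST thresholds (in intervals) for one S.
    print(geometric_mean([2.0, 2.5, 3.0, 2.0]))
```

Averaging on a log scale is a natural choice when thresholds grow in proportion to the standard, since equal ratios then count equally.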


EXPERIMENT I: EFFECT OF AUDITORY
INTERVAL GRADIENTS 


The confounding of number and 
duration is especially marked in the 
discrimination of auditory pulse-interval
gradients (Pollack, 1968a). These
gradients are defined in terms of the
duration of the initial interpulse interval
(IPI_i), the duration of the terminal
interpulse interval (IPI_t), and the number
of transition intervals between the
initial and terminal intervals.


Method 


One can fix the duration of the initial and 
terminal interpulse intervals and covary the 
number of transition intervals along with the 
interval gradient. For example, with IPI_i
= 2 msec. and IPI_t = 3 msec., and with
nine intermediate transition intervals, the 
temporal gradient is defined by the train of 
interpulse intervals: 2.0, 2.1, 2.2, . . . , 2.9, 
3.0 msec. With four intermediate transition 
intervals, the train is 2.0, 2.2, 2.4, 2.6, 2.8, 
3.0 msec. With one intermediate interval, 
the train is 2.0, 2.5, 3.0 msec. 

Or, one can fix the number of transition 
intervals and covary the extreme intervals 
along with the interval gradient. For example,
with a center interpulse interval (IPI_c)
of 2.5 msec. and with nine intermediate
transition intervals, a transition gradient of
.1 msec/interval yields the pulse train: 2.0,
2.1, 2.2, . . . , 2.9, 3.0 msec. A transition
gradient of .2 msec/interval yields the pulse
train: 1.5, 1.7, 1.9, . . . , 3.3, 3.5 msec.
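
The two ways of constructing an interval gradient can be sketched as follows. This is a minimal illustration assuming linear gradients, as in the worked examples above; the function names are hypothetical and values are rounded to six decimals only to keep floating-point output tidy.

```python
def gradient_fixed_ends(ipi_initial, ipi_terminal, n_transitions):
    """First method: fix the initial and terminal intervals (msec.) and
    vary the number of intermediate transition intervals."""
    steps = n_transitions + 1
    size = (ipi_terminal - ipi_initial) / steps
    return [round(ipi_initial + k * size, 6) for k in range(steps + 1)]


def gradient_fixed_count(ipi_center, n_transitions, gradient):
    """Second method: fix the number of transition intervals and the
    center interval, and vary the gradient (msec/interval)."""
    half_span = (n_transitions + 1) / 2 * gradient
    start = ipi_center - half_span
    return [round(start + k * gradient, 6) for k in range(n_transitions + 2)]


if __name__ == "__main__":
    print(gradient_fixed_ends(2.0, 3.0, 4))    # 2.0, 2.2, ..., 3.0
    print(gradient_fixed_count(2.5, 9, 0.2))   # 1.5, 1.7, ..., 3.5
```

In the first construction the total duration, sum(train), changes with the number of transition intervals, which is the confounding of number and duration that the text discusses.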

In the first method, discrimination of the 
magnitude of the interval gradient is con- 
founded with the changes in the number of 
transition intervals and with changes in the 
total elapsed duration within the constraint 
of identical initial and terminal intervals. 
In the second method, discrimination of the 
magnitude of the interval gradient is con- 
founded with changes in the initial and ter- 
minal intervals within the constraint of the 
same number of intervals and the same 
elapsed duration. Since the auditory system 
is especially sensitive to interval, or pitch, 
differences, the first method appears to be 
a more useful procedure for studying audi- 
tory interval gradient discrimination. To 
this end, Exp. I attempted to determine how 
the accuracy of number discrimination is 
dependent upon the magnitude of the asso- 
ciated interval gradients. 


Results 


Figure 1 presents the results of Exp.
I. Within the interval ranges examined,
the threshold change in number
discrimination (Δn) was relatively
independent of the magnitude of the
interval range of the associated swept-interval
train, and was also relatively
independent of the center interpulse
interval, IPI_c.

Number discrimination in the ab- 
sence of an interval gradient, indicated 
by N, was only slightly poorer than 
the discrimination with added interval 
gradient differences (13 cases out of 
17 comparisons based upon the average 
Δn over 20, 40, and 80% jitter). The
magnitude of the difference, however, 
was so small—except for large n—that 
it is concluded that number (or dura- 
tion) discrimination is relatively inde- 
pendent of accompanying auditory in- 
terval gradients when the initial and 
terminal interpulse intervals are fixed. 


[Figure 1. Ordinate: threshold Δn (n_v − n_s). Abscissa: interval range relative to IPI_c, in %; N marks number discrimination without an interval gradient.]

Fig. 1. Auditory pulse number discrimination
with and without associated interpulse
interval gradients. (The range of interpulse
intervals spanned by the interval gradient,
expressed relative to its mean interval,
is plotted on the abscissa. The ordinate of
Fig. 1-5 is the dependent performance measure:
the difference in the number of intervals
between the reference standard signal
and the variable signal which yields 50%
correct discrimination within a four-interval
forced-choice test. The parameter on the
curves of Fig. 1-5, n_s, represents the number
of intervals in the reference standard.
The three sections of Fig. 1-3 represent
three interpulse intervals. The data plotted
above N in Fig. 1 represent number discrimination
tests without interval gradients.)
EXPERIMENT II: EFFECT OF
TEMPORAL IRREGULARITY 


The author next sought to determine 
whether the accuracy of number dis- 
crimination suffers as the intervals be- 
come more and more irregular. 


Method 


Temporal irregularity or jitter (J) was 
introduced into otherwise periodic pulse 
trains. Random selections were made from 
a uniform distribution between 0 and J. 
Each selection (j) was paired with its complement
(−j). A set of n/2 random selections
thus provided n numbers which were
scrambled and added to IPI_c. This operation
resulted in a train of interpulse intervals
with a fixed mean, IPI_c, and a variation
between IPI_c − J and IPI_c + J.
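
The jitter construction can be sketched as below. The sketch assumes that J is expressed in the same units as the interpulse interval and that n is even; the function name and the seeded generator in the usage example are illustrative choices, not part of the original procedure.

```python
import random


def jittered_intervals(ipi_center, jitter, n, rng=None):
    """Build n interpulse intervals with mean ipi_center.

    n/2 values j are drawn uniformly from [0, jitter]; each is paired
    with its complement -j, so the offsets sum to zero. The offsets are
    then scrambled and added to the center interval.
    """
    if n % 2:
        raise ValueError("n must be even so every draw has a complement")
    rng = rng or random.Random()
    draws = [rng.uniform(0.0, jitter) for _ in range(n // 2)]
    offsets = draws + [-j for j in draws]
    rng.shuffle(offsets)
    return [ipi_center + off for off in offsets]


if __name__ == "__main__":
    train = jittered_intervals(2.0, 0.8, 16, random.Random(1))
    print(sum(train) / len(train))  # mean stays at the center interval
```

Pairing each draw with its complement is what holds the mean interval, and hence the total duration of the train, fixed while the internal structure varies.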


Results 


Figure 2 presents the results of Exp. 
II. Number discrimination was rela- 
tively independent of jitter, suffering 
only at the highest jitter conditions 
and, then, primarily for a small num- 
ber of pulses at the two highest pulse 
frequencies. As in Fig. 1, number dis- 
crimination was relatively independent 
of the average interval. 

Fig. 2. Auditory pulse number discrimination
as a function of temporal jitter about
the mean interpulse interval.

The approximate relationship between
Δn and n for n = 16-256 is a
power function of slope 1, where Δn/n
= 9.4%. The correlation between log
n and the average value of log Δn at
the lower jitter levels was over .99
(.997) pooled over the three IPIs. The
discrimination threshold for n = 8 was
probably limited by the unit of measure
available to the program: a Δn
of 1 with extrapolation to the nearest
.5 interval.
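
A short worked sketch of the proportional relation may help. It assumes that the 9.4% ratio applies exactly and extrapolates it down to n = 8, which the text does not claim; the names below are hypothetical.

```python
WEBER_FRACTION = 0.094  # reported ratio of threshold change to n, n = 16-256


def predicted_threshold(n, fraction=WEBER_FRACTION):
    """Threshold change in number predicted by a slope-1 power function."""
    return fraction * n


def limited_by_unit(n, unit=1.0, fraction=WEBER_FRACTION):
    """True when the predicted threshold falls below one whole interval,
    the smallest change the program could present."""
    return predicted_threshold(n, fraction) < unit


if __name__ == "__main__":
    for n in (8, 16, 64, 256):
        print(n, round(predicted_threshold(n), 3), limited_by_unit(n))
```

Under this extrapolation the predicted threshold for n = 8 is about .75 interval, below a change of one whole interval, which is consistent with the suggestion that the n = 8 threshold was limited by the unit of measure.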


EXPERIMENT III: EFFECT OF
SUCCESSIVE REPETITIONS


The author next sought to determine
whether identical repetition of a pattern
of interpulse intervals might enhance
number discrimination.

If a given pattern of nonidentical in- 
terpulse intervals is successively re- 
peated, the periodicity of the repeated 
pattern may often be detected (Pol- 
lack, 1968d). The detection of the 
periodicity of repeated pulse patterns 
is especially notable at higher, rather 
than lower, pulse frequencies because 
the period of the repeated pattern is 
more likely to be within the auditory 
pitch range. With repeating patterns, 
S, therefore, has two opportunities to 
detect changes in number: one based 
upon An within each pattern, as con- 
sidered in the previous tests, and one 
based upon differences in the periodic- 
ity of the repeating pattern of intervals. 


Method 


Repeated interval patterns are defined in 
terms of the number of successive presenta- 
tions (P) of a given pattern of n interpulse 
intervals. For example, a pulse train of 256 
intervals may be produced by 1 presentation 
of a pattern of 256 intervals, by 16 presentations
of a pattern of 16 intervals, or by 256
presentations of a pattern of 1 interval.
With a Δn of 1, variable signals are obtained
of 1 presentation of a pattern of 257 intervals,
of 16 presentations of a pattern of 17 intervals,
or of 256 presentations of a pattern of
2 intervals.
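
The bookkeeping for repeated patterns can be sketched as follows. The helper names are hypothetical, and the sketch only counts intervals; it does not model how the patterns were generated in the original program.

```python
def repeated_train(pattern, presentations):
    """Concatenate a pattern of interpulse intervals P times."""
    return list(pattern) * presentations


def total_intervals(n, presentations, delta_n=0):
    """Total intervals in a train of P presentations of an
    (n + delta_n)-interval pattern."""
    return (n + delta_n) * presentations


if __name__ == "__main__":
    for n, p in ((256, 1), (16, 16), (1, 256)):
        standard = total_intervals(n, p)
        variable = total_intervals(n, p, delta_n=1)
        print(n, p, standard, variable)
```

With a within-pattern change of one interval, the overall difference between standard and variable trains grows with the number of presentations (257 vs. 256, 272 vs. 256, 512 vs. 256), and the period of the repeated pattern changes as well, which is the second cue discussed in the text.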


Results 


Figure 3 presents the results of Exp.
III. At high pulse frequencies or short
interpulse intervals (left section), a
substantial lowering of the threshold
Δn resulted from successive presentations.
At intermediate pulse frequencies
(middle section), there was substantially
less effect of successive
presentations, although there may have
been a lowering of the threshold Δn
with repeated short patterns (8 and
16 intervals long). At low pulse frequencies
(right section), there was an
apparent rise in the threshold Δn.

The fall in thresholds at short inter- 
pulse intervals is probably associated 
with the detection of periodicity of the 
repeated pattern. It is suspected that 
the rise in threshold at long interpulse 
intervals was due to their long duration.
With IPI_c = 8 msec., a pattern
of 8 intervals repeated 16 times took 
over 1 sec. In a 4IFC test, the time 
occupied by a single test trial, including 
the silent intervals between successive 
pulse trains, was about 5 sec. The 
overall effect of successive repetitions, 
however, was not substantial. 


EXPERIMENT IV: NUMBER DISCRIMINATION
AT LOWER PULSE RATES


Fig. 3. Auditory pulse number discrimination
as a function of the number of successive
presentations of a pattern of n_s
pulses. (The jitter level was 40% of the
mean interval.)

Cheatham and White (1952, 1954)
noted that the gain in estimated number
with presented flash number was
often “quantal,” i.e., an increase in the
presented number of flashes brought
little or no increase in the estimated
number of flashes until a sufficiently
larger number of flashes brought about
an increment of one in the estimated
number. Even with auditory pulse
trains, there is a great overlap in perceived
number as a function of presented
number. For example, Cheatham
and White (1954, Table 1) observed
that Ss reported 5 or fewer
pulses on 95% of trials when 8 auditory
pulses were presented, and on 91%
of trials when 9 auditory pulses were
presented at 30 pulses/sec. Over the
range of 10-30 auditory pulses/sec,
perceived number grows at the rate of
about 10 perceived pulses for each second
of elapsed duration. Stated differently,
a change of 3 pulses at 30
pulses/sec or a change of 6 pulses at
60 pulses/sec is required to achieve a
shift of one in the estimated number
of pulses.
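
The arithmetic behind the last statement can be made explicit. The sketch assumes that the rate of about 10 perceived pulses per second holds exactly; the constant and function name are illustrative.

```python
PERCEIVED_PER_SECOND = 10.0  # approximate growth of perceived number


def pulses_per_perceived_step(rate_pulses_per_sec):
    """Presented pulses needed to shift the estimated number by one."""
    return rate_pulses_per_sec / PERCEIVED_PER_SECOND


if __name__ == "__main__":
    for rate in (30, 60):
        print(rate, pulses_per_perceived_step(rate))
```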


Method 

The author sought to determine whether Ss
could discriminate changes in the number of
intervals between two pulse trains to which
Cheatham and White’s Ss assigned the same
estimated number. At first glance the question
appears superfluous: if Ss assign the same
estimated number to two pulse trains, they
cannot be expected to tell them apart. However,
at sufficiently fast rates, differences in
number may be coded into differences in apparent
loudness, pitch, or duration. Thus, the
fact that two pulse trains are assigned the
same number does not necessarily mean they
are indistinguishable.


Results 


Figures 4 and 5 present the results
of Exp. IV. Figure 4 examines the
effect of pulse repetition rate upon the
accuracy of number discrimination.
Little systematic effect of pulse repetition
rate was found. For the larger
pulse trains (filled points, n_s ≥ 24),
about a 10% change in number was required
across pulse repetition rates.
Approximately the same percentage
change was required in Fig. 1 and 2
for presentation rates up to 2,000
pulses/sec. For a small number of
pulses (open points, n_s ≤ 16), a smaller
range of Δn was required across pulse
repetition rates. Interpretation of this
finding is made difficult by the limitation
of Δn at .5. The temptation to
identify the break between 16 and 24
pulses in terms of a change in the mode
of processing the pulse trains, perhaps
from estimated number to estimated
duration, is weakened by the small effect
of presentation rate.

Fig. 4. Auditory pulse number discrimination
as a function of pulse repetition frequency
(lower abscissa) or of the equivalent
interpulse interval (upper abscissa).

Fig. 5. Auditory pulse number discrimination
as a function of jitter level. (The
upper section represents pulse trains with
64 intervals; the lower section with 24
intervals. The parameter is the pulse repetition
frequency.)

Figure 5 examines the effect of jitter 
upon number discrimination for refer- 
ence trains of 64 pulses (upper curves) 
and for reference trains of 24 pulses 
(lower curves); the parameter is the 
pulse presentation rate. Up to a jitter 
level of 30% there was little change in 
the number threshold. At higher jitter 
levels, number thresholds appeared to 
increase, but the effects of jitter were 
not consistent across conditions. 


Discussion 


The striking finding of the present 
study is the relative invariance of audi- 
tory number or duration discrimination 
thresholds with several variables which 
substantially altered the internal structure 
of the signals. The finding is in agree- 
ment with several reports that apparent 
duration is relatively independent of 
qualitative differences of the signals 
under evaluation (Goldstone & Goldfarb, 
1964; Jones & McFarland, 1967). 

The relative insensitivity of number or 
duration discrimination with respect to 
interpulse interval may be interpreted in 
terms of conceptual mechanisms for the 
processing of number or duration in- 
formation. Substantial differences in 
auditory functions, spanning several dec- 
ades of sensitivity, were obtained in favor 
of short interpulse intervals over long in- 
tervals in a wide range of auditory pitch 
discrimination tasks (Pollack, 1967a, 
1967b, 1968a, 1968b, 1968c, 1968d). For 
auditory pitch tasks, theorists have sug- 
gested that the auditory system employs 
a frequency spectrum analysis for high 
pulse frequencies or short interpulse 
intervals, and employs a temporal analysis 
for low pulse frequencies or long inter- 
pulse intervals. In contrast with pitch, 
the relative insensitivity of auditory num- 
ber or duration discrimination to inter- 
pulse interval suggests little need for 
postulating two conceptual systems for 
processing auditory number or duration
information. 




Finally, can the question which moti- 
vated the experiments be answered? Is 
number discrimination more closely re- 
lated to perceived number or perceived 
duration? The answer is made difficult 
by the nearly direct proportional relations
between Δn and n, so that changes of
number are directly related to changes of
duration. More specifically, Cheatham
and White (1954, Fig. 4) reported that
perceived number at high pulse rates was
directly proportional to the number of
elapsed 1/10-sec. periods. Such periods
have been identified with the psychological
“moment,” a unit of psychological
time, by Stroud (1955). Yet Ss can
discriminate a 10% change in n for pulse
trains of less than 1/10-sec. duration (n to
128 for IPI_c = .5 msec.; n to 32 for IPI_c
= 2 msec.; n to 8 for IPI_c = 8 msec.).
Moreover, their sensitivity is relatively 
invariant over presentation rates of 10- 
2,000 pulses/sec. The results imply that 
number discrimination is possible even 
for events which subtend the same psy- 
chological moment and, hence, for events 
which are assigned the same perceived 
number. One is thus forced to abandon
the hypothesis that number discrimination 
for high pulse rates is related to per- 
ceived number. Presumably, number dis- 
crimination is more closely related to 
perceived duration. 
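
The claim that these trains fall within a single “moment” can be checked with a line of arithmetic. The sketch assumes a moment of 100 msec. and treats train duration as simply n times the interpulse interval; the names are illustrative.

```python
MOMENT_MSEC = 100.0  # assumed length of one psychological moment


def train_duration_msec(n, ipi_msec):
    """Approximate duration of a train of n equal interpulse intervals."""
    return n * ipi_msec


if __name__ == "__main__":
    # (n, interpulse interval in msec.) pairs cited in the Discussion.
    for n, ipi in ((128, 0.5), (32, 2.0), (8, 8.0)):
        duration = train_duration_msec(n, ipi)
        print(n, ipi, duration, duration < MOMENT_MSEC)
```

Each of the three cited cases lasts 64 msec., which is shorter than the assumed 100-msec. moment.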
COMPARISON OF DIRECT AND VICARIOUS REINFORCEMENT
AND AN INVESTIGATION OF
METHODOLOGICAL VARIABLES* 


ROBERT E. PHILLIPS 
2 experiments compared the relative effectiveness of vicarious and
direct reinforcement and investigated the 2 major methodological 
differences between 2 studies differing in their ability to demonstrate 
vicarious reinforcement effects. The results indicated that, contrary 
to previous indications, direct reinforcement is more effective than 
vicarious reinforcement. Neither methodological difference proved to 
be crucial in determining vicarious reinforcement effects indicating a 
serious problem of replicability in vicarious reinforcement research. 


Vicarious reinforcement (VR) has 
been investigated in a systematic series 
of studies summarized by Marston 
(1966). In the first experiment of 
this series McBrearty, Marston, and 
Kanfer (1961) demonstrated that VR 
is an effective reinforcer, and subse- 
quently Kanfer and Marston (1963) 
and Marston and Kanfer (1963) at- 
tempted to compare the relative effec- 
tiveness of VR and direct reinforcement 
(DR). In both of these studies VR 
appeared to be effective in producing 
learning while DR did not. However, 
neither study used an experimental de- 
sign appropriate for this comparison. 
As Phillips (1968) has indicated, Mar- 
ston and Kanfer (1963) did not dem- 
onstrate that the experimental condi- 
tions were adequate to produce learn- 
ing by either VR or DR, and Kanfer 
and Marston (1963) compared grossly 
inequitable proportions of VR and DR. 
The present study consisted of two
experiments designed to provide valid
comparisons of VR and DR by including
controls for imitation (Phillips,
1968) and number of reinforcements.
In addition, both experiments explored
methodological problems in research on
VR.


1 This research was supported by a grant
from the graduate school of the University
of Minnesota. The author would like to
express appreciation to Scott Bentson, Henry
Korent, and David Petrie for their assistance
in the data collection.


EXPERIMENT I 


Experiment I was a 3 × 3 × 5 factorial
with the three levels of the first
factor consisting of the reinforcement 
treatments VR, DR, and NR (no rein- 
forcement). The NR treatment pro- 
vided the control for the imitation in 
the VR-DR comparison. 

The second factor was the instructional
variable with three treatments.
This was included to investigate what 
appeared to be a significant methodo- 
logical difference between Phillips 
(1968) and Exp. II of Marston 
(1966). Both experiments used basi- 
cally the same methodology except that 
Marston informed all VR Ss that they 
would be hearing E say “Good.” This 
was a change from the instructions 
used in all other VR research of this 
type (e.g., Kanfer & Marston, 1963) 
and may explain why Marston (1966) 
obtained VR effects and Phillips 
(1968) did not. The hypothesis that 
Ss who know that E will occasionally 
say “Good” will show greater learning 
was tested in Exp. I by comparing an 
instructional treatment containing mention
of “Good” with two control instructional
treatments.

The last factor in the design was 
blocks with the five levels correspond- 
ing to five blocks of 54 trials each. The 
relatively large number of trials was 
run to provide optimal conditions for 
finding reinforcement effects of VR 
and DR. 


Method 


Subjects—One hundred thirty-five under- 
graduate male volunteers served as Ss in 
return for extra points toward their grade 
in introductory psychology. Each treatment 
group was randomly assigned 15 Ss. 

Apparatus—The S sat at a table in a 
room by himself with a microphone and a 
small signal light placed in front of him. 
Following the initial instructions, E retired 
to an adjacent room and thereafter com- 
municated with S only via an intercom. 
Both S and E wore earphones. One channel 
of a stereo tape recorder was used to pre- 
sent recorded material (the model) to S; 
the other channel served as an amplifier for 
E's communications to S. 

Procedure—The Ss were told to say a 
word whenever their signal light flashed and 
that during part of the experiment they 
would respond alternately with a tape re- 
cording of another person performing the 
same task. For all Ss, E flashed the signal 
light 10 times at 10-sec. intervals without 
any intervening responses by the tape-re- 
corded S. This constituted a 10-trial base 
rate. Then followed 270 trials of alterna- 
tion with the taped model. The tapes were 
constructed so that in each of five blocks of 
54 trials each there was a progressively 
greater frequency of words in the critical 
response class, ie, vs, f, wo, vo, and fo. 
The noncritical response-class words were 
placed randomly within each block and were 
neutral filler words of many different classes. 

There were three reinforcement treat- 
ments: VR, DR, and NR. In the VR treat- 
ment each word in the critical response 
class said by the taped S was followed on 
the tape by E saying “Good.” In the DR 
treatment there were no “Goods” following 
the recorded S’s responses, but the real S 
was reinforced by E saying “Good” if he 
said a word in the critical response class. 
In the NR treatment neither recorded S 


nor real S received any reinforcement. The 
critical response class was human words 
(Kanfer & Marston, 1963). 

There were also three instructional treat- 
ments: C, G, and P. The basic control 
treatment (C) involved only the instructions 
to say words when the light flashed, which 
replicated the instructions used by Phillips 
(1968). The second treatment (G) added 
the statement “You will occasionally hear 
the experimenter say ‘Good’” to the instruc- 
tions given in Treatment C. This treatment 
replicates the “control” instructions used in 
Exp. II in Marston (1966). The third 
treatment (P) informed Ss that “One class 
of words will begin to occur more often 
than any other in the subject’s responses. 
Your task is to discover what that class of 
words is and to give words of that same 
class.” This instructional treatment was in- 
cluded to examine the effectiveness of the 
“problem” instructions used in Exp. II of 
Marston (1966) without the additional information
that E would say “Good.”


Results 


Analysis of variance indicated that
the main effects (reinforcement, instructions,
and blocks) were all significant,
F (2, 126) = 8.03, p < .01;
F (2, 126) = 4.21, p < .05; F (4, 504)
= 262.49, p < .01. The only significant
interactions were the reinforcement
and instruction effects across
blocks, F (8, 504) = 8.13, p < .01;
F (8, 504) = 6.55, p < .01.
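
The degrees of freedom reported above can be reproduced from the design. The sketch assumes a standard mixed factorial with the reinforcement and instruction factors between Ss and blocks within Ss, which is how the reported values read; the function name and dictionary keys are illustrative.

```python
def mixed_design_df(between_levels, n_per_cell, n_blocks):
    """Degrees of freedom for a mixed factorial design.

    between_levels: number of levels of each between-Ss factor
    n_per_cell:     Ss in each between-Ss treatment group
    n_blocks:       levels of the within-Ss blocks factor
    """
    cells = 1
    for levels in between_levels:
        cells *= levels
    error_between = cells * (n_per_cell - 1)
    df_blocks = n_blocks - 1
    return {
        "between_effects": [levels - 1 for levels in between_levels],
        "error_between": error_between,
        "blocks": df_blocks,
        "factor_by_blocks": [(levels - 1) * df_blocks for levels in between_levels],
        "error_within": error_between * df_blocks,
    }


if __name__ == "__main__":
    # Exp. I: 3 reinforcement x 3 instruction groups, 15 Ss each, 5 blocks.
    print(mixed_design_df((3, 3), 15, 5))
```

For Exp. I this gives 2 and 126 df for each between-Ss main effect, 4 and 504 for blocks, and 8 and 504 for each interaction with blocks, in agreement with the F ratios in the text.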

The Reinforcement effect and Reinforcement
× Blocks interaction are illustrated
in Fig. 1. It is apparent that
the DR treatment demonstrated the
greatest increase in performance, and
t tests confirm that the overall mean of
the DR treatment is significantly
greater than that of either the VR or
NR treatments, t = 4.24, p < .01; t
= 4.45, p < .01. The VR and NR
means did not differ significantly, t
= .20, p > .05.

Investigation of the instruction effect
indicated that the overall mean for
the P treatment (X̄ = 89.57, SD =
10.83) was significantly greater than
the means of either the G (X̄ = 73.71,
SD = 6.93) or the C (X̄ = 77.82, SD
= 8.07) treatments, t = 3.48, p < .01;
t = 2.45, p < .01. The G and C treatments
did not differ significantly, t =
1.09, p > .05.

Fig. 1. Performance of the reinforcement
treatments across blocks of trials in Exp.
I and II.


Discussion 


The results clearly indicate that, in this 
situation, DR is a more effective reinforcer
than VR, which is in sharp contrast
to the results of the studies of Kanfer
and Marston (1963) and Marston and
Kanfer (1963). The effectiveness of the
P instructions and the nonsignificant difference
between the G and C treatments
indicate that the hypothesis that the
mention of “Good” facilitates learning 
must be rejected. Therefore, including 
mention of “Good” in the instructions 
must not have been the crucial difference 
between the studies by Phillips (1968) 
and Marston (1966). 


EXPERIMENT II


Since Exp. I indicated that the instructions
were not the necessary conditions
for VR effects, it appeared that
the response class might determine the 
success in finding VR effects since this 
was the only other major difference 
between the Phillips (1968) study and 
Exp. II in Marston (1966). Both 
sexes were run as Ss to test the efficacy 
of reinforcement effects across sexes, a 
previously neglected variable in VR re- 
search. The design of Exp. II was a 
3 × 2 × 5 factorial with the three factors
being reinforcement (VR, DR,
and NR), sex, and blocks of trials. 


Method 


Subjects—The Ss were 48 male and 48 
female undergraduates who volunteered for 
the experiment under the same conditions as 
Exp. I. They were assigned in equal num- 
bers to the six treatments. 

Apparatus—The apparatus was the same 
as in Exp. I. 

Procedure——The procedure was the same 
as that used in Exp. I with the exceptions 
that all Ss received the P instructions in 
order to maximize the possibilities of rein- 
forcement effects, and the response class was 
weapons (Smith & Marston, 1965). 


Results 


In Exp. II analysis of variance revealed
that Reinforcement, Blocks, and
the Reinforcement × Blocks interaction
were the only significant effects, F (2,
90) = 10.66, p < .01; F (4, 360) =
95.54, p < .01; F (8, 360) = 7.66, p
< .01. As indicated in Fig. 1 the mean
for the DR treatment is again significantly
greater than the means of either
the VR or the NR treatments, t = 4.71,
p < .01; t = 4.61, p < .01, and the
means of the VR and NR treatments
do not differ significantly, t = .21, p
> .05. The nonsignificant VR effect
with weapons indicates that the response
class was probably not the crucial
difference between Phillips (1968)
and Marston (1966). Sex was not
found to be a significant factor, F (1,
90) < 1, p > .05, in the reinforcement
effects.
Discussion


From the results of the two experi- 
ments it is clear that DR is not only 
effective in acquisition but actually more 
effective than VR, and this result gener- 
alizes over at least two response classes. 
Marston (1966) has compared VR and 
DR during an extinction procedure and 
also found DR to be effective while VR 
was not. However, Marston (1966) concluded
that this result was only applicable
to extinction, since his previous studies 
had indicated that during acquisition VR 
was effective and DR was not. The 
results of the present study provide a 
more logical description of the VR-DR 
relationship by demonstrating that DR is 
more effective than VR during acquisi- 
tion as well as extinction. 

As Kanfer (1965) has indicated, VR
possesses the potential for providing in- 
sights into a major part of human learn- 
ing, but the lack of VR effects in the 
present experiments, despite concerted 
efforts to find them, raises questions about 
the utility of the present methodology for 
VR research. For example, the second 
experiment was explicitly designed to 
provide optimal conditions for reinforcement
effects (e.g., 135 reinforcements,
270 trials, and very explicit instructions) 
but found no VR effects, yet Marston 
(1966) has demonstrated VR effects with 
only 25 reinforcements in 50 trials using 
the basic ambiguous control conditions. 
Until the critical difference between the
present experiments and those by Marston 
and his colleagues can be found, the value 
of VR research must remain in doubt. 
4 experiments, each directing attention toward a different gestalt
principle, examined the effect of anchors upon perceptual organization. 
The 4 principles studied were proximity, similarity, good continuation, 
and closure. In the case of proximity a matrix of dots distinctly 
organized as rows (or columns), when used as an anchor, increased 
the frequency with which other more ambiguous matrices were per- 
ceived as columns (or rows). In the case of similarity a matrix 
of alternating black and white rows (or columns) increased the fre- 
quency with which more ambiguous patterns of gray patches were 
perceived as columns (or rows). In the case of good continuation 
a discontinuous linear pattern increased the frequency with which other 
similar patterns were judged to be continuous. And in the case of 
closure a circle containing a wide gap increased the frequency with 
which circles containing smaller gaps were judged to be complete. 


Helson’s (1947, 1948) concept of 
adaptation level (AL) was first identi- 
fied as a general theory of perception 
in the late forties. Since that time a 
wide variety of studies, most employ- 
ing the paradigm of the anchor experi- 
ment, has been offered as evidence of 
its versatility. But an examination of 
this literature (see Helson, 1967, chap- 
ter on perception) reveals that investi- 
gators for the most part have directed 
their attention to the commonly ex- 
ploited prothetic dimensions of sensory 
psychophysics. Exceptions are found 
in Helson’s (1938; Helson & Jeffers, 
1940) own studies of color conversion 
that first gave rise to the concept, Gib- 
son’s (Bergman & Gibson, 1959; Gib- 
son, 1933) demonstrations of negative 
aftereffect, and a few experiments on 

1 These experiments were conducted under 
Contract Nonr 3634(01) between Kansas 
State University and the Physiological Psy- 
chology Branch, Office of Naval Research. 
This report was prepared under Contract 
N00014-67-0163-0001 between Johns Hopkins 
University and the Physiological Psychology 
Branch, Office of Naval Research. 

2 Now on active military duty with the 
Office of Personnel Operations, Department 
of the Army, Washington, D. C. 


the judgment of shape done in this 
laboratory (Bevan & Pritchard, 1963; 
Pritchard & Bevan, 1966). 

The purpose of the experiments de- 
scribed in this paper was to confirm 
the generality of the adaptation-level 
interpretation of perception by demon- 
strating that the underlying mechanism 
(referencing of the response to a labile 
subjective norm) operates in a pecu- 
liarly perceptual setting. More specifi- 
cally, the research strategy involved 
seeking changes in the operation of 
selected organizing principles within 
the context of the anchor experiment. 
An earlier study concerned with per- 
ceived shape in a given orientation had 
offered encouragement (Turner & 
Bevan, 1964). The Ss shown a cube 
in a given orientation during a series 
of adaptation trials repeatedly reported 
the usually ambiguous Necker cube to 
be a cube in the opposite orientation. 
Accordingly, four gestalt principles 
(proximity, similarity, good continua- 
tion, and closure) were selected for 
investigation. The experimental ap- 
proach was essentially the same for all 
cases. A stimulus pattern was selected 
that embodied the principle under consideration 
and a psychophysical series 
was constructed that extended from one 
perceptual organization through an 
intermediate condition to its alternative. 
Independent groups were then tested: 
a control which received only the series, 
and one or more experimental groups 
which, in addition to the series, judged 
an extreme version of one of the alter- 
native organizations interpolated into 
the order of presentation. For each 
experiment it was predicted that the 
presence of the extreme stimulus (an- 
chor) would result in a shift of the 
indifference point (here equated with 
the adaptation level) away from the 
intermediate condition toward the por- 
tion of the series sharing the predomi- 
nant characteristic of this anchor stimu- 
lus. 


EXPERIMENT I: PROXIMITY 


According to the principle of prox- 
imity, “that form of grouping is most 
natural which involves the smallest 
interval [Wertheimer, 1938, p. 74].” 
For example, when in a matrix of dots 
the vertical separation between adja- 
cent dots is smaller than the horizontal, 
the pattern will be reported more fre- 
quently than not to be a set of columns. 
If the opposite relationship holds, the 
report will be of rows. If the separa- 
tion is equal in both dimensions, the 
pattern will be ambiguous with columns 
and rows each reported about 50% of 
the time. 


Method 


Subjects—Thirty female undergraduate 
students enrolled in general psychology at 
Kansas State University served as Ss. Ob- 
servations were collected in three groups of 
10 randomly assigned Ss each. 

Materials and procedure—Stimulus materials 
consisted of an 11-member psychophysical 
series. Each series member was a 
matrix of dots projected on a white beaded 




screen by means of a 2 × 2 in. slide projector 
with tachistoscopic shutter. The overall di- 
mensions of each matrix were the same 
having been prepared for photography by 
appropriate arrangements of 36 dots within 
an 8-in. square. All dots were .5 in. in 
diameter and in the middle-most series mem- 
ber were spaced 1 in. apart in both horizontal 
and vertical dimensions. In order to main- 
tain the overall shape and size of the matrix 
the positions of the corner dots were fixed. 
In the columnar patterns the positions of 
the dots in Columns 1 and 6 were held 
stationary and those in Columns 2, 3, 4, and 
5 were shifted horizontally. For example, 
in the matrix with a vertical to horizontal 
ratio of 1.11 the dots of Column 2 were 
moved .1 in. toward Column 1, those of 
Columns 3 and 4 each were moved .05 in. 
toward each other, and those of Column 5 
were moved .1 in. toward Column 6. These 
operations produced three pairs of columns 
with horizontal separation between members 
of each pair .9 that of the vertical. The 
same method was used to form the row 
patterns, mutatis mutandis. Ratios of vertical 
to horizontal separations in the columnar pat- 
terns were 1.11, 1.25, 1.44, 1.67, and 2.00, 
and in the row patterns, .90, .80, .70, .60, 
and .50. Ratios for the column and row 
anchors were, respectively, 10 and .10. 
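
The column shifts described above can be sketched in a few lines. This is only an illustration, assuming the 1-in. base spacing and six columns given in the text; the function name `column_positions` is hypothetical, and applying the same construction to ratios other than the 1.11 example is an assumption rather than something the report states.

```python
def column_positions(within_pair_sep, base_spacing=1.0, n_cols=6):
    """Return x positions (in.) of six dot columns grouped into three pairs.

    Columns 1 and 6 stay fixed; Columns 2-5 are shifted so that the members
    of each pair (1-2, 3-4, 5-6) lie `within_pair_sep` inches apart.
    """
    if n_cols != 6:
        raise ValueError("this sketch covers the six-column matrix only")
    width = base_spacing * (n_cols - 1)  # 5 in. between Columns 1 and 6
    mid = width / 2.0
    h = within_pair_sep
    return [0.0, h, mid - h / 2.0, mid + h / 2.0, width - h, width]


# Worked example from the text: horizontal separation .9 in., vertical 1 in.
positions = column_positions(0.9)
ratio = 1.0 / 0.9  # vertical-to-horizontal ratio, about 1.11
```

With a separation of .9 in., Column 2 lands .1 in. closer to Column 1, Columns 3 and 4 each move .05 in. toward each other, and Column 5 moves .1 in. toward Column 6, as in the description above.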
The projector was situated 15 ft. from the 
screen with Ss grouped in a semicircle 5 ft. 
in front of the projector. Individual pat- 
terns, which subtended a visual angle of 
approximately 3.75°, were shown for periods 
of .5 sec. with 5 sec. intervening between 
presentations. A total of 55 judgments per 
S were made by the control group in re- 
sponse to five independently randomized se- 
quences of the stimulus series. Each of the 
two experimental groups made 110 judg- 
ments per S since the anchor stimuli were 
presented without formal identification on 
alternate presentations. The Ss were asked 
to indicate whether each pattern was a set 
of rows or columns by striking through an 
R or C opposite each trial number on a 
data sheet provided by E. The relative frequency 
of R responses for each series member 
was calculated for each S and averaged 
for the several groups. These group averages 
were then plotted to yield Fig. 1. 
Curves, as is the case for all figures, were 
fitted graphically. The indifference points 
were read from the graphs as the ratio 
yielding 50% row responses. Statistical 
tests were performed on actual judgmental 
frequencies. 
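
The report reads indifference points from graphically fitted curves. As a rough numerical stand-in, the sketch below finds the ratio yielding 50% row responses by linear interpolation between adjacent series members. The function name `indifference_point` and the response percentages are hypothetical and serve only to illustrate the calculation; they are not data from the experiment.

```python
def indifference_point(ratios, pct_row, criterion=50.0):
    """Interpolate the stimulus ratio at which pct_row crosses the criterion.

    `ratios` and `pct_row` are parallel sequences ordered along the series.
    """
    points = list(zip(ratios, pct_row))
    for (r0, p0), (r1, p1) in zip(points, points[1:]):
        if p0 == criterion:
            return r0
        if (p0 - criterion) * (p1 - criterion) < 0:
            # Criterion lies strictly between two adjacent series members.
            frac = (criterion - p0) / (p1 - p0)
            return r0 + frac * (r1 - r0)
    if points[-1][1] == criterion:
        return points[-1][0]
    raise ValueError("responses never cross the criterion")


# Series ratios from the text, ordered from columnar to row patterns.
RATIOS = [2.00, 1.67, 1.44, 1.25, 1.11, 1.00, 0.90, 0.80, 0.70, 0.60, 0.50]
# Hypothetical group averages (percent row responses), for illustration only.
HYPOTHETICAL_PCT_ROW = [2, 5, 10, 20, 40, 60, 80, 90, 95, 98, 100]

ip = indifference_point(RATIOS, HYPOTHETICAL_PCT_ROW)
```

A graphically fitted ogive would generally give a slightly different value than straight-line interpolation, so this is a convenience rather than a reproduction of the original procedure.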
Results 


Inspection of Fig. 1 indicates that 
the interpolation of the anchor patterns 
into the series had a substantial effect 
upon the perceptual organization pro- 
duced by the factor of proximity. With 
the series alone (Group C) the indiffer- 
ence point was associated with a col- 
umnar pattern in which the vertical 
distance between adjacent dots in a 
pair was 6% greater than the horizon- 
tal. The interpolation of anchors on 
50% of the presentations produced a 
shift in the row judgments in the an- 
ticipated directions for Groups, F (2, 
27) = 15.57, p < .01. With the intro- 
duction of the column anchor (Group 
EC) the series members were seen 
more frequently as row patterns, and 
the condition of ambiguity was asso- 
ciated with a columnar array in which 
the separation within adjacent pairs 
was 19% greater than the horizontal. 
Dunnett’s test (Edwards, 1960) indicated, 
for α = .05 and df = 27, a critical 
difference of 1.88; X̄EC - X̄C = 


[Figure 1. Ordinate: percent row responses. Abscissa: ratio of vertical to horizontal distance, running from columns to rows.] 

FIG. 1. Relative frequency with which a 
matrix of dots was perceived to consist of a 
set of rows when the ratio of vertical to 
horizontal distances between pairs of adjacent 
dots varied from .50 to 2. (Data are 
for control, C, and column and row anchor, 
EC and ER, conditions.) 


2.20. Introduction of the row anchor 
(Group ER) had an opposite and more 
pronounced effect, the indifference 
point being identified with a pattern 
of rows in which the distance was only 
87% that of the horizontal. Dunnett’s 
test critical difference = 1.88; X̄C - 
X̄ER = 11.60. Finally, anchor effects, 
as expected, were greatest for the am- 
biguous stimuli for the interaction 
Groups X Stimuli, F (20, 270) = 8.59, 
p<.01. Of special interest was the 
fact that the presence of the anchor not 
only modified judgments of the series 
members but actually effected a per- 
ceptual organization for some stimuli 
opposite to that favored by proximity. 


EXPERIMENT II: SIMILARITY 


Wertheimer defines the factor of 
similarity as “the tendency of like parts 
to band together [1938, p. 75].” The 
common textbook example consists of 
a matrix of alternating horizontal ar- 
rays of open circles and dots invariably 
described by the observer as alternating 
rows of circles and dots rather than 
columns of alternating circles and dots. 
Because the construction of a psycho- 
physical series from such a pattern 
would mean changing spatial relations 
among the dots and thus the confound- 
ing of the influence of similarity by 
proximity, a different set of stimulus 
relationships was sought. This was 
found in varied lightness among adja- 
cent patches and a psychophysical series 
was constructed in which (a) the 
middle-most member consisted of a 
matrix of patches all of the same re- 
flectance (black), (b) five members 
consisted of alternating rows of black 
patches and, with increasing eccentric- 
ity in the series, patches which were 
progressively lighter shades of gray, 
and (c) five consisted of alternating 
columns of black and dark to light gray 
patches. The row anchor was con- 
structed by alternating horizontal rows 
of black and white patches, and the 
column anchor by a vertical arrange- 
ment of black and white columns of 
patches. 


Method 


Subjects—Forty-five male students of gen- 
eral psychology at Kansas State University 
served as Ss. As in Exp. I these were as- 
signed at random to three groups for testing. 

Materials and procedure—The stimuli 
were 6 × 6 matrices of 1.5-cm. square gray 
patches separated in both dimensions by 1 
cm. The background was white. Table 1 
gives the orientation of the array and the 
reflectances of adjacent pairs of patches in 
this array. Inspection indicates that the 
series, when described in terms of the factor 
of similarity, actually consisted of two over- 
lapping series of six members each with 
differences in reflectance being maximal at 
one end (Stimuli 1 and 11) and zero at 
the other (Stimulus 6). Six sets of slides 
were made of the series for use in the 
projector. On projection each matrix sub- 
tended a visual angle of 2.9°. Except for 
the nature of the psychophysical stimuli and 
the total number of Ss involved, the design 
and procedure of the experiment were exactly 
the same as the design and procedure of 
Exp. I. 
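
The two overlapping six-member series can be made concrete by computing the reflectance ratio for each stimulus from the values in Table 1. The sketch below is illustrative only; the dictionary layout and the name `reflectance_ratio` are assumptions introduced here, and the reflectance figures are copied from Table 1 in the units given there.

```python
BLACK = 2.9  # reflectance of the black patches (Table 1)

# Stimulus number -> (orientation of like-reflectance arrays, lighter patch).
SERIES = {
    1: ("vertical", 33.9),
    2: ("vertical", 23.2),
    3: ("vertical", 16.5),
    4: ("vertical", 9.9),
    5: ("vertical", 6.8),
    6: (None, 2.9),  # all patches black: no orientation defined
    7: ("horizontal", 6.8),
    8: ("horizontal", 9.9),
    9: ("horizontal", 16.5),
    10: ("horizontal", 23.2),
    11: ("horizontal", 33.9),
}
ANCHOR_LIGHT = 72.8  # white patches of both anchors


def reflectance_ratio(lighter, darker=BLACK):
    """Ratio of the lighter to the darker patch reflectance."""
    return lighter / darker


ratios = {n: reflectance_ratio(light) for n, (_, light) in SERIES.items()}
anchor_ratio = reflectance_ratio(ANCHOR_LIGHT)
```

The ratio is 1 at Stimulus 6 and rises symmetrically to about 11.7 at Stimuli 1 and 11, while the anchors lie well outside the series at roughly 25 to 1.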


Results 


Figure 2 graphically summarizes the 
results of Exp. II. Inspection immedi- 
ately indicates that the introduction of 
the anchors had a marked influence 
upon the judgment of the series mem- 
bers. Again the effect was one of 
perceptual contrast for Groups, F (2, 
42) = 95.47, p < .01. And again the 
influence was greatest on the more am- 
biguous stimuli, i.e., those adjacent to 
Stimulus 6 in the series, for the Groups 
X Stimuli interaction, F (20, 420) = 
14.90, p < .01. The control curve sug- 
gests that there is an overall tendency 
to report the matrix more frequently to 
be a series of columns than rows. This 
is enhanced by the interpolation on al- 
ternate trials of a readily distinguish- 
able row pattern; on the other hand, 
the introduction of the column anchor 
resulted in an increase in the tendency 
TABLE 1 

ORIENTATION OF ALTERNATE GRAY AND 
BLACK ARRAYS AND REFLECTANCE OF 
ADJACENT PATTERNS IN EACH ARRAY 

Stimulus   Orientation   Reflectance 
1          Vertical      2.9-33.9 
2          Vertical      2.9-23.2 
3          Vertical      2.9-16.5 
4          Vertical      2.9-9.9 
5          Vertical      2.9-6.8 
6                        2.9-2.9 
7          Horizontal    2.9-6.8 
8          Horizontal    2.9-9.9 
9          Horizontal    2.9-16.5 
10         Horizontal    2.9-23.2 
11         Horizontal    2.9-33.9 
Anchor     Vertical      2.9-72.8 
Anchor     Horizontal    2.9-72.8 


to report a row organization. Further- 
more, these anchors extend their influ- 
ence not only over arrays of the same 
orientation but those of the opposite 
orientation as well. That judgment 
reflects the interaction of both orienta- 
tion and lightness differential is seen 


[Figure 2. Abscissa: ratios of reflectance between patches, for vertical and horizontal orientations.] 

FIG. 2. Relative frequency with which a 
matrix of gray and black patches was perceived 
to consist of a set of rows when 
patches of like reflectance were arrayed in 
the same dimension (horizontal or vertical) 
and when alternate arrays differed in reflectance 
from no difference to ratios of 11.7 
to 1. (Data are for a control group, C, 
and anchor groups in which the anchor consisted 
of either alternating columns, EC, or 
rows, ER, of black and white patches. There 
is only one data point at Reflectance Ratio 
1 which has been plotted twice for the convenience 
of fitting double ogives to the data.) 
in the position of the indifference 
points. Finally, recognition that the 
stimulus series was in fact two overlapping 
series based on contrasting lightness 
in alternate arrays, combined with 
the observation that the control data 
varied little for the intermediate series 
members, strongly suggested a double 
ogive fit for the data of each group. On 
the other hand, the data, especially those 
of the experimental groups, will readily 
accommodate simple ogives and the 
determination of the indifference points 
has been based on this more parsi- 
monious assumption. 


EXPERIMENT III: GOOD CONTINUATION 


The factor of good continuation is 
more difficult to define than it is to 
illustrate. The phrase refers to the 
fact that lines or other stimuli that continue 
or complete a direction tend to 
be perceived as a single pattern more 
readily than those that represent dis- 
continuities or abrupt changes in direc- 
tion. The pattern chosen for Exp. III 
was a vertical line intersected by a 
diagonal line running at a 45° angle, 
left to right, top to bottom. The psy- 
chophysical series consisted of the stim- 
ulus just described and six other series 
members in which the left and right 
halves of the diagonal were vertically 
separated by distances of 1-6 mm. 
Two such series were constructed: one 
in which the left-hand segments were 
raised and the right-hand segments 
lowered to produce the separation, and 
one in which the opposite operations 
were performed. Anchor stimuli, one 
for each series, had separations of 10 
mm. 


Method 


Subjects—The Ss were 30 male under- 
graduates enrolled in basic military science 
courses at Kansas State, assigned at random 
to three independent groups and tested in- 
dividually. 


Materials and procedure—The stimulus 
materials were prepared in black on white 
cardboard for presentation in a tachistoscope 
(Gerbrands Model T-1C). On presentation 
the separation between segments of the di- 
agonal subtended, in the series members, 
visual angles of approximately 7, 13, 20, 26, 
32, and 39 min., and in the anchor, 64 min. 
Presentation times per stimulus were .05 sec. 
at intervals of 10 sec. Control Ss received 
the 13-member combined series randomly 
ordered five times with a different random 
order each time. The Ss in one experimental 
group (EL), in addition, received the anchor 
with the left-hand segment above the right 
on alternate trials; Ss in the other (ER) 
received the anchor with the right side of 
the diagonal above the left. Instructions 
were to report the condition of the diagonal 
(“continuous” or “discontinuous”) on each 
trial. The frequency of continuous judg- 
ments was computed for each series mem- 
ber and each S. 
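
The visual angles quoted above follow from the physical separations once a viewing distance is fixed. The report does not state that distance, so the sketch below uses an assumed value of 530 mm, chosen here only because it roughly reproduces the reported angles; both the constant and the function name `visual_angle_arcmin` are hypothetical.

```python
import math


def visual_angle_arcmin(size_mm, distance_mm):
    """Visual angle, in minutes of arc, of an extent viewed head-on."""
    return math.degrees(math.atan(size_mm / distance_mm)) * 60.0


# Assumed viewing distance; NOT given in the report.
ASSUMED_DISTANCE_MM = 530.0

# Separation (mm) -> visual angle (min.) as reported in the text.
REPORTED = {1: 7, 2: 13, 3: 20, 4: 26, 5: 32, 6: 39, 10: 64}

computed = {
    sep: visual_angle_arcmin(sep, ASSUMED_DISTANCE_MM) for sep in REPORTED
}
```

Under this assumption each computed angle falls within one minute of arc of the reported value, which suggests the reported figures are rounded from a viewing distance of roughly this order.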


Results 


Because the 13 psychophysical stim- 
uli were in fact two six-member series 
with a single overlapping member (the 
figure involving the continuous diag- 
onal), the data for the control and two 
experimental groups are graphically 
summarized as independent halves. 
Figure 3A summarizes results obtained 
with the stimuli in which the left-hand 
diagonal segment intersected the verti- 
cal at a position above the right-hand 
segment. Figure 3B includes data from 
the complementary series. Both figures 
indicate clearly that the interpolation of 
anchor stimuli containing relatively 
wide separations between segments re- 
sulted in an increase in the number of 
times the series members were judged 
to include a continuous diagonal. For 
Fig. 3A the Groups effects, F (2, 27) 
= 39.48, p< .01, and the Groups X 
Stimuli interaction, F (12, 162) = 
16.12, p < .01, were significant. For 
Fig. 3B the Groups effects, F (2, 27) 
= 13.51, p< .01, and the Groups X 
Stimuli interaction, F (12, 162) = 
6.13, p < .01, were significant. The 
effect thus was one of contrast, in line 
with expectation. Furthermore, the 
[Figure 3. Abscissa: separation in millimeters.] 

FIG. 3. Relative frequency with which left and right diagonal segments intersecting a 
vertical line were judged to form a continuous line. (The stimuli consisted of two overlapping 
series: one in which the left-hand segment intersected above the right; the other in 
which the opposite relation obtained. The results of the left-above series are presented in 
Fig. 3A; of the right-above series in Fig. 3B. Data in both graphs were obtained from a 
control group, C, and anchor groups with the left segment of the diagonal above the 
right, EL, and vice versa, ER.) 


effect appeared even when the spatial 
arrangement of the segments in the 
series was opposite to that in the 
anchor. For Fig. 3A Dunnett’s test 
indicated, for α = .05 and df = 27, a 
critical difference of 1.26 and X̄ER - 
X̄C = 6.7. For Fig. 3B the critical 
difference = 1.49 and X̄ER - X̄C = 9.8. 
The EL anchor was clearly more potent 
than the ER, tEL-ER (18) = 3.37, p 
< .01, when it shared with the series 
the left-above-right orientation. This 
result may be associated with the fact 
that such a spatial arrangement of seg- 
ments represents an instance of interpo- 
sition and emphasizes the discontinuity 
in the anchor. In the complementary 
condition, where neither interposition 
nor common orientation of series and 
anchor obtained, the superiority of 


EL over ER was not demonstrated, 
tEL-ER (18) = 1.4, p > .05. 

The operations of this experiment 
were assumed to reflect differences in 
simple discrimination. Yet one obvious 
fact about the data suggested strongly 
that the processes are most complex: 
the indifference points for all condi- 
tions, when converted to angular mea- 
sures of minimum separable, were 
many times larger than the crudest 
estimates reported in the literature on 
visual acuity. In the instance of ver- 
nier acuity, the technique of measure- 
ment most closely resembling the pres- 
ent task, the minimal detectable dis- 
placement was of the order of several 
seconds (Riggs, 1965, p. 326). Meridi- 
onal variations in acuity, differences in 
exposure time, state of dark adaptation, 
and luminance are among the variables 
that may account for the present results. 
Furthermore, it was not wholly clear 
that S was really seeing physically discontinuous 
segments as continuous as 
much as he might have been subtly 
changing his criteria for categorizing 
stimuli. 


EXPERIMENT IV: CLOSURE 


The principle of closure is related 
to the principle of good continuation. 
Strictly speaking, however, it is not a 
grouping factor but rather a special 
case of the law of Prägnanz and stands 
with regularity, symmetry, and sim- 
plicity as a property of good form. 
The purpose of this experiment was to 
examine the effect of an anchor upon 
the threshold for closure. The stimulus 
series was a set of nine circles, one with 
a complete contour and eight with interruptions, 
varying in width from .5 to 
4.0 mm. in .5-mm. steps. The anchor 
was constructed with an 8-mm. gap. 
The threshold was defined as the indif- 
ference point of judgment and the 
expectation was that the interpolation 
of the wide-gapped anchor on alternate 
trials would produce an upward shift 
in this value. The experimental strat- 
egy thus involved a simple two-group 
design. 


Method 


Subjects—The Ss were 20 male under- 
graduate students at Kansas State drawn 
from the basic military science course. They 
were assigned alternately to the control and 
experimental groups. 

Materials and procedure—The stimulus 
figures were drawn with a radius of 10 mm. 
in black india ink on medium-gray cards. 
The width of the contour was .5 mm. and 
each stimulus was prepared in four versions 
with gaps at 3, 6, 9, and 12 o'clock. The 
gaps in the stimulus series when drawn 
varied in .5-mm. steps; when presented in 
the Gerbrands tachistoscope, they subtended 
visual angles of 3-26 min. of arc. Each 
figure was shown twice in every orientation 
for a total of 72 presentations for the control 
group. Each presentation was for .05 sec. at 
intervals of 30 sec. Instructions were to 
report presence and position of a gap: e.g., 
“Gap at 6 o'clock.” An incorrectly located 
gap was scored as a response of “Closed.” 
Performance was summarized in terms of 
the frequency of closed responses for each 
series member. 
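
The trial count and the scoring rule just described can be sketched as follows. This is an illustrative reading of the procedure, not the original scoring protocol: the names `control_trials` and `score_response` are hypothetical, and treating any reported gap on the complete circle as a mislocated gap (hence "closed") is an assumption.

```python
from itertools import product

# One complete circle (0 mm) plus eight gaps from .5 to 4.0 mm in .5-mm steps.
GAP_WIDTHS_MM = [0.0] + [0.5 * k for k in range(1, 9)]
ORIENTATIONS = [3, 6, 9, 12]  # clock positions of the gap
REPEATS = 2  # each figure shown twice in every orientation


def control_trials():
    """List every (gap width, orientation) presentation for the control group."""
    return [
        (width, clock)
        for width, clock, _ in product(GAP_WIDTHS_MM, ORIENTATIONS, range(REPEATS))
    ]


def score_response(gap_width_mm, gap_position, reported_position):
    """Score one trial as 'open' or 'closed'.

    `reported_position` is a clock position, or None if S reported no gap.
    A gap reported at the wrong position counts as 'closed'.
    """
    has_gap = gap_width_mm > 0
    if has_gap and reported_position == gap_position:
        return "open"
    return "closed"
```

Nine figures in four orientations, each shown twice, give the 72 control presentations stated in the text.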


Results 


The results as indicated in Fig. 4 
were simple and straightforward and 
consistent with those of the earlier ex- 
periments in this series. Interpolating 
the anchor enhanced the frequency of 
closed responses with a significant in- 
crease in the size of the threshold gap 
from .40 to 1.65 mm. for Groups, F 
(1, 18) = 32.28, p < .01, and for the 
Groups X Stimulus interaction, F (8, 
144) = 15.25, p < .01. Here, as in the 
case of the experiment on good con- 
tinuation, the angular separation at 
threshold was, at best, many times more 
than clinically accepted standards of 
visual acuity. This may have been 
due partly to the low stimulus contrast 
(black on medium gray), but again the 
importance of other conditions of stimulation 
as well as the possibility of a 
shift in judgmental criteria in the 
presence of the contrasting anchor must 
be considered. 

FIG. 4. Relative frequency with which a 
series of incomplete circles were judged to 
be complete or closed. (Data are for control, 
C, and anchor, E, groups.) 


Discussion 


The aim of the present experiments was 
to examine the operation of several of 
the more familiar gestalt organizing prin- 
ciples within the paradigm of the adap- 
tation-level experiment. The outcomes 
were, without exception, successful, for 
the effect of interpolating anchor patterns 
within psychophysical series constructed 
to demonstrate the operation of these fac- 
tors was uniformly to change perceptual 
organization in ways dictated by this con- 
ceptual model. They stand, as Helson 
(1966, p. 173) has already noted, as “evi- 
dence that there are spatial adaptations, 
no less than chromatic adaptations, which 
are responsible for the way spatial pat- 
terns are formed.” A recent experiment 
by Rock and Brosgole (1964) on the 
factor of proximity also supports this con- 
clusion. The Ss were shown a pattern 
of dots with horizontal separations greater 
than vertical. When the figure was in the 
frontal plane, the response was of col- 
umns. As the figure was tilted out of the 
frontal plane, responses changed to rows 
at about 45°. However, vertical equaled 
horizontal separation in the optical pro- 
jection at about 41° so that the pattern 
was perceived as columnar when the law 
of proximity dictated rows. Helson 
(1966, p. 175) suggests that these results 
may be attributed to an order effect pro- 
duced by the initial grouping functioning 
as an anchor and refers to Bevan, Barker, 
and Pritchard’s (1963) demonstration 
that order effects may be explained by 
defining the effective stimulus as the dif- 
ference between the variables being 
judged and the AL computed as a 
weighted mean of the preceding stimuli 
and the last stimulus judged. 
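
The order-effect account summarized above can be illustrated with a small sketch in which the adaptation level (AL) is a weighted mean of the preceding stimuli and the last stimulus judged, and the effective stimulus is the difference between the judged value and that AL. The weight `w_last`, the use of an arithmetic rather than a logarithmic mean, and the function names are all assumptions made for illustration; the source specifies none of them.

```python
def adaptation_level(preceding, last, w_last=0.5):
    """Weighted mean of the preceding stimuli (pooled) and the last stimulus.

    `w_last` is a hypothetical weight on the last stimulus judged.
    """
    if not preceding:
        return float(last)
    prior_mean = sum(preceding) / len(preceding)
    return (1.0 - w_last) * prior_mean + w_last * last


def effective_stimulus(current, preceding, last, w_last=0.5):
    """Difference between the stimulus being judged and the current AL."""
    return current - adaptation_level(preceding, last, w_last)


# Illustrative values in arbitrary stimulus units.
al = adaptation_level([1.0, 2.0, 3.0], last=5.0)
eff = effective_stimulus(4.0, [1.0, 2.0, 3.0], last=5.0)
```

On this reading, a stimulus at the AL has an effective value of zero, and the same physical stimulus yields a different effective value whenever the preceding sequence shifts the AL.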

These data recall a controversy of an 
earlier period: the relative importance of 
native vs. experiential determinants of 

form perception. The same Gestalt theorists 
who brought the relational determination 
of responses to the fore in psychological 
thinking adopted an absolutistic 
view of the origin of perceptions. Because 
they coupled the empiricism of 
Classical Associationism with its elemen- 
talism, they rejected the former when 
they rejected the latter. The Gestaltists’ 
dialectic involved essentially anecdotal 
evidence (cf. Koffka, 1935, Ch. 3, pp. 
155, 217, etc.) along with heavy reliance 
upon the experiments of Gottschaldt 
(1938) dealing with the perception of 
masked forms. In these studies it was 
reported that simple figures, when masked, 
were not detected unless there was a 
strong set to search, and even under this 
condition success was relatively infre- 
quent. The response of experimenters 
challenging this view was to devise figures 
that after practice could be identified in 
the masked condition (cf. Braley, 1933; 
Djang, 1937; Hanawalt, 1942). None 
directly manipulated the operation of spe- 
cific grouping principles over time to 
yield responses organized in particular 
ways. Studies like the present experi- 
ments have relevance for the nativism- 
empiricism issue in perception because 
they identify a mechanism by which the 
operation of the spatial principles may be 
extended over time. Additional studies of 
the duration of these anchor effects therefore 
become imperative. 

Finally, these data, along with Turner 
and Bevan’s (1964) results on shifts in 
perceived orientation, have implications 
for the problem of anchor effects as per- 
ceptual change vs. change in response- 
category labels to accommodate an in- 
creased range of stimuli (Stevens, 1958). 
In the experiments on good continuation 
and closure, where degrees of closedness 
and continuity constitute logical dimen- 
sions, subtle changes in judgmental cri- 
teria constitute a reasonable alternative 
interpretation to a perceptual change as 
an explanation of the results. However, 
in the experiments on proximity and simi- 
larity, the column and row responses are 
logically mutually exclusive alternatives 
and the shift in the proportion of one 
over the other strongly suggests a basic 
perceptual change. Perhaps the most 
direct method for differentiating perceptual 
changes from semantic shifts involves 
the establishment of equivalence 
classes. This involves asking, e.g., if S 
would more likely identify two simul- 
taneously presented circles, one continu- 
ous and one gapped, as the same in 
the presence as compared to the absence 
of anchors. Consistent with this ap- 
proach, Egeth, Avant, and Bevan (1968) 
asked Ss to compare identical pairs of 
achromatic patches with each pair in the 
comparison appearing on contrasting 
backgrounds (black vs. white; black vs. 
gray; white vs. gray) and found that the 
apparent difference between members of 
a pair was greater when the pair and the 
background shared the same region of 
the lightness continuum than when they 
were drawn from different regions of 
the continuum. 
ARTICULATION AND ACOUSTIC CONFUSABILITY 
IN SHORT-TERM MEMORY 
Lists of letters varying in length and in acoustic confusability were 
presented for immediate probed recall. Presentation was either 
visual (with nonarticulation, silent articulation, or articulation aloud) 
or auditory (with nonarticulation or silent articulation). It was found 
that recent visual items which were articulated gave acoustic con- 
fusability effects intermediate between the heavy effects obtained 
when retrieval was ostensibly from an auditory afterecho and the 
negligible effects obtained when retrieval was ostensibly based on 
visual memory. These results suggest that articulation enhances the 
discriminability particularly of recent items in STM, and also that 
visual or auditory STM can be investigated independently of STM 


for speech-coded information. 


Immediate recall may be based on 
information stored as speech, or on a 
short-lived and more purely sensory 
kind of storage. The latter has often 
been called the “primary memory 
image”; the visual image probably dis- 
appears within a second or so (Sper- 
ling, 1960), while the auditory image 
may last for as long as 3 or 4 sec. 
(Mackworth, 1964). The latter phe- 
nomenon is very close to the notion of 
“primary memory” (Waugh & Nor- 
man, 1965), although for Waugh and 
Norman, primary memory also is con- 
sidered to include very recent items 
which may have been articulated at 
presentation. 

The purpose of the present work 
was to discover whether items osten- 
sibly retrieved from sensory storage 
were as susceptible to the effects of 
acoustic confusability (AC) as were 
items ostensibly retrieved from speech 
storage. It has been argued that items 
coded as speech may be stored aurally 
(e.g., Sperling, 1967), although Wick- 
elgren (1966) and Hintzman (1967) 
have shown that there probably are 
strong motor components in speech 
storage as well. At the same time, 
Murray (1967) has argued that one 
of the advantages of speech coding for 
immediate memory purposes is that, 
by appropriate intonation, extra motor 
discriminative cues may be supplied by 
S which facilitate retrieval of each 
item. If this were the case, then the 
following predictions might be made 
about the various effects of AC on 
the retrieval of items stored visually 
or aurally, as contrasted with those of 
items stored as speech (with its motor 
component): The accuracy of retrieval 
of items available in auditory image 
form should be influenced heavily by 
the level of AC in the sequence. The 
accuracy of retrieval of items stored as 
speech also should be influenced by 
list AC, but items highly confusable 
with each other should benefit partic- 
ularly from articulation as new motor 
cues will have been added to the auditory 
components. If, however, storage 
is of a kind involving neither auditory 
nor motor components, e.g., visual, 
then retrieval should not be influenced 
by the AC of the material. This last 
point already has received some sup- 
port from an earlier study (Murray, 
1967): if Ss emitted neutral sounds 
while seeing lists of letters, so that it 
became very difficult to articulate the 
letters, recall did not vary with AC. 

In the present experiment, therefore, 
immediate recall was examined for lists 
presented visually or aurally. As each 
item appeared, S either articulated it 
(silently or aloud), or was prevented 
from articulation. (It is assumed that 
if retrieval of items which were not ar- 
ticulated is possible, then that retrieval 
probably was based on visual or audi- 
tory memory.) Recall was measured 
by a probe technique in order to re- 
duce artifacts of intrarecall interference, 
particularly since so much interest is 
focused on the last items. As a means
of evaluating the effects of intralist in-
terference other than that due to AC,
list length also was varied (cf. Mur-
dock, 1968).


METHOD 


Design—The three variables of interest 
are: presentation conditions (PC), list 
length (L), and acoustic confusability (AC). 
Serial position (SP) constituted a fourth 
variable. The material consisted of lists of
6, 7, 8, or 9 letters. For each of these list 
lengths, the lists were classified as being of 
three levels of acoustic confusability: one 
sound (e.g., PBCTVG), two sounds (e.g.,
PXNTVS), or five sounds (e.g., PYQFOJ).
For the purpose of constructing the lists, the 
following letters were held to rhyme: 
BCDEGPTV; FLMNSX; AHJK; QU; 
WFO and R also were used in the five-
sounds lists. When vowels occurred in a
sequence, care was taken that they did not 
accidentally form words; for the case of the 
long one-sound lists, it was necessary to have 
at least one letter repeated. However, the 
probe itself was never a duplicated letter.

The five presentation conditions were as 
follows: 

Visual, suppression (VS). Lists were pre- 
sented visually, and as each letter appeared, 
S said “the.” This condition, called “sup-
pression” in Murray (1967), has been found
to be an effective way of reducing, if not 
eliminating, subvocal articulation at presen- 
tation. 

Auditory, suppression (AS). This con- 
dition was the same as for VS, except that 
the lists were presented aurally through 
headphones. 

Visual, rehearse (VR). Lists were pre-
sented visually and S was instructed to re-
hearse the items silently.

Auditory, rehearse (AR). This condi- 
tion was the same as for VR, except that 
the lists were presented aurally through 
headphones. 

Vocalize (Voc). Lists were presented 
visually and S spoke each item aloud as it 
appeared. 

For the experiment, 40 undergraduate Ss 
served in each of five groups, one group to 
each PC. Each S received initial practice 
on 5 lists, then worked through 78 lists. As 
each of these lists represented one of the 
78 L X AC X SP combinations, these vari-
ables are “within Ss.” The number of data
points for each of the L X AC X SP X PC
combinations, therefore, was 40. The lists
representing each L X AC X SP combination
were randomly intermingled so that Ss did 
not know in advance which L to expect, 
nor, of course, which SP would be probed. 
No mention was made to Ss of the AC 
factor. In order to counteract idiosyncrasies 
of list difficulty, four sets of 78 lists were 
devised, and within each group of 40 Ss, 10 
Ss received each of the sets. As a further 
precaution, the random ordering of the L X 
AC X SP combinations also differed between 
sets of lists. 
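
The count of 78 L X AC X SP combinations can be checked with a short sketch. It rests on an assumption that the text does not state outright: because S reported what followed the probe, the final item of a list was never used as a probe, so a list of L letters offers L - 1 probe positions. All names below are illustrative only.

```python
from itertools import product

LIST_LENGTHS = (6, 7, 8, 9)
AC_LEVELS = ("one sound", "two sounds", "five sounds")


def design_cells():
    """Enumerate the (L, AC, SP) cells of the within-Ss design.

    Assumption (not stated in the source): the last item of a list is
    never probed, so probe positions run from 1 to L - 1.
    """
    cells = []
    for length, ac in product(LIST_LENGTHS, AC_LEVELS):
        for sp in range(1, length):  # positions 1 .. L - 1
            cells.append((length, ac, sp))
    return cells


cells = design_cells()
n_cells = len(cells)                 # 3 * (5 + 6 + 7 + 8)

ss_per_group = 40                    # one group per presentation condition
n_sets = 4                           # four sets of lists
ss_per_set = ss_per_group // n_sets  # Ss receiving each set within a group

print(n_cells, ss_per_set)
```

Under this assumption each S contributes one observation to every cell, which agrees with the 40 data points per L X AC X SP X PC combination reported above.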

Procedure—For visual presentation, the
five practice lists and the 78 test lists were 
typed on a continuous roll of paper which 


2 If a person is saying “the” while listen-
ing to a sequence, there is a risk that he
will mishear some of the items. This risk 
was alleviated by having heavily muffled 
earphones so that perception of the air- 
borne sound of S’s own voice was reduced. 
Moreover, when research students who did 
not serve as Ss in the experiment simply 
copied what they heard while saying “the,” 
the number of acoustic confusions in copy- 
ing was only about 6%, many fewer than 
the number of acoustic confusions in recall. 
It is likely, therefore, that a few of the 
errors under this condition were due to 
mishearing but not enough to invalidate the 
findings under this condition. 




was fed through a memory drum (continuous 
movement). Just before each list appeared 
there was a red warning line; the probe also 
was printed in red. The presentation rate 
was two letters per second with the probe 
appearing about ł sec. after the final item. 
There was a recall interval of 10 sec. be-
tween lists and a short rest pause after the
first 34 lists. 

For auditory presentation, the lists were 
tape-recorded in a male voice; a warning, 
“now,” and the probe item were spoken in 
a woman’s voice. The lists were recorded 
by directly reading from the printed lists 
when they were put through the memory 
drum. 

On arrival, each S was given written in- 
structions which informed him of the nature 
of the probe task: He was to perceive a 
list of letters, then one of those letters 
would be repeated. The S was to write 
down (on prepared sheets) the items he 
thought had followed that item in the origi- 
nal list. The instructions were simply “to 
try to remember as much as possible” of the 
original lists. (It will be noted that S was 
not prevented from intralist rehearsal if he 
wished to carry it out—this almost certainly 
explains why the author obtained greater 
primacy effects than have been obtained by
others who have used the probe technique,
e.g., Norman, 1966.) The S was told ver-
bally about the kind of articulation he was
to carry out. 


RESULTS 


A general picture of the findings is 
given in Fig. 1 and 2. Figure 1 shows 
the probability of recall at each SP as 
a function of PC and L, summing 
across values of AC. It will be seen 
that there were marked primacy effects 
under many of the PC conditions, par- 
ticularly for short lists. Figure 2 
shows the probability of recall at each 
SP as a function of PC and AC, sum- 
ming across L. The “humps,” seen on 
Positions 4 or 5 from the end on sev- 
eral of the graphs, simply reflect the 
primacy effects of Fig. 1 and need not
be of further concern. No separate 
illustrations of AC X L data are given 
because this was not a significant in-
teraction in the analyses outlined below.


Rather than carry out an overall 
analysis of variance, which the graphs
indicate would yield interactions diffi- 
cult to interpret, it was decided first 
to analyze the data for the last four 
serial positions only. These are the 
items often taken to be retrieved from 
“primary memory.” The results of 
such an analysis carried out over all 
the data, and in which appropriate 
significant higher order interactions 
served as the denominators in deriving 
the F ratios for lower order interactions 
or main effects, showed that all four 
main effects were significant: for PC, 
F (4, 24) = 59,256, p < .01; for AC, 
F (2, 6) =9.76, p < .025 (the lower 
the acoustic confusability, i.e., five-
sound lists, the better the recall in
general) ; for L, F (3, 18) = 4.49, p 
< .025 (but it shall be shown that this 
effect appeared restricted to the cases 
of VS and AR); and for SP, F (3, 
6) = 50.51, p<.01. The only sig- 
nificant two-way interaction was AC 
X SP, F (6, 24) = 2.63, p < .05.
Figure 2 shows that this probably re- 
sulted mainly from the two conditions 
involving auditory presentation, AS 
and AR. In accordance with this ob- 
servation, the three-way interactions 
PC X AC X SP, F (24, 72) = 3.11, p
< .01, and L X AC X SP, F (18, 72)
= 2.31, p < .01, also were significant. 
Further justification for treating the 
above analysis as being particularly af- 
fected by the inclusion of the auditory 
conditions resulted from five separate 
analyses of variance carried out on the 
data within each presentation condition. 
List length gave significant variance 
ratios only on Conditions AR, F (3, 
18) = 4.39, p < .01, and VS, F (3, 
18) = 6.84, p<.01, and was con- 
cerned in no significant interactions 
within any of the five analyses. Other- 
wise, these analyses supported the con- 
clusions of the main analysis: there 
were highly significant effects of SP 
in each and the only two-way inter-
action of significance anywhere was
AC X SP: for VS, F (6, 18) = 3.29,
p < .05; for AS, F (6, 18) = 9.94, p 
< .001; for AR, F (6, 18) = 3.58, p 
<.05. This interaction was not sig- 
nificant in the cases of VR and Voc. 
However, one interesting new relation- 
ship emerged: the effects of AC were 
significant under all presentation con- 
ditions except VS (see Fig. 2). This 
result confirms a finding of Murray 
(1967) that the recall of nonarticulated 
visual lists is unaffected by the AC of 
the sequence. But, now, it is seen that 
the recall of the final items of nonartic- 
ulated auditory lists (AS) is affected 
by AC, F (2, 18) = 31.23, p < .001. 
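
The degrees of freedom reported for the main analysis are consistent with 5 presentation conditions, 3 AC levels, 4 list lengths, and the last 4 serial positions, with higher order interactions serving as error terms. The sketch below only reproduces that arithmetic; the helper name is hypothetical, and the pairing of each F ratio with a particular error term is an inference from the reported values rather than something the text spells out.

```python
from math import prod

# Number of levels per factor in the analysis of the last four positions.
LEVELS = {"PC": 5, "AC": 3, "L": 4, "SP": 4}


def interaction_df(*factors):
    """Degrees of freedom for a main effect or interaction.

    The df of an interaction is the product of (levels - 1) over the
    factors that enter it.
    """
    return prod(LEVELS[f] - 1 for f in factors)


df_ac_sp = interaction_df("AC", "SP")                 # numerator for AC X SP
df_pc_ac_sp = interaction_df("PC", "AC", "SP")        # numerator for PC X AC X SP
df_l_ac_sp = interaction_df("L", "AC", "SP")          # numerator for L X AC X SP
df_four_way = interaction_df("PC", "L", "AC", "SP")   # matches the denominator 72

print(df_ac_sp, df_pc_ac_sp, df_l_ac_sp, df_four_way)
```

On this reading, F (24, 72) and F (18, 72) both take the four-way interaction as denominator, and F (6, 24) for AC X SP takes the PC X AC X SP interaction.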

Turning to the early items, Fig. 1 
shows that primacy effects were exag- 
gerated on short lists, particularly when 
rehearsal was permitted (AR, VR, 
Voc). Figure 2 shows that the effects 
of AC also were marked on early items, 
but only when rehearsal was permitted 
(AR, VR, Voc). These effects could 
be statistically confirmed, but only by 
an ad hoc choice of which variables to 
collapse across each other in any se- 
lected analysis. To simplify matters, 
the author shall focus on the PC and 
AC variables. Analysis of variance on 
the proportions correctly recalled, col- 
lapsing across the first four SPs as 
compared with the last four, showed 
that AC did not interact significantly 
with early vs. late items. However, 
the interaction PC x AC was signifi- 
cant, F (8, 8) =5.60, p< .025, as 
might be expected given the anomalous 
trend shown by VS. In most cases, 
then, AC effects, observed to influence 
recall of the last items, also influenced 
recall of the early items. 


Discussion 


The main result of this experiment was 
the finding that AC inversely affected 
recall apparently based on auditory stor- 


age (AS) but not recall apparently based 
on visual storage (VS). This finding has 
at least two implications. First the “sup- 
pression” method, coupled with a probe 
technique, would appear to allow an in- 
vestigation of STM for sensory inputs 
relatively uncontaminated by rehearsal. 
Second, the errors found in the recall of 
nonarticulated verbal items directly re- 
flect the characteristics of storage. This 
does not necessarily mean that acoustic 
errors are the result of some kind of 
“assimilation” between traces in sensory 
store: the results of this experiment could 
have been obtained if S knew that most 
of the items seen or heard had a common 
vowel sound, and biased his responses 
accordingly. Indeed, it may be stated that 
most of the errors in the experiment were 
either acoustic or order errors: the num- 
ber of extralist intrusions which were 
not acoustically similar to one or more 
items in the sequence was very small. 
Moreover, the proportion of errors which 
were acoustic was higher in the case of 
two-sounds VS lists than in the case of 
five-sounds VS lists (.318 as opposed to 
.064); had the experiment dealt with 
visual storage exclusively, no difference 
would have been expected. It must be 
concluded that Ss are capable of retain- 
ing some information about the verbal 
qualities of visual items even though they 
are formally prevented from articulation.
But it is also plain, from a comparison 
of VS and AS, that there is a much 
greater risk of interference due to acous- 
tic similarity in the latter case. 

When articulation is permitted, the 
effects of AC are more moderate than 
they are when retrieval is from auditory 
storage. As was suggested in the intro- 
duction, this is consistent with the view 
that the addition of motor-articulatory 
cues serves to enhance the discriminability 
of the individual items, particularly the 
most recent ones of a list of high AC. 
Articulation at presentation, then, has 
the advantage that it particularly en- 
hances the recall of items which have just 
entered storage. It also enhances the 
recall of items early in the list (as com-
pared with nonarticulation). But in situ-
ations where all items are articulated
it might sometimes happen that the ac-
tions of responding to the late items pre-
vent rerehearsal of the earlier items.
Compare the recall of early items on AR
and VR: in the former case, S can re-
rehearse the early items and still take in
the last items and retrieve them from 
auditory storage. Thus, heavy primacy 
effects may be found when auditory lists 
are presented, even though the auditory 
image may outlast its visual counterpart. 
With visual lists, S must necessarily re- 
spond at presentation to the last items, 
with the result that he cannot fluently 
rerehearse the earlier items. The use 
made of sensory as opposed to speech 
storage, therefore, is adaptable to the 
circumstances (cf. Aaronson, 1967; Mur-
ray, 1965). However, the present results
do indicate that, despite this source of 
difficulty in experimentation, sensory 
storage can be investigated independently 
of speech storage, and an important dif- 
ference between the two might reside in 
the role played by motor activity in the 
latter kind of storage. 
ASSOCIATIVE UNLEARNING OF A-B FOLLOWING
A-C OR A-Br INTERPOLATION 1 


DENNIS J. DELPRATO AND BERTRAM E. GARSKOF


Michigan State University 


According to a current interpretation of associative unlearning, 1
potential source of unlearning is present in the A-B, A-C paradigm
(specific forward associates) and 2 sources are present in the A-B,
A-Br paradigm (specific forward and backward associates). However,
a recent study failed to provide conclusive evidence for differential
associative unlearning between these 2 paradigms. The present study
tested 3 groups on an A-B associative matching retention test
following either A-C or A-Br learning or irrelevant activity (con-
trol). Both work groups showed significant unlearning but Group
A-Br retained significantly fewer associations (p<.05) than Group
A-C. Thus, in this study, the prediction that A-Br learning would
lead to more associative unlearning than A-C learning was supported.


Using four transfer paradigms and two 
measures of retention, McGovern (1964) 
provided evidence for the extinction or un- 
learning of three types of first-list associa- 
tions as a result of interpolated learning 
(IL). It was concluded that specific for-
ward (S—>R) and backward (R—>S) as-
sociations and contextual-response (C—>R)
associations would be unlearned whenever 
a corresponding A-B, A-C relationship held 
between successive lists.  Contextual-re- 
sponse unlearning was assumed to be mea- 
sured by a free-recall technique while asso- 
ciative unlearning was assumed measured 
when the effects of C—>R unlearning were 
eliminated by providing S with first-list stim- 
uli and responses and requesting first-list 
matching (associative matching). 

An analysis of the A-B, A-C and A-B, 
A-Br (first-list terms re-paired) paradigms 
shows one potential source of associative un-
learning in the former (S—>R associa-
tions) and two potential sources in the latter
(both S—>R and R—>S associations). In
other words, in the A-B, A-Br paradigm a
different response must be learned to the
same List 1 stimulus in IL (as in A-B, A-C)
and also a different stimulus must be asso- 
ciated with the same List 1 response since 
first-list responses are again present. Thus, 
if the amount of (S—>R) unlearning is the 
same in both paradigms, associative matching 
following A-Br interpolation should be in- 
ferior to that following A-C interpolation. 


Since McGovern’s results failed to reach an


1 The research was supported by United States
Public Health Service Grant HD 02626. 


acceptable level of significance for the above 
comparison, it was felt worthwhile to further 
test this specific deduction from the theory. 

Method—A total of 48 Michigan State 
University undergraduates were assigned to 
one of three groups (control, A-C, or 
A-Br). 

Eight 53%-73% Glaze association-value 
nonsense-syllable stimuli were paired with 
eight two-syllable unrelated adjectives for 
all lists. Intralist stimulus similarity was 
kept minimal by using three vowels twice and 
failing to duplicate consonants. The first list
(A-B) was identical for all Ss. Two A-B 
pairings were used equally often within 
groups. Two pairings of List 2 (A-C or 
A-Br) were used with equal frequency. 
Four random orders of each list were con-
structed to minimize serial learning. Each
order was used as the starting order equally 
often. 

Both lists were learned by the anticipation 
method at a 2:2-sec. rate with a 4-sec. inter-
trial interval. All Ss learned the first list to
one perfect recitation. Approximately 1 
min. after reaching criterion, S began List 2 
learning or digit cancellation (controls). 
After 15 trials of interpolated learning or 
an equal amount of time spent on the ir- 
relevant task, instructions were read for 
retention and S was tested. Each S was 
given a booklet containing one stimulus per 
page along with a sheet on which the first- 
list responses were typed. The stimuli were 
arranged in the booklets in eight random 
orders and each order was used with ap-
proximately equal frequency. The S was
asked to go through the booklet page by page
and to write the appropriate response under 
each stimulus. The test was unpaced. 

Results and discussion—The means and 
standard deviations for trials to criterion on 
List 1 and for retention are presented in 
Table 1. There were no differences among 
groups in first-list learning (F < 1.0). 

A simple analysis of variance of matching 
test scores indicated a significant between- 
groups effect, F (2, 45) =11.85, p< .001. 
Individual comparisons were made using the 
Newman-Keuls procedure. Both Group 
A-C (p<.05) and Group A-Br (p<.01)


TABLE 1


TRIALS TO LIST 1 CRITERION AND ASSOCIATIVE MATCHING
(RETENTION) DATA FOR THE THREE GROUPS


                              Control    A-C     A-Br
Trials to List 1 criterion
    M                          12.06    11.88   11.25
    SD                          5.01     4.76    4.14
Associative matching
    M                                    6.44    5.31
    SD                          1.00     1.75    1.40

Note. n = 16 per group.
Journal of Experimental Psychology
1968, Vol. 78, No. 4, 686-689
differed significantly from the controls, indi-
cating unlearning in both groups. Of main
interest is the A-C vs. A-Br comparison 
which also was significant (p < .05). As a 
further check on the A-C vs. A-Br differ- 
ence, the somewhat less powerful Mann- 
Whitney U test was used yielding results 
consistent with the above analysis, U (n1 = n2
= 16) = 71.5, p < .05 (two-tailed).
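
As a rough plausibility check on the reported Mann-Whitney result, the sketch below applies the large-sample normal approximation to U = 71.5 with 16 Ss per group. This is not the procedure described in the text, which presumably relied on tabled critical values; the function names are illustrative, and no correction for ties or continuity is applied.

```python
from math import erf, sqrt


def mann_whitney_z(u, n1, n2):
    """z score for U under the normal approximation (no tie correction)."""
    mean_u = n1 * n2 / 2
    sd_u = sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    return (u - mean_u) / sd_u


def two_tailed_p(z):
    """Two-tailed probability from the standard normal distribution."""
    cdf = 0.5 * (1 + erf(abs(z) / sqrt(2)))
    return 2 * (1 - cdf)


z = mann_whitney_z(71.5, 16, 16)
p = two_tailed_p(z)
print(round(z, 2), round(p, 3))
```

The approximation gives a z of about -2.13, which falls below the .05 two-tailed criterion and so agrees with the significance level reported above.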

These results, thus, appear to support Mc- 
Govern’s (1964) analysis in terms of A-C 
and A-Br interpolation. That is, the fact 
that A-Br interpolation resulted in signifi- 
cantly more associative unlearning than A-C 
interpolation is interpreted as a result of 
the nature of the “extinctive” relationship
between successive forward and backward
associations. While identical stimuli in both
paradigms lead to the nonreinforced evoca- 
tion of forward associations, only the A-Br 
paradigm results in the additional nonrein-
forced elicitation of first-list backward asso-
ciations due to the presence of re-paired
responses. 
REVERSAL OF AN INSTRUMENTAL DISCRIMINATION BY
CLASSICAL DISCRIMINATIVE CONDITIONING 1


MILTON A. TRAPOLD, DOUGLAS M. GROSS, AND GEORGE W. LAWTON
University of Minnesota 


The purpose of this study was to determine whether learning the
reversal of an appetitive instrumental S+-S- discrimination would be
facilitated by interpolated classical discriminative conditioning on
the reversal (i.e., S- paired with reinforcement, S+ with nonreinforce-
ment). 2 groups of 6 rats were first trained on a discrete-trial
discriminative bar-pressing task in which responding in S+ delivered
food reinforcement, but responding in S- did not. Then Group E was
given classical conditioning on the reversal while Group C was not,
after which both groups were trained on the reversal instrumental
problem. Group E showed facilitated reversal performance in both
S+ and S- both at the outset of reversal training and over reversal
training sessions.


1 This research was supported in part by a National
Institute of Mental Health grant to the
first author, and in part by grants from National
Institutes of Child Health and Human Development,
National Science Foundation, and the Graduate
Research Fund of the University of Minnesota to the
Center for Research in Human Learning, University of
Minnesota.


Previous results (e.g., Trapold, Lawton,
Dick, & Gross, 1968; Trapold & Winokur,
1967) have shown that acquisition, both sim-
ple and discriminative, extinction, and stim-
ulus generalization of a discrete appetitive
instrumental response to specific stimuli can
be strongly manipulated by prior classical 
conditioning manipulations in which these 
specific stimuli are simply paired in the man- 
ner of classical conditioning with reinforce- 
ment (S?) or nonreinforcement (S**). In 
general, instrumental responses are acquired 
more rapidly and generalize more strongly 
to stimuli that have been paired with S*, and 
extinguish more rapidly and generalize less 
strongly to stimuli that have been paired 
with S~", 

The purpose of the present study was to 
extend the study of classical instrumental 
transfer of training to the case of discrimi- 
nation reversal. That is, are the processes 
responsible for the transfer effects so far 
observed strong enough so that classical con- 
ditioning on the reversal of an already 
learned instrumental discrimination will ef- 
fect reversal of the instrumental discrimina- 
tion? 

Method—The Ss were 12 naive male 
albino rats, 120 days old. The apparatus 
consisted of two identical operant condition- 
ing chambers, each of which contained a 
response bar, a food cup, a device for dis-
pensing .045-gm. Noyes food pellets, a house-
light, and a speaker. Each chamber was
housed in its own sound-attenuating en- 
closure, and was connected to programming 
and control equipment located in another 
room. 

The experiment can be divided into four 
phases: preliminary training, instrumental 
discrimination training, classical discrimina- 
tion-reversal training, and instrumental dis- 
crimination-reversal training. 

In preliminary training, all Ss were (a) 
reduced to 80% ad-lib body weight and main- 
tained there by a single controlled amount 
feeding following each daily session, (b) 
then assigned to one of the chambers and 
given three 30-min. adaptation sessions, (c) 
then given two sessions of magazine training 
during each of which 50 pellets were de- 
livered noncontingently on a 1-min. variable 
interval schedule (VI-1), and (d) then given 
one session during which each bar press, to 
a total of 100 responses, was reinforced by a 
single pellet. 

In instrumental discrimination training, 
each trial consisted of the onset of either a 
tone (2,000 cps) or a clicker (20 pps). On
tone trials the first bar press terminated the
tone and delivered a single pellet; on
clicker trials, the first bar press, or 60-sec.
elapsed time, whichever occurred first, ter-
minated the clicker but produced no rein-
forcement. The intertrial interval was
nominally VI-1. However, to this was added 
the condition that a trial would not begin 
until at least 20 sec. had elapsed without an 
intertrial response. This procedure very 
rapidly reduces intertrial responding to a 
very low level. The response measure was 
latency from trial onset. This phase, which 
was the same for all Ss, lasted for six ses- 
sions, each of which consisted of 24 tone and 
24 clicker trials irregularly intermixed. 
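
The trial contingencies just described can be summarized in a brief sketch. It is an illustration only: the function names are hypothetical, a seeded random shuffle stands in for the unspecified "irregular" intermixing, and the intertrial-interval rules are left out.

```python
import random


def make_session(n_each=24, seed=None):
    """Return one session's trial order: n_each tone and n_each clicker trials.

    Assumption: a random shuffle approximates "irregularly intermixed".
    """
    rng = random.Random(seed)
    trials = ["tone"] * n_each + ["clicker"] * n_each
    rng.shuffle(trials)
    return trials


def trial_outcome(stimulus, press_latency, clicker_limit=60.0):
    """Return (recorded latency in sec., pellets delivered) for one trial.

    Tone trials end with the first bar press and deliver one pellet.
    Clicker trials end with the first press or after clicker_limit sec.,
    whichever comes first, and deliver nothing.
    """
    if stimulus == "tone":
        return press_latency, 1
    if stimulus == "clicker":
        return min(press_latency, clicker_limit), 0
    raise ValueError(f"unknown stimulus: {stimulus!r}")


session = make_session(seed=0)
print(session.count("tone"), session.count("clicker"))
print(trial_outcome("tone", 2.5), trial_outcome("clicker", 75.0))
```

The cap on clicker latencies in this sketch corresponds to the 60-sec. trial limit, which places a ceiling on the latency that can be recorded on nonreinforced trials.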

The Ss were then divided into two groups
of six (three/group/chamber). Group E 
was subjected to classical discrimination 
reversal while Group C received no treat- 
ment but was merely weighed and fed 
daily. Classical reversal training consisted
of 10 sessions in each of which, with the 
bar removed from the chamber, 24 tone and 
24 clicker trials were presented irregularly 
intermixed on a VI-1 schedule. Each trial 
consisted of a 3-sec. presentation of the stim- 
ulus; a single pellet was delivered at the 
offset of all clicker presentations, but not 
tone presentations. 

Following classical reversal training, the 
bars were reinserted into the chambers and 
all Ss were given 12 sessions of instrumental 
discrimination-reversal training under ex- 
actly the conditions described for original 
discrimination learning except that responses 
were now reinforced on clicker trials, but 
not on tone trials. 

Results and discussion—Figure 1 shows 
performance during original discrimination 
learning and reversal learning. Each point 
(except those over Abscissa Point 0) is the 
group mean of the logarithms of each S's 
mean S+ or S- latency for a session (24
trials). The points over Abscissa Point 0 
are the mean logs of mean latency on the 
first six S+ and S- trials of reversal learn-
ing, and are presented to show the extent to
which classical reversal training produced 
an immediate reversal of instrumental per- 
formance, as distinct from facilitating re- 
versal learning over an extended series of 
instrumental reversal trials. 
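
The plotted measure, a group mean of the logarithms of each S's mean latency, can be sketched as follows. The base of the logarithm is not given in the text, so base 10 is assumed here, and the latencies are invented solely to show the computation.

```python
from math import log10
from statistics import mean


def session_point(latencies_by_subject):
    """Group mean of log10 of each S's mean latency for one session.

    latencies_by_subject holds one list of trial latencies (sec.) per S.
    Assumption: base-10 logarithms; the source says only "logarithms".
    """
    return mean(log10(mean(trials)) for trials in latencies_by_subject)


# Hypothetical latencies for two Ss, two trials each.
example = [[1.0, 1.0], [10.0, 10.0]]
point = session_point(example)

# For contrast: the log of the pooled mean latency is a different quantity.
pooled = log10(mean(t for trials in example for t in trials))
print(point, round(pooled, 2))
```

Taking the logarithm of each S's mean before averaging keeps a single slow S from dominating the group value, which is why the two quantities above differ.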

Considering first the initial six S+ and S-
trials, it is clear that Group C showed no
tendency toward reversal of the discrimina-
tion; S+ latencies were essentially the same
as at the end of original discrimination train-
ing, and S- latencies were somewhat longer.
This increase in S- latencies was probably
due to removal of the 60-sec. trial limit
which previously had imposed an arbitrary
ceiling on S- latencies.

Group E, on the other hand, showed both 
a marked decrease in S- latencies and a very 
large increase in S+ latencies on these initial
trials relative to performance to these stimuli 
at the end of original discrimination learning. 
A series of t tests, employing error esti-
mates based upon the last six S+ and S- trials
of original discrimination learning and the
first six S+ and S- trials of reversal learning
(Winer, 1962, p. 323), showed both S+ and
S- performance for Group E on the first six
trials of reversal learning to differ signifi- 
cantly (#=.05) from the corresponding 
performance of Group C, and from its own 
performance on the last six trials of 
original learning. 

Considering next the transfer evidenced 
over the further course of discrimination 
learning, it is clear that to S+ reversal was
greatly facilitated by the interpolated clas- 
sical reversal training; Group E achieved 
asymptotically slow responding in no more 
than three sessions with the bulk of the 
change occurring in the first session, whereas 
Group C took at least eight sessions to reach 
asymptotically slow S+ responding.

In S- the facilitation of reversal was 
much shorter lived than in S+, due largely to
the very rapid rate at which Group C de- 
veloped fast responding. Group E clearly 
responded faster than Group C on the first
session, and was slightly superior on the 
second, but from then on Group E was, if 
anything, slightly slower than Group C. 

These results show clearly that classical 
conditioning on the reversal of an already 
learned instrumental discrimination will 
facilitate subsequent instrumental discrimina- 
tion-reversal learning. Furthermore, the 
processes modified by virtue of the classical 
conditioning are strong enough so that not 
only was reversal learning facilitated in 
the savings sense, but also that a very sub- 
stantial amount of performance reversal 
was evident at the very outset of instru- 
mental reversal learning. In these respects 
these results also support those of Trapold 
(1966) who studied classical —> instrumental 
transfer of reversal learning with a very dif- 
ferent instrumental task and response mea- 
sure. 

Additionally, these results are consistent 
with most previous results (though not all; 
see Trapold et al., 1968) in showing that 
performance was both facilitated to the S* 
paired stimulus and interfered with the S“? 
paired stimulus. 
FACTORS AFFECTING LEARNING AND INTRUSION RATES IN A
MULTIPLE-CHOICE VERBAL TRANSFER TASK


IRWIN P. LEVIN AND JERAL R. WILLIAMS
University of Iowa


In each of 2 experiments preshift and postshift response dominance
level were varied and in each it was found that a higher mean number
of errors occurred when the response to be learned was Low in
dominance level than when it was High. Exp. I Ss were required
to give confidence ratings of their responses and it was found that
over 3 of all postshift errors were represented by overt intrusions but
that less than 50% of these intrusions were rated as correct. In Exp.
II 3 groups of Ss each received different information and instructions
regarding the response-shift operation. These groups did not differ
significantly on mean number of errors or intrusions.


The present paper presents an extension of 
the research reported by Levin and Williams 
(1968) who investigated learning and intru- 
sion rates in a multiple-choice verbal trans- 
fer task in which preshift and postshift re- 
sponse dominance level were varied within 
Ss. It was found that both preshift and post- 
shift performance levels were directly related 
to the dominance level of the response to 
be learned and that overt preshift intru- 
sions represented a major source of post- 
shift error, accounting for about 60% of all 
postshift errors. Two additional experi- 
ments are presented to further analyze the 
nature of an overt intrusion and its role as a 
source of interference in the multiple-choice 
transfer paradigm. In Exp. I, Ss were re- 
quired to give confidence ratings of their re- 
sponses so that the degree to which an in- 
trusion is rated as correct could be assessed. 
In Exp. II, S’s “set” to detect a shift in 
correct response was manipulated through 
instructional and informational variation. 

Method—The materials and procedures 
are described in more detail in Levin and 
Williams (1968). For each of 16 stimulus 
words four response alternatives were se- 
lected such that two of the chosen alterna- 
tives were the most frequently occurring 
(High dominant) responses on the Palermo
and Jenkins (1964) word association norms;
the other two alternatives were idiosyncratic
(Low dominant) responses and were AA 
Thorndike-Lorge (1944) words. After a 
given item was learned to a criterion of three 
consecutive correct responses, the correct re- 
sponse for that item was switched to one 
of the other alternatives. Within a 16-item 
list there were 4 items assigned to each 
of four possible transfer conditions, de- 
fined as High-High, High-Low, Low-High, 
and Low-Low, where the first term refers to 
the dominance level of the preshift response 
and the second term refers to the postshift 
dominance level. 
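
The four transfer conditions and the allotment of 4 items to each within a 16-item list can be illustrated with a short sketch. How items were actually assigned to conditions is not described here, so the random assignment below is an assumption, and the item labels are placeholders.

```python
import random
from itertools import product

DOMINANCE = ("High", "Low")
# Preshift level first, postshift level second.
CONDITIONS = [f"{pre}-{post}" for pre, post in product(DOMINANCE, repeat=2)]


def assign_items(items, seed=None):
    """Split the items evenly among the four transfer conditions.

    Assumption: assignment is random; the source gives only the counts.
    """
    rng = random.Random(seed)
    shuffled = list(items)
    rng.shuffle(shuffled)
    per_condition = len(shuffled) // len(CONDITIONS)
    return {
        cond: shuffled[i * per_condition:(i + 1) * per_condition]
        for i, cond in enumerate(CONDITIONS)
    }


items = [f"stimulus_{i:02d}" for i in range(1, 17)]  # placeholder labels
assignment = assign_items(items, seed=1)
for cond, members in assignment.items():
    print(cond, len(members))
```

Because both dominance levels occur before and after the shift for every S, preshift and postshift dominance are crossed within Ss, as the introduction notes.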

To fulfill a course requirement, 64 stu- 
dents participated as Ss in Exp. I. Eight 
males and eight females were randomly as- 
signed to each of four experimental groups. 
Forty-eight students served as Ss in Exp. II, 
with eight males and eight females assigned 
randomly to each of three experimental 
groups. 

In Exp. I the four experimental groups 
were defined as follows: 3-3, 3-2, 2-3, 2-2, 
where the first number refers to the presen- 
tation rate (in seconds) on study trials and 
the second number refers to both the re- 
sponse interval and the rating interval (in 
seconds) on test trials. The S was re- 
quired, on a given test trial, to indicate his 
response choice, and then, when a rating 
scale appeared, to say aloud one of the words
from the scale, “Correct,” “Uncertain,” or 
“Incorrect,” depending upon his judgment of 
the correctness of the response choice he 
just made. 

In Exp. II all groups were given 3-sec. 
presentation and response intervals; confi- 
dence ratings were not required of Ss. The 
Ss in different experimental groups were 
given different information regarding the re- 
sponse-shift operation. The Ss in Group A 
were given the “standard” instructions and 
thus were not forewarned that the correct 
response would be changed for a given item. 
In Group B Ss were told that the correct 
response for each item would be changed 
sometime during the course of the experi- 
ment. In Group C Ss were told that correct 
responses would be changed and were in- 
structed to signal by pressing a lever when 
they detected that the response designated 
by E as correct for a given item was differ- 
ent from what it had been on previous trials. 

Results—For Exp. I in the analysis of 
variance of mean number of preshift errors 
as a function of experimental conditions, 
preshift response type was a significant com-
ponent of variance, F (1, 180) = 29.11, p <
.01, with a higher mean number of errors oc-
curring on Low-dominant preshift responses
than on High-dominant preshift responses. 
Presentation rate was also a significant com- 
ponent, F (1, 60) =8.54, p < .01, with more 
errors occurring with 2-sec. than with 3-sec. 
presentation rates. More errors were com-
mitted with 2-sec. than with 3-sec. response
intervals but this effect did not reach statis-
tical significance.¹

The mean number of postshift errors per 
S, the percentage of postshift errors repre- 
sented by overt preshift intrusions, and the 
percentage of intrusions in each rating cate- 
gory are shown in Table 1 for each experi- 
mental condition. Analysis of variance re- 
vealed that postshift response type was a sig- 
nificant factor, F (1, 180) = 21.39, p<.01, 
with a higher mean number of errors occur- 
ring on Low-dominant postshift responses 
than on High-dominant postshift responses; 
neither preshift response type nor the inter-
action of preshift and postshift response
types was a significant source of variance.
Mean postshift errors were significantly
greater with 2-sec. presentation rates than
with 3-sec. rates, F (1, 160) = 9.57, p < .01,
but neither response interval nor the inter-
action of Presentation Rate × Response In-
terval was a significant factor (see Foot-
note 1).
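
The three measures tabulated for each condition can be illustrated with a minimal sketch for a single S. The record layout (`PostshiftTrial` and its field names) and the sample trials are hypothetical and are not taken from the study; only the definitions follow the text, namely that an error is any incorrect postshift response and an intrusion is an error on which S gave the formerly correct (preshift) response.

```python
from collections import Counter
from dataclasses import dataclass

RATINGS = ("Incorrect", "Uncertain", "Correct")


@dataclass
class PostshiftTrial:
    """One postshift test trial (hypothetical record layout)."""
    response: str          # alternative chosen by S
    correct: str           # alternative currently designated correct by E
    preshift_correct: str  # alternative that was correct before the shift
    rating: str            # confidence rating, one of RATINGS


def summarize(trials):
    """Return (number of errors, % of errors that are intrusions,
    % of intrusions falling in each rating category)."""
    errors = [t for t in trials if t.response != t.correct]
    intrusions = [t for t in errors if t.response == t.preshift_correct]
    pct_intrusions = 100.0 * len(intrusions) / len(errors) if errors else 0.0
    counts = Counter(t.rating for t in intrusions)
    by_rating = {
        r: (100.0 * counts[r] / len(intrusions) if intrusions else 0.0)
        for r in RATINGS
    }
    return len(errors), pct_intrusions, by_rating


# Illustrative data only: two intrusions, one other error, one correct response.
trials = [
    PostshiftTrial("table", "lamp", "table", "Uncertain"),
    PostshiftTrial("table", "lamp", "table", "Correct"),
    PostshiftTrial("desk", "lamp", "table", "Incorrect"),
    PostshiftTrial("lamp", "lamp", "table", "Correct"),
]
n_errors, pct_intrusions, by_rating = summarize(trials)
```

Averaging `n_errors` over Ss within a condition would give a mean of the kind reported in the table; how the authors pooled percentages across Ss is not stated, so that step is left out here.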

An inspection of Table 1 reveals that, for most experimental conditions,
overt preshift intrusions represented over one-half of all postshift errors.
Averaged over groups, the percentage of intrusions was higher for transfer
conditions High-High (68%) and Low-Low (62%), where the dominance level
is the same for pre- and postshift responses, than for transfer conditions
High-Low (50%) and Low-High (55%). For most experimental conditions,
somewhat less than one-half of the intrusions committed were given a
rating of “Correct.” That this is not an artifact of the rating method
employed is evidenced by the fact that virtually 100% of all responses
made on the preshift criterion trial immediately preceding the shift in
correct response were rated as “Correct.”

¹ Subsequent data collected in the laboratory indicate that when confidence
rating intervals are not included, significant differences between response
intervals of 2 and 3 sec. exist. The effect of response interval in the
present study may have been obscured due to the inclusion of rating
intervals if Ss learned to use these rating intervals for rehearsal and/or rest.


TABLE 1
MEAN NUMBER OF POSTSHIFT ERRORS, PERCENTAGE OF POSTSHIFT ERRORS REPRESENTED BY INTRUSIONS, AND
PERCENTAGE OF INTRUSIONS IN EACH RATING CATEGORY FOR EACH CONDITION IN EXP. I

                              Postshift errors   Percentage     Percentage of intrusions in each rating category
Group        Condition          M        SD      intrusions     Incorrect     Uncertain     Correct
3-3          High-High         1.38     1.58     [illegible]    [illegible]   [illegible]   [illegible]
[illegible]  Low-Low           3.44     2.50     69             [illegible]   32            30
2-2          High-High         3.19     2.74     69             17            57            26
2-2          High-Low          4.69     2.14     63             13            43            45
2-2          Low-High          2.69     2.31     61             12            42            46
2-2          Low-Low           4.25     2.91     56             16            47            37

[Entries for the remaining conditions are illegible in the source.]

Note.—Postshift error means based on total errors for the four items within each transfer condition.


In Exp. II the effects observed regarding 
response dominance level were virtually the 
same as observed in Exp. I. Only the domi- 
nance level of the response being learned was 
a significant factor in the number of pre- 
shift and postshift errors. The highest per- 
centage of overt preshift intrusions occurred 
in transfer conditions High-High and Low-
Low.

In the analysis of postshift performance 
as a function of instructional variation, the 
groups did not differ significantly from each 
other on mean number of postshift errors or 
on mean number of intrusions. For Group 
C, the response-shift operation was detected 
and signaled correctly on 89.4% of the items. 
For those items which were correctly sig- 
naled, the proportion of correct responses 
on the trial immediately following the re- 
sponse shift was .677 and the proportion of 
items on which an overt intrusion was com- 
mitted on this trial was .170. However, 
these proportions are not significantly dif- 
ferent from the corresponding proportions 
for the other two groups on the trial im- 
mediately following the response shift. 

Discussion—The results of the two ex-
periments reported here confirmed the find-
ings of the earlier study (Levin & Williams,
1968) regarding preshift and postshift re-
sponse dominance level effects, and extended
their generality. In each experiment a high
rate of overt preshift intrusions was ob- 
served. However, in Exp. I it was found 
that less than one-half of all intrusions were 
rated as correct, showing that in many in- 
stances when an intrusion was committed S 
either knew it to be an incorrect response or, 
more frequently, was uncertain of its correct- 
ness. In Exp. II it was found that the group 
of Ss who were forewarned of the response 
shift and instructed to signal it did not 
differ significantly on postshift error and in- 
trusion rates from the other groups. Thus, 
although some intrusions may have occurred
because S did not perceive the shift in cor- 
rect response and some intrusions may have 
occurred when S recognized that the pre- 
shift response was no longer correct but 
could not pick out the new correct response, 
results seemed to indicate that the majority 
of intrusions occurred when, during the time 
between the study and test trial presentations 
for a given item, S failed to recall or was 
uncertain as to whether the correct response 
had been shifted for that particular item. 


REFERENCES


Levin, I. P., & Williams, J. R. Effects of associative [illegible] in a
multiple-choice verbal transfer task. Journal of Experimental Psychology,
1968, 76, 530-537.

Palermo, D. S., & Jenkins, J. J. Word association norms. Minneapolis:
University of Minnesota Press, 1964.

Thorndike, E. L., & Lorge, I. The teacher's word book of 30,000 words.
New York: Teachers College, Columbia University, Bureau of Publications,
1944.

(Received October 5, 1967) 




Journal of Experimental Psychology 
1968, Vol. 78, No. 4, 692-693 


SUPPLEMENTARY REPORTS 


ADDITION OF CONTEXT CUES TO RESPONSE TERMS OF 
PAIRED-ASSOCIATE LISTS¹


SAM C. BROWN AND J. D. READ
Kansas State University 


80 Ss learned a list of 10 paired associates within a 2 × 2 × 2 factorial
design. The 3 variables were: (a) learning with or without differently
colored backgrounds surrounding the response (R) terms of the list;
(b) low or high formal intralist similarity between R terms; and
(c) recall or no recall of R terms after learning trials. Contrary to
a cognitive differentiation position, the addition of context cues failed
to have a significant effect on performance.


Characteristic of much recent research in 
verbal paired-associate (PA) learning has 
been the addition of context cues (e.g.,
colors) to the list. Previous investigations, 
however, have added context cues to the stim- 
ulus (S) terms of the pairs, primarily as a 
means of studying stimulus-selection pro- 
cesses. The purpose of the present study 
was to determine the effect of adding context 
cues to the response (R) terms of the pairs. 
It has been hypothesized that one likely effect
of stimulus context cues is to facilitate dif-
ferentiation among the S terms in the list
(e.g., Saltz, 1963). Thus, similar effects
should be expected for R terms. Also
manipulated was degree of formal intralist
similarity (i.e., amount of letter duplication
between R terms). This was done because
previous research has shown a greater use
of stimulus context cues with high than with 
low S-term similarity (e.g., Newman & Tay- 
lor, 1963). Since R-term context cues may 
have differential effects during the associative 
(S-R) and response learning (availability) 
stages of learning, it was deemed desirable 
to examine both stages simultaneously. This 
was accomplished by requiring R-term recall 
after every S-R learning trial following the 
technique of Jung (1965). 

Method.—Two lists of 10 nonoverlapping CVC pairs of 52-59% Archer (1960)
association value were used. One (List H) was taken from Rubin and Brown
(1967) and contained S terms of low formal intralist similarity, paired
with R terms of high formal similarity. The second (List L) used the
stimuli of List H, paired with a set of 10 low-similarity R terms. The 10
colored context cues, cut from construction paper, completely surrounded
the verbal unit.

¹ This investigation was supported by Public Health Service [remainder of
line illegible]. [The authors] wish to thank William F. Battig for
suggestions for improvement of the [illegible].

Eighty college students were randomly 
assigned to one of eight equal sized groups, 
representing all combinations of: (a) two 
lists (L and H); (b) two color conditions, 
color (C) and no color (NC) added to the 
R terms of the lists; and (c) two recall 
conditions, response recall (R) and no re- 
sponse recall (NR) after every PA learning 
trial. 
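
The assignment of Ss to the eight cells of the design can be sketched as follows. This is only an illustration under stated assumptions: the source says that assignment was random and that the groups were of equal size, but the shuffling scheme, the seed, the function name `assign_groups`, and the subject numbering below are hypothetical.

```python
import random
from itertools import product

LISTS = ("L", "H")    # low vs. high formal similarity among R terms
COLOR = ("C", "NC")   # color vs. no color added to the R terms
RECALL = ("R", "NR")  # R-term recall vs. no recall after each learning trial


def assign_groups(subject_ids, seed=0):
    """Randomly split subjects into equal-sized cells of the 2 x 2 x 2 design."""
    cells = list(product(LISTS, COLOR, RECALL))
    ids = list(subject_ids)
    if len(ids) % len(cells) != 0:
        raise ValueError("subjects cannot be divided equally among the cells")
    per_cell = len(ids) // len(cells)
    random.Random(seed).shuffle(ids)  # seeded so the illustration is repeatable
    return {
        cell: ids[i * per_cell:(i + 1) * per_cell]
        for i, cell in enumerate(cells)
    }


# Eighty hypothetical subject numbers give eight groups of 10 Ss each.
groups = assign_groups(range(1, 81))
```

Each key is a (list, color, recall) triple such as `("H", "C", "NR")`, so a group can be looked up directly by its condition labels.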

Learning was carried out under the pair- 
ing-test (recall) method to a criterion of 
one errorless trial or a maximum of 20 
trials. No reference to the colors was made 
either within the initial instructions or dur- 
ing the recall periods. The Ss were re- 
quired only to spell aloud the letters of the 
verbal R term on test series. The pairs 
were presented manually, each at a 4-sec. 
rate with a 45-sec. interval between pairing-
test series and trials. During the latter in-
terval, R-condition Ss listed as many
terms as they could recall. At the com- 
pletion of PA learning a terminal recall test 
also was administered to provide a measure 
of the relative associative strength of S-R 
and S-color associations within each C group. 
All Ss were given a sheet of paper contain-
ing the 10 S terms of the list and were in-
structed to write down the R terms, paired
correctly if possible. In the C condition, Ss 
also were asked to recall the background 
color of the R term by writing the first and 
second items remembered in the left- and 
right-hand column of the recall sheet, re- 
spectively. Before leaving the laboratory, Ss 
were tested for defects in color vision. 

Results and discussion—Table 1 presents 
the mean number of total S-R errors (fail- 
ures to respond correctly to S terms) for 
each group. The overall superiority of the 
L (56.1) over the H (96.5) list proved to be highly significant,
F (1, 72) = 23.19, p < .001. However, contrary to expectation, more errors
occurred under the C (80.4) than NC (72.2) condition, although the
difference was not significant (F < 1). Similarly, all other main effects
and interactions were nonsignificant (p’s > .20). Thus, the results
provide no evidence for a facilitation of S-R learning when colored
context cues are added to the R terms of the L or H list.

TABLE 1
MEAN TOTAL ERRORS FOR EACH GROUP

Recall                   Color                         No Color
condition      List L    List H      M       List L    List H      M
R               62.6      91.3      77.0      48.7      83.7      66.2
               (42.3)    (39.7)              (32.8)    (41.9)
NR              63.9     103.6      83.8      49.1     107.3      78.2
               (31.8)    (35.2)              (26.9)    (31.1)
M               63.3      97.5      80.4      48.9      95.5      72.2

Note.—SDs are in parentheses.

At the completion of each learning trial, R-condition Ss were asked to
recall the R terms of the list. The mean number of R-term errors (failures
of recall) for the four R groups was as follows: Group L-C, 58.8; L-NC,
47.1; H-C, 86.2; and H-NC, 70.3. Overall, there were significantly fewer
recall errors in the L (53.0) than H (78.3) list, F (1, 36) = 4.55,
p < .05, in agreement with the results of previous research (Jung, 1965).
However, no other source of variance approached significance (p’s > .20).

All groups also displayed fewer R-term (65.6) than S-R (71.6) errors,
leading to a significant main effect of Stages, F (1, 36) = 7.54, p < .01.
However, the Color × Stages interaction was nonsignificant (F < 1),
indicating that the magnitude of the difference between stages was
comparable under C and NC conditions.

All recall measures at the completion of PA learning showed better
retention for R terms than context cues. Specifically, under the C
condition there were significantly more R terms (9.0) recalled correctly
than color names (5.8), F (1, 36) = 46.76, and significantly more of these
R terms (8.8) were paired correctly with their S terms than were colors
(3.2), F (1, 36) = 143.36. All interactions were nonsignificant
(p’s > .20). The results for R-term recall under the NC condition were
essentially the same as those obtained above for the C condition.

Also tabulated was the number of times R terms and colors were paired
together correctly. There was a total of 122 such occurrences, and all
were matched correctly with S terms. Thus, of the total correct S
term-color recalls (129) virtually all (122) were accompanied by correct
R-term recall. Moreover, R-term recall preceded color recall in almost all
instances (114), as indicated by the order in which Ss listed these items
on the recall sheet. This suggests that S-R associations are relatively
stronger than S term-color associations.

Briefly summarized, the results of the 
present research fail to show a facilitative 
effect from context cues. If anything, the 
addition of colors to R terms served to 
slightly impede both response learning and 
S-R learning. Thus, it would appear that 
the cognitive differentiation effect proposed 
by Saltz (1963) is eliminated by shifting 
the locus of color from S to R terms. How- 
ever, it should be noted that since the present 
research did not require overt responding to 
context cues at any time during learning, Ss
may have ignored some or all of these cues.
The finding of significantly better R-term 
than color recall at the end of PA learning 
is consistent with this conclusion. Perhaps if 
Ss were required to respond to both items, 
a greater use of color might have occurred.


EFFECTS OF THE RESPONSE-SHOCK CONTINGENCY ON THE
FACILITATION OF DISCRIMINATION PERFORMANCE¹


JAMES H. McCROSKERY AND JOHN W. DONAHOE
University of Kentucky


Using 60 rats as Ss in a successive discrimination experiment employing
a bar-pressing task, a study was conducted to determine whether the
facilitation of performance by shock could be attributed to the
interaction of two well-documented functions of shock (its ability to
serve as a cue and as a punishment) without recourse to functions such
as secondary reinforcement. The performance of yoked (Y) groups was
contrasted with lead (L) groups (response-contingent shock) to separate
the cue and punishing functions of shock. Facilitation occurred
independent of the placement of shock in the positive stimulus (S+) or
the negative stimulus (S—) in the Y groups, but not in the L groups. No
evidence for an acquired reinforcing effect of shock was obtained. Thus,
shock mediated facilitation of discrimination performance through
serving as a cue which increased the difference between the functional
S+ and S—.


Using a bar-pressing task and a successive discrimination procedure,
previous research (Curlin & Donahoe, 1965) has shown that, at low shock
intensities, performance is facilitated by response-contingent shock in
either the positive stimulus (S+) or the negative (S—) whereas, at higher
shock intensities, performance is retarded by shock in S+ and facilitated
by shock in S—. Retardation of discrimination by shock in S+ was
attributed to the overriding of the cue effects of shock by its punishing
effects. In an effort to separate the cue and punishing effects of shock,
the present experiment employed two procedures for the delivery of shock.
In the lead (L) procedure in which shock was response contingent, shock
was expected to serve both as a punishment for the response which produced
it and as a cue for the subsequent response. In the yoked (Y) procedure in
which shock was delivered independent of the behavior of the Y animals
(i.e., whenever the L partners were shocked), shock was expected to serve
only as a cue for the subsequent response. Thus, the differential effect
of the placement of shock in S+ or S— on the magnitude of the facilitation
effect should be diminished with the Y procedure. Additionally, since food
follows shock only fortuitously with the Y procedure, the contribution of
any acquired reinforcing property of shock to the facilitation of
discrimination performance (Freeburne & Taylor, 1952) should be minimized.

¹ This investigation [illegible] 10706 from the United States Public Health
[illegible] to the second author. Acknowledgment is [illegible] the
Computing Center, University of Kentucky, [illegible] the use of its
facilities and to John J. B. Ayres for [illegible].

Method—Sixty male albino rats (Wistar strain) maintained at 80% of
free-feeding weight on a 22-hr. deprivation schedule served as Ss.

Two operant chambers similar in construction to those used in the earlier
experiment (Curlin & Donahoe, 1965) were used. One end wall of each
chamber contained a response bar with a pilot light mounted 3 in. above
the bar and a food cup 24 in. to the right of the bar. The intensity of
the pilot light was 72 ftc. during S+ and 18 ftc. during S—. Programming
equipment in an adjacent room controlled the experimental events.

During pretraining, all Ss received one session in which [illegible]
approach behavior was reinforced at random intervals and two sessions of
continuous reinforcement with fifty 45-mg. pellets being dispensed per
session. At the beginning of discrimination training, Ss were randomly
divided into five groups of 12 Ss each: four experimental groups and one
nonshocked control group (Group C). The four experimental groups formed
a 2 × 2 factorial design with the stimulus in which shock was placed
(S+ or S—) as one factor and the relationship of shock to responding
(L or Y) as the second factor.
(Hereafter, the four experimental groups
are referred to as S+L, S—L, S+Y, and
S—Y.) During S+, all responses were re- 
inforced; during S—, all responses were non- 
reinforced. Scrambled shock at an intensity 
of 55 v. (peak to peak) was administered 
through the grid floor in the appropriate 
conditions. Using four additional Ss, a
moderately aversive shock intensity was se-
lected which elicited 66% “flinch” and 27%
“jump” behavior using a procedure devised 
by Kimble (1955). There were 10 daily 
discrimination sessions which consisted of 15 
cycles of S+ and S— with 60 sec. of S+ 
and 60 sec. of S— during each cycle. Fol- 
lowing discrimination training, all Ss were 
given one session with 3 food-reinforced 
cycles and 20 extinction cycles. Shock per- 
sisted in extinction for the experimental 
groups to determine if the association of 
shock with food in the L groups had lessened 
the aversive effects of shock. Within each 
experimental group, half of the Ss received
L shock and half received Y shock in ex- 
tinction. 

Results and discussion—Discrimination 
performance during acquisition and extinc- 
tion was measured by the proportion of total 
responses occurring to S+ (S+ proportion) 
and by the rate of responding in S+ and S—. 
The first three cycles of each session were 
excluded from the analyses to prevent the 
confounding of intergroup differences with 
unequal warm-up rates (Curlin & Donahoe, 
1965). 
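
As a concrete illustration of the S+ proportion, the sketch below computes it for one session from per-cycle response counts, dropping the first three cycles as described above. The function name, the list-of-counts layout, and the sample numbers are hypothetical; the source does not say whether the proportion was computed per session or per block of sessions, so a single session is assumed here.

```python
WARMUP_CYCLES = 3  # the first three cycles of each session are excluded


def s_plus_proportion(s_plus_counts, s_minus_counts, warmup=WARMUP_CYCLES):
    """Proportion of all post-warm-up responses that occurred during S+."""
    if len(s_plus_counts) != len(s_minus_counts):
        raise ValueError("need one S+ count and one S- count per cycle")
    plus = sum(s_plus_counts[warmup:])
    minus = sum(s_minus_counts[warmup:])
    total = plus + minus
    if total == 0:
        raise ValueError("no responses recorded after the warm-up cycles")
    return plus / total


# Hypothetical response counts for one 15-cycle session.
s_plus = [20, 22, 25] + [30] * 12
s_minus = [18, 15, 12] + [10] * 12
proportion = s_plus_proportion(s_plus, s_minus)  # 360 / (360 + 120)
```

A value of .50 indicates no discrimination and a value of 1.00 indicates that all responding occurred during S+.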

The acquisition of the discrimination is 
shown in Fig. 1. As a set, the shocked ex- 
perimental groups discriminated above the 
level of Group C, F (1, 55) = 10.11, p < .01.
The S—L, S+Y, and S—Y groups per- 
formed above control levels throughout train- 
ing, p<.01, .01, and .05, respectively, and 
the S+L group was superior to Group C 
during the first eight sessions of training, p 
<.05. Considering the performance of Ss 
receiving shock contingent upon responding, 
the S+L group was inferior to the S—L 
group, F (1, 55) =8.49, p<.01. However, 
for those Ss receiving shock independent of 
responding, the performance of the S+Y 
and S—Y groups did not differ, F<1. 
The presence of an effect of shock place- 
ment in the L condition, but not in the Y 
condition, produced a significant interaction 
of placement (S+ or S—) with the L—Y 
contingencies both throughout all sessions 
and during Session 10 when performance 
had stabilized, F (1, 55) =5.68 and 10.45, p 
<.05 and .01, respectively. Consistent with 
the analysis of the S+ proportion, the differ- 
ence between S+ and S— rates was greater 
in the experimental groups as a set than 
in Group C, F (1, 55) = 11.44, p < .01. Gen- 
erally, facilitation resulted from both an in- 
crease in S+ rate and a decrease in S— 
rate relative to control levels. Within the 
L condition, the difference between S+ and 
S— rates was less for the S+L group than
for the S—L group, F (1, 55) =11.09, p 
<.01. Within the Y condition, the com- 
parable difference was nonsignificant, F < 1. 
The acquisition findings are, therefore, con- 
sistent with the following conclusions: (a) 
shock facilitated discrimination performance 
through serving as an additional cue corre- 
lated with reinforced responding, and (b) the 
Y procedure provided for comparable facili- 
tation by shock in S+ and S— through 
eliminating the punishment contingency which 
existed in the L procedure. The deleterious 
effect of the punishment contingency on per- 
formance of the S+L group was primarily 
produced by an increase in the rate of S— 
responding, a result interpreted in the earlier 
experiment as a displacement phenomenon 
(Miller, 1959). 

In extinction, no effect of the acquisition 
and extinction L-Y relationship on either 
the S+ or S— response rates was found
and, accordingly, Fig. 1 depicts the mean 
performance of all shocked experimental 
groups as contrasted with the no-shock con- 
trol group. The only statistically significant 
finding in extinction, aside from the persist- 
ence of the discrimination in all groups (p 
<.01) and the occurrence of extinction itself 
(p<.01), was that the rate of responding 
decreased more rapidly in the shocked ex-
perimental groups than in the control group,
F (4, 204) =3.50, p < .01. The presence of 
facilitation with the Y procedure in which 
shock was not consistently paired with food, 
together with the failure of shock-food pair- 
ings in the L procedure to reduce the aver- 
siveness of shock in extinction, indicates 
that secondary reinforcement mechanisms do 
not play an important role in the genesis of 
the facilitation effect (Fowler & Wischner,
1965). 


UNLEARNING OF LIST 1 WRONG ITEMS IN VERBAL
DISCRIMINATION TRANSFER¹


A. JOHN ESCHENBRENNER, JR. AND DONALD H. KAUSLER
Saint Louis University 


Unlearning of List 1 intrapair R-W associations (associations between
right, R, and wrong, W, items of a verbal discrimination, VD, list) was
found for the W₁-R₁, W₂-R₁ (identical R items in Lists 1 and 2 but
different W items) transfer paradigm. The results were attributed to the
A-B, A-C interference relationship generated by the R₁-W₁, R₁-W₂
associations of the two lists during List 2 practice. Unlearning of the
incidentally learned R₁-W₁ associations occurred even though intentional
learning (i.e., intrapair discriminations between W and R items) yielded
pronounced positive transfer.


Evidence for associative interference and unlearning in verbal
discrimination (VD) transfer was found by Kausler, Fulkerson, and
Eschenbrenner (1967) for a paradigm in which the wrong (W) and right (R)
items of Lists 1 and 2 enter into a W₁-R₁, W₁-R₂ (identical W items for
the two lists but different R items) relationship. The
interference-unlearning effect seemingly results from the fact that the
W₁-R₁, W₁-R₂ associative relationship is functionally equivalent to the
A-B, A-C relationship indigenous to paired-associate negative transfer.
When intrapair associative learning during VD practice is extensive, as is
apparently true when W and R items consist of highly meaningful words
(Kausler & Boka, 1968), associative interference is sufficient to reduce
markedly (Kausler et al., 1967) the positive transfer for discrimination
learning on W₁-R₂ lists predicted to occur, at least during the early
trials of List 2 practice, by contemporary frequency theory (Ekstrand,
Wallace, & Underwood, 1966).

¹ This study was supported in part by a National Aeronautics and Space
Administration [illegible] traineeship awarded to the first author.

A similar analysis of VD associative transfer in terms of
interference-unlearning effects may be made for the W₁-R₁, W₂-R₁
(identical R items for the two lists but different W items, hereafter
abbreviated W₂-R₁) paradigm. For this paradigm the intrapair R-W
associations of Lists 1 and 2 enter into an A-B, A-C relationship, and an
interference-unlearning effect might be expected for List 1 R₁-W₁
associations following VD practice on List 2 W₂-R₁ pairs. However, a
complicating factor for this expectation stems from the massive amount of
positive transfer reported on the intrapair discrimination task for W₂-R₁
pairs (Underwood, Jesse, & Ekstrand, 1964). Recognition of intrapair R
items as “right” items within W₂-R₁ pairs, presumably based on
differential frequencies of prior responses to W₂ and R₁ items as
discriminatory cues (Ekstrand et al., 1966), occurs very early during
List 2 practice and remains highly stable thereafter. Under such
conditions, responding to W₂ items may well be minimized, thus yielding a
pronounced reduction of associative interference via competing R₁-W₁ and
R₁-W₂ associations and an accompanying reduction of R₁-W₁ unlearning. The
present study investigated the generality of associative interference in
VD transfer when the interlist associative relationship engenders such
interference but the interlist discrimination relationship militates
against interference.

Method.—The 60 Ss, undergraduates naive 
with respect to prior participation in verbal 
learning experiments, were assigned alter- 
nately to five groups (n=12). Three of 
these groups, one experimental (Group E) 
and two control groups (Groups C1 and C2), 
served as the main conditions for the study. 
The other two groups (Groups C3 and C4) 
served as additional controls for various 
aspects of the study. Group E practiced the W₁-R₁ list (List 1) to a
criterion of one perfect trial (i.e., correct recognition of all intrapair
R items), followed by five trials on the W₂-R₁ list (List 2). Group C1
also practiced the W₁-R₁ list to a criterion of one perfect trial, but
then received five trials on a nonspecific W₂-R₂ list (i.e., new W items
and new R items) as List 2. This group served as a control for the
unlearning of List 1 W₁ items that may be associatively related to
contextual cues and may, therefore, be independent of specific interlist
relationships, and as a control for “output” interference (Kausler et
al., 1967). Group C2 practiced on the W₁-R₁ list to criterion and then
performed on the Meier Art Judgment test for a period of time equivalent
to that required for List 2 practice in the other conditions. This group
served as a control for the “normal” forgetting over time of R₁-W₁
associations that occurs independently of practice on a subsequent list.
Following practice on either List 2 or the irrelevant task, each S
received a modified free recall (MFR) test in which the R₁ items from
List 1 were listed randomly with a blank space next to each for S’s
written recall of the paired W₁ item. There was no time limit for recall,
and all Ss were instructed to guess when uncertain of an item pairing.

Group C3, like Group C2, learned only 
List 1; however, this group received the 
MER test immediately after the termination 
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of practice. This group was included to 
assess the amount of R-W associative learn- 
ing accruing during VD practice, with the 
measure being uncontaminated by interfer- 
ence effects of any kind during an inter- 
polated interval. Group C4 was equivalent 
to Group E in terms of the transfer par- 
adigm represented. However, the group dif- 
fered in that a modified, modified free re- 
call (MMFR) test (ie. recall of W items 
from both lists), rather than MFR, was given 
after List 2 practice. Although the evidence 
(Keppel, 1968) suggests that MFR and 
MMER tests give equivalent estimates of 
List 1 unlearning in paired-associate transfer, 
the possibility remains that competition be- 
tween Wı and Wa responses to R, stimuli 
may be great enough in the VD situation to 
depress significantly MFR measures of un- 
learning. Group C4 provided a check on this 
possibility. Moreover, Group C4 provided an 
estimate of the amount of List 2 Ri-Ws 
associative learning accompanying the ex- 
pected easy VD learning of Ws-R: pairs. 
Two different sets of meaningful words 
entered into the construction of 15 pair lists 
for the various conditions. These sets were 
identical to two of the sets employed by 
Kausler et al. (1967) and were found in the 
earlier study to yield lists of comparable 
difficulty. The specific lists within the sets 
were modified in the present study to con- 
form to the requirements of the Ws-R: con- 
dition where appropriate. Within each set, 
half of the Ss in Groups E and C4 received 
the lists in the order W:-R:, W:-R:, and 
the other half in the order Ws-R:, Wi-R:. 
Similarly, half of the Ss in Group C1 re- 
ceived the lists in the order Wi-Ri, Ws-R:, 
and the other half We-Rs, W:-R:; half of 
the Ss in Groups C2 and C3 received, as 
their only list, W:i-R:, the other half W2-Rs. 
Although this procedure does not assure per- 
fect balancing of lists across groups, the 
potential confounding seems negligible. The 
two sets and various orders of list presenta- 
tion were employed for the purposes of 
greater control over unique associations 
within W-R pairings as they might enter 
differentially into the various conditions, and 
for greater generalization of the findings 
related to associative unlearning. The ef- 
fects of sets of words and order of lists 
were found to be negligible with
regard to both VD learning and associative 
unlearning, and these variables were pooled 
for the analyses reported below. The pro- 
cedure of presenting the materials for VD 
practice was identical to the Kausler et al.
(1967) study, with the exception of employ-
ing four different serial orders across trials
rather than three.

Results and discussion—For List 1 learn-
ing (Groups E, C1, C4) and single list
learning (Groups C2, C3) the mean number 
of trials to criterion ranged 5.66-6.75, and 
all contrasts among groups were clearly not 
significant, p’s >.20. Thus the five groups 
were comparable in VD learning proficiency. 
Errors on List 2 confirmed previous findings 
of pronounced positive transfer on W2-R1
lists. The mean and standard deviation for
error scores were 2.25 and 1.37 for Group
E and 14.75 and 4.69 for Group C1, with
the error scores falling into two nonover-
lapping distributions. A median test con-
firmed that the two distributions differed
significantly, χ² (1) = 20.17 (the maximum
value with N = 24), p < .001.
Moreover, the error distribution of Group
C4 on the W2-R1 list was nearly identical
with that of Group E.
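
The chi-square value reported for the median test can be checked by hand. The sketch below is an illustration only, not the authors' computation. It assumes 12 Ss in each of the two groups (N = 24), a median split that puts every Group E score below and every Group C1 score above the combined median (as the nonoverlapping distributions imply), and a 2 x 2 chi-square with Yates' continuity correction. The function name is hypothetical.

```python
def yates_chi_square(a, b, c, d):
    """Chi-square for the 2 x 2 table [[a, b], [c, d]] with Yates' correction."""
    n = a + b + c + d
    numerator = n * (abs(a * d - b * c) - n / 2) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# Assumed table: rows are groups (E, C1); columns are below/above the median.
chi_sq = yates_chi_square(12, 0, 0, 12)
print(round(chi_sq, 2))
```

Under these assumptions the statistic agrees with the value given in the text, which is consistent with its description as the maximum obtainable with N = 24.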

Means and standard deviations for MFR 
scores were 2.58 and 3.23, 6.92 and 1.24, and 
9.25 and 2.26 for Groups E, C1, and C2,
respectively. These means are based on 
stringent scores in that S was required both 
to recall W items and to pair them correctly 
with R items; however, a lenient scoring 
system based on W recall without regard 
to pairing failed to alter the scores appre- 
ciably. The between-group effect was highly 
significant, F (2, 33) =24.09, p<.001, and 
all comparisons between individual means 
were also significant (Duncan's test; Group 
E vs. both Groups C1 and C2, p < .01; Group 
C1 vs. Group C2, p < .05). In general, these
results strongly support the hypothesis of
associative interference via competing R-W
associations in the W2-R1 paradigm, with
R1-W1 unlearning as a resultant. The sig-
nificantly lower mean for Group C1 than for
Group C2 suggests the presence of a non-
specific source of interference acting upon
R1-W1 recall. A likely mechanism for non-
specific interference stems from the A-B,
A-C relationship formed by contextual cues
as stimuli (A terms) and W1 and W2 items
as responses (B and C terms). The pres-
ence of a small, but statistically significant,
unlearning effect in Group C2, the control
for normal forgetting, is also apparent when
the amount of R1-W1 associative learning
measured immediately after attaining the VD
criterion is taken into consideration. The
mean and standard deviation for Group C3’s
MFR scores were 11.08 and 1.68, with t (22)
= 2.27, p < .05, for the Group C2 vs. Group
C3 comparison.
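
As a rough consistency check on the between-group F reported for the MFR scores, the ratio can be approximated from the published means and standard deviations. The sketch below is illustrative and rests on assumptions not stated in the text: 12 Ss per group (implied by the 33 within-group degrees of freedom) and standard deviations computed with n - 1 in the denominator. Because the published summary values are rounded, only approximate agreement with the reported F should be expected. The function name is hypothetical.

```python
def f_from_summary(means, sds, n):
    """One-way ANOVA F ratio from group means and SDs, equal n per group."""
    k = len(means)
    grand_mean = sum(means) / k
    # Between-groups mean square: n times the variance of the group means.
    ms_between = n * sum((m - grand_mean) ** 2 for m in means) / (k - 1)
    # Within-groups mean square: average of the group variances (equal n).
    ms_within = sum(sd ** 2 for sd in sds) / k
    return ms_between / ms_within, (k - 1, k * (n - 1))


# MFR means and SDs for Groups E, C1, and C2 as given in the text.
f_ratio, df = f_from_summary([2.58, 6.92, 9.25], [3.23, 1.24, 2.26], n=12)
print(round(f_ratio, 2), df)
```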
Finally, the MMFR scores obtained for
Group C4 contributed further insights into 
the nature of associative processes operative 
during VD transfer. The mean and standard 
deviation for List 1 MMFR scores were 2.50 
and 2.46, respectively; comparable values for 
List 2 were 6.00 and 3.88. The close agree-
ment between the mean MFR score for
Group E and the mean MMFR score for
Group C4, t (22) < 1, suggests that the
equality of the two procedures previously 
demonstrated for paired-associate recall 
(Keppel, 1968) generalizes to VD associative 
recall as well. The amount of List 2 as- 
sociative learning manifested in the MMFR 
data (6 out of 15 pairs, or 40%) is sur-
prisingly large in view of the easy recogni-
tion task confronting Ss in W2-R1 lists,
namely, recognizing R items repeated from
an already learned first list, and the rela-
tively few trials given on List 2. Current
analyses of VD learning processes (e.g.,
Kausler et al., 1967) stress that overall VD
learning may be viewed as consisting of 
simultaneously occurring intentional and in- 
cidental learning. Intentional learning, of 
course, consists of the recognition of R 
items within their W-R contexts, and is 
most likely to involve response frequency 
cues (Ekstrand et al., 1966). Incidental 
learning, on the other hand, consists of the 
acquisition of associations between intrapair 
W and R items, and is most likely to involve 
rote contiguity learning. Rote learning of 
associations apparently carries over to a 
transfer list, even when the new list elicits 
the rapid application of a readily discrimi- 
nable rule, a rule which permits S to avoid 
responding to the new W items. Acquisition 
of List 2 R1-W2 associations, in turn, gener-
ates the interlist associative relationship re-
sponsible for associative interference and the
unlearning of R1-W1 associations.


EFFECTS OF WAVELENGTH AND RETINAL LOCUS ON THE REACTION
TIME TO ONSET AND OFFSET STIMULATION


NEIL R. BARTLETT
University of Arizona


THOMAS G. STICHT
Human Resources Research Organization, Monterey, California


AND VICTOR P. PEASE
St. Lawrence University


This study measured onset and offset reaction time (RT) to a long
and a short wave band in both the fovea and periphery. The wave
bands were selected with the intention of isolating foveal and peripheral
systems. RT to onset stimulation on the periphery is significantly
faster than to offset. The difference between onset and offset RT
in the fovea was not significant. On-off differences are independent
of wavelength. Moreover, because a common function for intensity,
independent of wavelength, existed in the periphery as well as in the
fovea only when correction was made in the peripheral data for the
peripheral luminosity function, it appears that those data, with their
on-off differences, reflect the activity of the peripheral (rod) system.

In an earlier study (Pease & Sticht, 1965), 
reaction time (RT) was measured as a 
function of the onset or offset of visual 
stimuli covering a wide range of intensities 
and presented either foveally or peripherally 
in the temporal retina. For stimuli pre- 
sented to the periphery the RT was sig- 
nificantly longer for the offset of the stim- 
ulus than for the onset. There was a further 
suggestion that, with near-threshold flashes, 
RTs for the fovea were a little faster for 
offset than for onset. 

In seeking a possible mechanism for the 
foveal and peripheral differences in onset 
and offset reactions, the authors have deter- 
mined RT differences to onset and offset 
stimuli for different wave bands and inten- 
sities. Such manipulation should establish 
whether the peripheral and foveal responses 
are mediated by a single or by separate 
receptor mechanisms. Further, studies (Rat- 
liff, 1962; Wolbarsht, Wagner, & Mac- 
Nichol, 1961) using animal preparations and 
microelectrode recording techniques have in- 
dicated that whether or not a particular 
ganglion cell will produce an “on,” “off,” 
or “on-off” response may depend upon wave- 
length. Since the fovea and periphery of 
the human eye differ with respect both to 
luminosity functions and to retinal micro- 
structure, it was anticipated that variations 
in wavelength might also result in significant 
differences in the onset and offset RT rela-
tions observed previously with “white” light.

Method—The visual discriminator is de- 
scribed in detail by Pease and Sticht (1965). 
It presents a visual stimulus in Maxwellian 
view. Two fixation arrays consisting of a 
diamond-shaped arrangement of dots sub- 
tending a visual angle of 1°51’ were used. 
With one, the stimulus light fell on the 
fovea in the center of the fixation dots. 
With the other, it fell at a region 12° in 
the temporal retina. The diameter of the 
stimulus light subtended 21’ of visual angle. 
Observation was with the right eye. 

The two wave bands were determined by 
placing either Kodak Wratten filter 70, 
which eliminates energy below 650 mμ, or
filter 49b, which eliminates energy above
495 mμ, in the path of the collimated light
beam originating in the Sylvania R1131-C
glow modulator light source. The luminance
was varied by placing Wratten Neutral
Density filters in the light path adjacent to
the color filter. Luminance values were
calibrated with a Gamma Scientific Log-
Linear photometer and were (a) red: 1.29
mL., .129 mL., and .0129 mL., and (b) blue:
.43 mL., .043 mL., and .0043 mL. These
values reflect the built-in corrections in this 
device for the foveal luminosity function. 

The Ss were three male graduate students 
in the department of psychology with normal 
color vision who had been trained in this 
RT task for 10 hr. prior to the test trials. 
The Ss were seated in a light-tight booth
and dark adapted 30 min. before testing. In 
the onset condition, S initiated a foreperiod 
(the duration of which was varied sys-
tematically from trial to trial, ranging from
1.7 to 3.7 sec.) by depressing a footswitch.
The light came on at the end of the fore- 
period and S thereupon manually released a 
microswitch key as rapidly as possible. In 
the offset condition, the light came on when 
S initiated the foreperiod, and S reacted to 
the disappearance of the light at the end of 
the foreperiod. Reaction times were mea- 
sured with a Hewlett-Packard electronic 
counter. In each daily experimental session,
120 RTs were obtained from S, yielding
5 RTs for each location, luminance, wave-
length, and onset-offset condition. There
were eight sessions, hence each S provided 
40 RTs for each stimulus combination. All 
conditions were systematically varied in order 
of presentation to control for order effects. 
Before any tests were conducted each S was
practiced in the tasks until his RTs were 
reduced to asymptotic level. 
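
The trial counts in the procedure follow directly from the factorial design. A minimal sketch of that arithmetic, with illustrative labels for the factor levels (the labels and variable names are assumptions, not the authors' notation):

```python
from itertools import product

# Factors named in the Method section; level labels are illustrative.
locations = ["fovea", "periphery"]
wave_bands = ["red", "blue"]
luminance_levels = ["low", "middle", "high"]
events = ["onset", "offset"]

conditions = list(product(locations, wave_bands, luminance_levels, events))

rts_per_condition_per_session = 5
sessions = 8

rts_per_session = len(conditions) * rts_per_condition_per_session
rts_per_condition = rts_per_condition_per_session * sessions
print(len(conditions), rts_per_session, rts_per_condition)
```

The 24 stimulus combinations at 5 RTs each account for the 120 RTs per session, and eight sessions give the 40 RTs per combination on which each mean in the tables is based.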

Results and discussion—The means and 
standard deviations of the 40 RTs for each 
condition and each S were computed and 
analyses of variance performed on the means 
(see Tables 1 and 2). Separate ANOVAs
for the two color filter conditions were per- 
formed. The results indicated by the data 
and the ANOVA, for the conditions of this 


TABLE 1 


MEANS AND STANDARD DEVIATIONS FOR EACH OF THE
EXPERIMENTAL CONDITIONS AND FOR EACH S
USING THE RED FILTER


Intensity (mL.)


Subject Fovea Periphery
.0129 | .129 | 1.29 | .0129 | .129 | 1.29
1 
oF 289 
266 | 248 | 357 | 28' 
0 oe 26 29 31 46 38 238 
286 | 258 | 256 | 408 | 34: 
SD 28 22 28 76 i9 533 
Ony 248 
225 | 208 | 321 
One 26 23 31 37 233 23 
M 249 | 217 | 212 | 377 
i SD 22 17 20 73 S 332 
Ony 237 | 215 
197 | 384 
Gi ee 12 26 18 69 E E 
M 231 199 | 199 | 406 
SD 16 12 29 70 560 335 
TABLE 2


MEANS AND STANDARD DEVIATIONS FOR EACH OF THE
EXPERIMENTAL CONDITIONS AND FOR EACH
S USING THE BLUE FILTER


                               Intensity (mL.)
                        Fovea          |       Periphery
Subject           .0043 | .043 | .43   | .0043 | .043 | .43
1     On    M      339  | 276  | 256   |  297  | 283  | 266
            SD      25  |  26  |  24   |   52  |  30  |  28
      Off   M      345  | 268  | 255   |  346  | 325  | 280
            SD      52  |  24  |  28   |   32  |  33  |  26
2     On    M      338  | 232  | 217   |  252  | 228  | 215
            SD      51  |  15  |  16   |   14  |  17  |  18
      Off   M      364  | 234  | 214   |  313  | 282  | 246
            SD      76  |  15  |  17   |   22  |  18  |  22
3     On    M      316  | 235  | 207   |  261  | 237  | 220
            SD      66  |  46  |  26   |   31  |  44  |  31
      Off   M      296  | 214  | 195   |  304  | 265  | 238
            SD      65  |  13  |  12   |   33  |  25  |  39


experiment, may be summarized as follows: 
(a) Foveal RTs for the red filter, for both 
onset and offset stimulation, are faster than 
such RTs for peripheral stimulation, F =
125.56, p < .01. However, with the blue
filter and the lowest luminance, the peripheral
onset RTs are much faster than the foveal
onset or offset RTs, p < .01 for each of
three Ss by t test. These results reflect
the well-known differences in luminosity 
functions between these two retinal regions. 
(b) All RTs decrease as luminance is in- 
creased, which is again consistent with 
historical data. (c) There are no significant 
differences between foveal onset and offset 
stimuli with respect to RT for either wave 
band. In the periphery, however, onset RTs 
are consistently and significantly faster than 
offset RTs, F=247.76, p<.01. These 
peripheral results are consistent with the 
authors’ earlier findings for “white” light. 
For both fovea and periphery it is possible 
to illustrate the relationship of the onset and 
offset RTs to luminance as independent of 
wavelength if appropriate correction for the 
photochromatic interval is made, adjusting 
for the foveal and peripheral luminosity 
functions entailed here. 

Figure 1 presents a plot of the foveal and
peripheral onset and offset RTs obtained with
both the red and blue filters, as a function
of log luminance, for one S after such cor-
rection. The data for the other two Ss are
like these. Since the Gamma photometer
calibration yields values adjusted for the
foveal luminosity functions, log luminance
has been plotted for the foveal data accord-
ing to the photometer readings, for both flash
colors. The foveal data show that onset
and offset RTs for the two wave bands fall
along common functions for intensity.

[Figure 1, Subject D.R.: reaction time (msec.) on the ordinate, log luminance (mL.) on the abscissa; onset and offset curves for fovea and periphery.]

Fig. 1. Reaction time plotted against log luminance for two wave
bands, fovea and periphery, with peripheral blue values arbitrarily
multiplied by 100.

For the peripheral data, an adjustment for
one color of about 2 log units in intensity
places the red and blue flash data along a
common curve also. This correction is of
the order of magnitude to be expected for
these flash conditions (see Bartlett, 1965,
pp. 158-163, for discussion of foveal and
peripheral luminosity functions). Hence the
luminance values of the blue stimuli, as
expressed in the Gamma photometer cali-
bration, were multiplied by 100 in this plot.

In conclusion, wavelength has no effect in
altering the onset and offset RT relationships
previously established for the periphery using
white light; the system yields faster re-
sponses to onset than to offset for both
wave bands tested to a degree dependent on
intensity. With respect to the foveal data,
however, some caution must be exercised
before it is concluded that there are no
onset and offset RT differences. A number
of studies (see Pease & Sticht, 1965) have
reported faster offset RTs in the fovea with
low intensities. Figure 1 shows that, for
these conditions, offset RT may be slightly
faster than onset RT in the middle range
of intensities; a similar slight trend is
shown by the other two Ss. But irrespective
of whether there are certain observation
conditions in which foveal offset RT is re-
liably faster than onset, the foveal data are
markedly different than the peripheral, where
onset clearly is faster than offset. This
finding is in conformity with the differences
in functions of the periphery, which may be
considered a collector and detector of light
energy for the organism, and of the fovea,
where perception of form and color char-
acterize the outcome of activity.
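
The peripheral correction applied in the plot amounts to a constant shift on the log-luminance axis. A minimal sketch, assuming the calibrated luminance values listed in the Method section and the factor of 100 stated in the text; the variable and function names are illustrative only:

```python
import math

# Photometer-calibrated luminances (mL.) from the Method section.
red_mL = [1.29, 0.129, 0.0129]
blue_mL = [0.43, 0.043, 0.0043]

# Factor applied to the blue stimuli for the peripheral plot, per the text.
PERIPHERAL_BLUE_FACTOR = 100


def log_luminance(values, factor=1.0):
    """Log10 luminance after multiplying each value by a correction factor."""
    return [math.log10(v * factor) for v in values]


foveal_blue = log_luminance(blue_mL)
peripheral_blue = log_luminance(blue_mL, PERIPHERAL_BLUE_FACTOR)
shifts = [p - f for p, f in zip(peripheral_blue, foveal_blue)]
print([round(s, 3) for s in shifts])
```

Multiplying by 100 moves every blue point 2 log units to the right, which is the size of adjustment the text describes as bringing the red and blue peripheral data onto a common curve.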
ASSOCIABILITY:


A STUDY OF THE PROPERTIES OF ASSOCIATIVE RATINGS AND
THE ROLE OF ASSOCIATION IN WORD-WORD LEARNING 1


RICHARD KAMMANN
Bell Telephone Laboratories, Holmdel, New Jersey


An investigation first was made of the properties of associative ratings
of word pairs. These were analyzed in respect to rating instructions,
word orders, word frequency, and grammatical class and were judged
to be more useful than free-association probabilities. They were found
to be substantially correlated with free-association probability and nat-
ural language mediators, but were less well correlated with semantic
distance and meaningfulness. In 2 word-word paired-associate
learning experiments, associative ratings were, overall, more predictive
of correct responses and intrusions than semantic distance, response
meaningfulness, or response frequency. List intrusions were inter-
preted in terms of a combination of associative selection and response
sets.


This paper is most easily understood
as bearing two distinguishable mes-
sages. The first is that interword asso-
ciation can be measured more usefully
by associative ratings than by the free-
association or semantic-differential
methods. The second is that associa-
tion plays a more important part in
verbal learning than is usually acknowl-
edged, particularly in studies focusing
on response availability as an explana-
tory concept.


1 This research was supported by the Na-
tional Science Foundation through an Aca-
demic Year Extension grant to Oakland
University following the author’s participa-
tion in the National Science Foundation Re-
search Participation for College Teachers
Program at the University of Michigan
(Human Performance Center). The re-
search was conducted at Oakland Univer-
sity. Each of the experiments was executed
and much of the data was analyzed by Sue
Cox, an able and persevering assistant whose
help is gratefully acknowledged.

One difficulty with the free associa-
tion (FA) method is that the FA fre-
quency between most pairs of words is
zero. Furthermore, the FA value is
dependent on which of the two words
is used as the stimulus. These are
not merely practical limitations, since 
they reflect the operation of selective 
factors in FA responses, such as word 
frequency and part of speech. Fur- 
thermore, even very low-frequency FA 
responses seem to represent rather ob- 
vious and strong types of association. 


© 1968 by the American Psychological Association, Inc. 


There also is sufficient evidence to 
suggest that the reaction to two or 
more words may sometimes be based 
on factors of common context or com- 
mon associations among the words. 
This argument is supported by medi- 
ational learning studies, multiple-stimu- 
lus FA effects (Howes & Osgood, 
1954; Johnson, 1968), and by the para- 
digmatic nature of some free associa- 
tions, particularly nouns (Deese, 1965). 
As Deese pointed out, even free asso-
ciations can no longer be thought to
represent simple contiguities. Thus, it 
is reasonable to postulate that ratings 
might better reflect association by com- 
mon associations as well as by con- 
tiguity. 

The semantic differential might 
seem to be a suitable method since it 
does not depend on response produc- 
tion and is, by its very design, media- 
tional in nature. However, the dif- 
ferential assumes that mediational 
similarity can best be measured in 
terms of adjectival attributes, and, fur-
thermore, the differential makes no
attempt to represent associations based 
on even approximate contiguities. 
Thus, the two words ENGINE and DE- 
BATE in fact measure as semantically 
close, while the words COURT and
JESTER would certainly measure as 
semantically more distant. 

In the first section of this paper, the 
pair-rating method is analyzed for its 
properties as a method of scaling asso- 
ciation. It is referred to as the method 
of associability (Asb) to signify that 
pairs of words are the stimuli, that 
rating Ss are asked to consider a va- 
riety of ways for relating the two words 
to each other, and that the method 
possibly permits the operation of medi- 
ational or common context factors. 
The Asb method consists of asking Ss 
to state how likely it is that two words 
would occur together in a sentence or 
in poetry, or that one word would call 
the other to mind, or how easy it is 
to relate the words to each other. 

TABLE 1


SUMMARY OF WORDS, PAIRS, AND ASB INSTRUCTIONS USED IN
ANALYSES OF ASB SCALE PROPERTIES


Word Group | Words | Word Source    | Pairs | Label           | Instructions                   | Analyses
1          | 11    | Semantic Atlas | 43    | Heterogeneous   | Sentences, Poetry, Association | Asb Distribution; Rating Instructions; Word Orders; Group Reliability
2          | 24    | Semantic Atlas | 41    | Heterogeneous   | Association                    | Asb, NLMs, D, M, F
3          | 100   | FA Norms       | 100   | Free Associates | [illegible], Association       | Rating Instructions; Asb of FA Pairs
4          | 100   | Assistant      | 100   | Intuitive       | Relatedness, Association       | Rating Instructions; Asb of Intuitive Pairs
5          | 210   | Dictionary     | 180   | Random          | Relatedness                    | Asb, NLMs, Compared; Asb, NLMs, of Grammatical Pairs
6          | 60    | a              | 30    | Heterogeneous   | Relatedness                    | Sentence Lengths from Asb Pairs


a See text.


If Asb is a broad measure of asso-
ciation, it should be more independent
than the FA method of left-right di- 
rectionality, of word frequency (F), 
and of part of speech. Asb should not 
yield a large number of zero values,
and should discriminate among pairs 
with zero FA frequency with such 
pairs turning up as both high and low 
in Asb score. If Asb measures asso- 
ciation more comprehensively than the 
distances (Ds) based on the semantic 
differential, it should, at the least, not 
correlate highly with D (and should 
be a better predictor of paired-associate 
learning). 

The second part of the paper is de- 
voted to the relative importance of 
Asb, D, natural language mediators 
(NLMs), word frequency (F), and 
word meaningfulness (M) in paired- 
associate (PA) learning and intru-
sions. The analysis is designed to dis-
tinguish between the first three mea-
sures as associative indexes, and the
last two as response availability indexes. 
Partly in preparation for these com- 
parisons, the first part of the paper also 
considers the correlations among the 
five measures. 


THE PROPERTIES OF ASB SCORES


Since the study of Asb as a scaling 
method developed in an evolutionary 
fashion with different materials and 
rating instructions, Table 1 is pro- 
vided as an overview of the analyses 
conducted. 


Method 
Subjects—In all ratings, Ss were under-
graduate students at ...

... graphed sheets in class meetings. The num-
ber of pairs presented to a given S varied
between 30 and 100; the number of Ss rat- 
ing a particular pair varied between 20 and 
30 for each rating condition. 

Asb rating procedure—Rating Ss were
instructed at different times to give a rating
from 1 to 10 for sentences, “the degree to
which it is likely that the two words would
appear together in a single sentence”; poe-
try, “the degree to which it is likely that
two words would appear together in a single
line of poetry”; association, “the degree to
which the first word calls the second word
to mind”; and relatedness, “how easy it is
to find a way to relate the two words to
each other.”

The short form of the instructions was 
only 3 sentences long and is adequately rep-
resented by the phrases quoted above. The
long forms were about 28 sentences in length 
and were elaborated with the following 
anchoring examples each with an explana- 
tion of possible bases for ratings: salt-pepper 
(10), salt-glass (6-8), salt-dirt (3-4), salt- 
microphone (1). These examples were de- 
scribed as “just one person’s opinion,” and 
the set-inducing instruction for the ratings 
was repeated at the end. The relatedness 
instructions were always used in the long 
form, which included not only the above 
examples, but the following possible sources 
of relatedness: words found together (bird-
nest), similarity of objects (salt-snow), op-
posites (hot-cold), rhymes (quick-trick),
sound similarities (part-parcel), similar uses 
(car-train), synonyms (fair-just), language 
use (holy-ghost), common idea or associate 
(ring-index: finger), same category or class 
(square-triangle). 

Word groups—Word Group 1 was used
primarily to establish the distribution, reli- 
ability, and effects of different instructions 
on Asb scores, but these words also were 
used in a subsequent PA experiment. The 
words were sampled from the semantic atlas 
published by Jenkins, Russell, and Suci 
(1959) with the restriction that only words 
with known Noble’s (1952) m values for 
meaningfulness be considered. Originally, 
12 words were drawn and 50 pairs were ran- 
domly constructed, but an erroneous inter- 
pretation of “rompin” was made by an assist- 
ant during the Asb ratings, so 7 pairs involv- 
ing this word had to be omitted. The re- 
maining 11 nouns were ARMY, BODICE, EFFORT, 
INCOME, JELLY, KITCHEN, MONEY, QUOTA, 
WAGON, YOUNGSTER, and ZENITH, from which
set 43 pairs remained. Since these words 
varied in M and F, and had no known asso- 
ciative relationships, they were labeled 
heterogeneous. 

Word Group 2 was used primarily to
compare Asb with D, M, and F, and later
NLMs were added as the fifth index. These
words also were used in a subsequent PA
study. This group included Word Group 1
but added 13 new words from the semantic
atlas. The new words, which were not re-
quired to have a Noble’s m value on record
in this case, were DOWNY, FRIGID, GARMENT,
LEG, LEMON, RELIGION, ROSE, SEX, SPIDER,
SNOW, SWEEPING, SWEET, and UGLY. From
the pool of 24 words, 41 pairs were de-
veloped by random pairing, with each word
occurring in three or four different pairs.


TABLE 2


10 ASSOCIATIVE RESPONSES AND 10 COX
WORDS FOR THE STIMULUS WORD BLUE


FA Response | Frequency a   | Asb  | Cox Words          | Asb
Sky         | 252           | 9.39 | Blood (High)       | 7.02
Red         | 113           | 6.26 | [illegible] (High) | 8.51
Green       | 106           | 7.46 | Horizon (High)     | 7.76
Color       | 59            | 9.08 | Ink (High)         | 7.86
Cool        | [illegible]   | 7.02 | [illegible] (High) | 8.60
Cold        | 9             | 7.64 | [illegible] (Low)  | 1.33
Danube      | 3             | 8.19 | Fade (Low)         | 3.80
Car         | [illegible]   | 5.28 | Fur (Low)          | 2.21
Earth       | 1             | 2.22 | Pain (Low)         | 3.70
Blanket     | Idiosyncratic | 5.25 | Symbol (Low)       | 3.20


a Per 1,000 college students.


Word Group 3 was developed by drawing 
10 stimulus words at random from the FA 
norms developed by Palermo and Jenkins 
(1964). For each of the 10 stimulus words, 
10 response words were chosen, with one 
word representing each of the following 10 
levels of FA frequency (per 1,000 male and 
female college students): “idiosyncratic,” 
1, 2, 3-5, 6-10, 11-30, 31-80, 81-130, 131-200, 
201-300. Idiosyncratic refers to a response
word which occurred once in the norms for 
all age groups combined. The 10 stimulus 
words used were AFRAID, APPEAR, BLUE, 
BROADER, CITY, FOOT, HAND, PRIEST, SPEAK, 
and WINDOW. 
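
The nine numerical frequency levels used to select response words can be written as a simple lookup. This is an illustrative sketch only; the function and constant names are hypothetical. The tenth level, “idiosyncratic,” is defined by a single occurrence in the norms for all age groups combined and so is not a band of the college-student frequency; it is left out of the table below.

```python
# Inclusive bands of FA frequency per 1,000 college students, from the text.
FA_LEVELS = [
    (1, 1), (2, 2), (3, 5), (6, 10), (11, 30),
    (31, 80), (81, 130), (131, 200), (201, 300),
]


def fa_level(frequency):
    """Return the (low, high) band containing an FA frequency, or None."""
    for low, high in FA_LEVELS:
        if low <= frequency <= high:
            return (low, high)
    return None


# Frequencies listed for three responses to BLUE in Table 2.
for word, freq in [("Sky", 252), ("Color", 59), ("Cold", 9)]:
    print(word, fa_level(freq))
```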

Word Group 4 was intended to test the 
hypothesis that there are many easily found 
word pairs with high associability which do 
not register in FA norms. To this end, a
student assistant was given an overnight 
assignment to think of 5 words which were 
highly associable, and 5 words which were 
about as associable as low-frequency FA re- 
sponses, for each of the 10 stimulus words 
taken from the Palermo-Jenkins norms in 
Word Group 3. The assistant was given a 
copy of the norms and was instructed that 
neither the high-associable nor low-associ- 
able items could be words which appeared 
in the FA response norms so that all would 
be words with FA frequencies of 0/1,000. 
These were labeled the intuitive pairs, or the 
high- and low-Cox words. An example of
the 10 FA response words (Group 3) and 
the 10 Cox words (Group 4) is given for 
the stimulus word BLUE in Table 2. 

Word Group 5 was primarily intended to
examine the relationship between Asb and
grammatical class, although it also was used
to compare Asb with the NLM index. For 
this purpose, a random sampling by page 
entries was made from Webster’s New Col- 
legiate Dictionary (1959) for words which 
were listed as having only one part-of-speech 
category, with separate searches for 60 
nouns, 60 verbs, and 60 adjectives. The 
words also were chosen to give equal repre- 
sentation to seven logarithmic levels of the 
Thorndike-Lorge (1944) frequency. The 
verbs were half transitive and half intransi- 
tive. An adverb group was made by attach- 
ing “ly” to adjectives which could take that 
suffix. Each grammatical class was split
randomly into two subgroups which were 
paired off randomly with other subgroups 
in the following six combinations of 30 pairs 
each: noun-noun (N-N), verb-verb (V-V), 
adjective-adjective (J-J), noun-verb (N-V), 
adjective-noun (J-N), and adverb-verb (A- 
V) with left-right order in the pairs as 
indicated. 

Word Group 6 was a selection of 30 
heterogeneous noun pairs, the majority from 
Word Groups 1 and 2, representing 15 
equally spaced levels of associability, 2 pairs 
per level. These pairs were separated into 
two lists of 15 with no duplicate words. 
Each list was presented to 43 or 44 Ss who 
were told to write the most feasible or likely 
sentence they could think of for each pair. 
The purpose of this test was to determine 
whether or not there would be an inverse 
relationship between pair Asb and length 
of generated sentences. 

Other scaling procedures—For Word
Groups 1 and 2 (heterogeneous) the seman- 
tic distance values were taken directly from 
Jenkins, Russell, and Suci (1959). M was 
measured in terms of Noble’s m values taken 
from Noble’s (1952) norms for Word Group 
1, and was taken from a replication of
Noble’s procedure with Oakland University
students for Word Group 2. For the 12
words in this group which had both Noble's 
values and the replication values, the cor- 
relation between the two methods was .91. 
F was measured in terms of the Thorndike- 
Lorge G-count values with A words assigned 
a value of 75 per million and AA words 
assigned a value of 125 per million. Since 
this last value was an arbitrary guess,
statistical tests involving F were checked
with nonparametric tests; in no case was 
the result changed. 

The NLM values for the pairs in Word 
Groups 2 (heterogeneous) and 5 (random, 
grammatical) were obtained in a group 
testing of Oakland students. The Ss were 
given booklets of 40 grammatically mixed 
pairs, 1 pair per page, and were instructed 
to write down a letter, word, or phrase which 
would help them remember one of the two 
words if they were given the other. The 
Ss were given 15 sec. per pair, and the NLM 
index for a pair was the percentage of Ss 
who gave an NLM response. Each S
attempted 40 pairs, and each pair was 
attempted by 40 Ss. 
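
The NLM index defined above is a simple proportion. A minimal sketch, assuming a blank entry means that S gave no mediator; the function name and the example responses are hypothetical and are not data from the study:

```python
def nlm_index(responses):
    """Percentage of Ss who wrote any mediator for a pair (blank = none)."""
    given = sum(1 for r in responses if r.strip())
    return 100.0 * given / len(responses)


# Hypothetical responses from 40 Ss to one pair; not data from the study.
example_responses = ["bread"] * 26 + [""] * 14
index = nlm_index(example_responses)
print(index)
```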


Results 


Word Group 1.—Table 1 provides 
an organizational framework for assim- 
ilating the results. The Asb score for 
a pair of words was defined as the mean 
rating given to it. Ignoring for the 
moment the variations in rating in- 
structions used in Word Group 1, the 
distribution of Asb scores across the 
43 word pairs had a range from 1.69 
(BODICE-WAGON) to 9.52 (MONEY-IN-
COME). The mean was 4.57, the
median was 4.29, the standard devia- 
tion was 1.92; the distribution was nor- 
mal except for mild positive skewness. 

With three frame-of-reference in- 
structions (sentences, poetry, associa- 
tion) each divided into a short and 
long form, and with each of these six 
conditions further divided into a for- 
ward and backward (reversed) left- 
right order of the words in the pairs, 
there was a total of 12 separate rating 
conditions with 20-30 Ss per condition. 
With each rating condition compared 
to every other rating condition there 
were n(n —1)/2 pairs of conditions, 
for a total of 66 possible correlations. 
All 66 correlations were calculated and 
the results ranged from .89 to .98 with 
a median r of .95. 
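
The count of conditions and of pairwise correlations can be checked with a short sketch. The condition labels follow the text; the variable names are assumptions introduced only for this illustration.

```python
from itertools import combinations
from math import comb

# Three frame-of-reference instructions, each in a short and a long form,
# each form given in a forward and a backward left-right word order.
instructions = ["sentences", "poetry", "association"]
forms = ["short", "long"]
orders = ["forward", "backward"]

# Every combination of instruction, form, and order is one rating condition.
conditions = [(i, f, o) for i in instructions for f in forms for o in orders]

# Each unordered pair of conditions yields one correlation: n(n - 1)/2.
condition_pairs = list(combinations(conditions, 2))

n = len(conditions)
n_pairs = len(condition_pairs)
print(n, n_pairs, comb(n, 2))
```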

There was no evidence that correla- 
tions between conditions with opposite 


left-right word orders in the pairs were 
lower than those between conditions 
based on the same left-right order. 
There was, however, a distinct tend- 
ency for correlations between sentences 
and poetry conditions to be higher 
than the correlations between either 
one of these conditions and the asso- 
ciation instructions. Furthermore, the 
mean Asb score in the combined asso- 
ciation conditions (3.84) was a full 
scale point, or one-half a standard 
deviation below the means for the com- 
bined sentences conditions (4.97) and 
the combined poetry conditions (4.95). 

When each of the 12 rating condi- 
tions was split in half and balanced for 
sex, the 12 split-group reliability cor- 
relations for 43 Asb scores based on 
12 Ss per subgroup varied from .91 
to .96 with a median of .93. When 
the 12 rating groups were restored and 
redivided into males and females, the 
correlations varied from .88 to .96 with 
a median of .94 between the sexes. In 
summary, these results indicate that, 
except for an absolute shift downward 
with association instructions, Asb 
scores are the same across various rat- 
ing conditions, word order makes no 
difference, and the results are consis- 
tent with small groups of Ss of either 
sex. 

Word Group 2.—Across the second 
group of 41 heterogeneous pairs, the 
correlations among Asb, NLM, D, M, 
and F were calculated. In this case, 
Mp and Fp stand for the sum of meaningfulness
and frequency values for
both words in each pair. The correla- 
tions with M and F using just right- 
hand or just left-hand members did 
not change the results. The set of all 
correlations is presented in Table 3.
The means and standard deviations
are added at the bottom to indicate adequate
variability in each of the scales.

The table indicates that there are 


TABLE 3

CORRELATIONS AND DISTRIBUTIONS FOR FIVE
VERBAL SCALING METHODS

Scale    Asb     NLMs    D       Mp      Fp

Pearson r's
Asb      -       .62**   −.37*   .14     .16
NLMs             -       .18     .29     .22
D                        -       −.18    −.22
Mp                               -       .64**
Fp                                       -

Distributions
M        4.3     80.2    61.0    16.8    115.3
SD       1.7     11.0    19.6    1.9     71.0

*p < .05.
**p < .001.


two basic dimensions at work: one for 
association with Asb and NLMs cor- 
related at .62; and one for unit 
strength, with Mp and Fp correlated
at .64. Also, D shows up with a mod- 
erate r of —.37 with Asb. None of 
the associative variables has a signif- 
icant correlation with either of the M 
or F variables, a fact which is of some 
significance in distinguishing Asb from 
the FA method which produces high- 
frequency response words (Johnson, 
1956). It is also interesting that the 
number of associates which each of 
two words produces, that is Mp, is 
not predictive of the Asb score of the 
pair. 

Word Groups 3 and 4.—The 100 FA 
pairs and the 100 intuitive Cox pairs 
were subjected to both association and 
relatedness Asb ratings. The Pearson 
r between these two types of scores 
across all 200 pairs was .91. The mean 
for association Asb was 6.27 and for 
relatedness Asb was 7.67, this difference
significant by t test beyond the
.001 level. In general, it appears that
all four types of Asb instructions pro- 
duce equivalent interval scale results 
but that association Asb is consistently 
a scale point or more below the other
three. 

The relationship between Asb and 
FA frequency, shown in the right-hand 
section of Fig. 1, is positive and sub- 
stantial, confirming the results of 
Garskof and Forrester (1966). They 
found that association Asb correlated 
with unidirectional FA frequency at 
.54, and with bidirectional FA fre- 
quency at .77. Still, this leaves asso- 
ciation Asb better correlated with other 
Asb ratings than with FA frequency. 

It was commented that the FA
method appears to tap only strong 
levels of associative strength, even at 
low-FA frequencies. This can be veri- 
fied in Fig. 1 by taking 4.0 as the base 
line representing random association 
Asb (Word Groups 1 and 2). It will 
be observed that at every level of FA 
frequency above idiosyncratic, the mean 
association Asb is above this general- 
ized average. A similar conclusion 
holds for the relatedness Asb scores if 
5.0 is taken from sentences Asb (Word 
Group 1) as a rounded base line. 

The left side of Fig. 1 shows the Asb 
means for the high- and low-intuitive 


[Figure 1. Vertical axis: associability (Asb). Horizontal axis: response frequency per 1000 Ss (idiosyncratic, 1, 2, 3-5, 6-10, 11-30, 31-80, 81-130, 131-200, 201-300). Separate curves for relatedness and association instructions.]

Fig. 1. Mean Asb ratings for 50 Cox-
low and 50 Cox-high pairs, and for 10 pairs
of words at each of 10 free association levels.
(Each mean is shown twice, once for Asb
scores by the association instructions, and
once for Asb scores by the relatedness in-
structions.)
TABLE 4
CORRELATIONS AND DISTRIBUTIONS OF Asb AND NLMs IN SIX GRAMMATICAL SAMPLES


Grammatical Combination 


Statistic 
N-N V-V J-J N-V J-N A-V Combined 
r (Asb-NLM) .72 .64 .52 .65
M Asb 4.9 5.8 5. 5. rea r 5 i
SD Asb 1.9 1.4 1. 1.8 1.3 1.3 1.6
M NLM 77.2 77.5 71. 79.5 70.6 68.4 74.1
SD NLM 10.4 9.7 13. 12.1 13.8 11.9 12.6


Cox pairs by each method of rating. 
It will be recalled that none of these
pairs were listed as free associates in 
the Palermo-Jenkins norms. Never- 
theless, even with the casual method 
used to obtain these pairs, the intui- 
tively high pairs are equal in Asb to 
about the fifth level of FA frequency; 
there were some individual high-Cox 
pairs with Asb scores over 9.0. In
summary, the Asb method is substantially
correlated with FA frequencies
where they exist, but it also measures 
associative relationships below the level 
detectable by the FA method; and, 
finally, it detects very strong relation- 
ships overlooked by the FA method. 
Word Group 5.—A more informative
correlation between Asb and
NLMs is presented in Table 4 in the 
context of six different grammatical 
combinations of 30 pairs each. The 
two combinations involving adjectives, 
particularly J-N, yield lower r’s, but for
all other combinations the r’s are in 
the range .64-.72. How or why Asb 
and NLMs differ on J-N combinations 
cannot be interpreted from the present 
data.

Table 4 also indicates the mean 
values of Asb and the mean values of 
NLMs as a function of grammatical 
combination. The difference among 
the Asb means was significant, F (5,
174) = 2.78, p < .05, as was the difference
between the NLM means, F (5,
174) = 4.34, p < .01. Duncan’s range
test indicates the following grouping: 


Asb:  V-V N-V J-J N-N A-V J-N

NLMs: N-V V-V N-N J-J J-N A-V²


The significance of the Asb results 
for grammatical classes rests in their 
contrast with the results of FA pairs, 
in which case Deese (1965) has shown 
that nouns yield about 75% paradig- 
matic associates, verbs and adjectives 
about 50% paradigmatic associates, 
and adverbs about 25% paradigmatic 
associates; the remainder in each case 
are syntagmatic associates. The gram- 
matical analog to a paradigmatic asso- 
ciation is a pair made up of the same 
parts of speech and the analog to a 
syntagmatic one is a pair providing 
a phrase-making combination (N-V, 
J-N, A-V). Thus, the frequency of 
occurrence of the different grammatical 
combinations in FA data leads to the 
following expectations for Asb : 


(See Footnote 2.) 

First, the Asb means are not as 
strongly differentiated as the FA re- 
sults would anticipate. Second, the re- 
sults that V-V is significantly greater 


2 Groups underlined with the same line are 
not significantly different. 
than A-V, and N-V is (not signif-
icantly) greater than N-N, are both
reversals of the expectations from the 
FA data. This indicates that Asb and 
FA are sensitive to somewhat different 
associative processes. The psycholog- 
ical basis of the two significant dif- 
ferences in Asb means is not identifi- 
able in the present data. The same is 
true for the differences among NLM 
means, although it may be summarized 
that combinations made from nouns 
and verbs produce more NLMs than 
combinations involving an adjective or 
adverb.

Word Group 6.—With the aid of 30 
heterogeneous pairs, it was possible to 
determine the correlation between the 
Asb of pairs and the mean length (in 
words) of sentences generated by Ss 
instructed to write the most feasible 
or likely sentence using those pairs.
The resulting r was −.50, p < .01.
An examination of the scatterplot 
showed that unassociable pairs always 
produced high-mean sentence lengths, 
whereas highly associable pairs pro- 
duced both low- and high-mean sen- 
tence lengths. 


Discussion 


Proceeding without the benefit of the
second section of this paper, it is assumed
that Asb is some kind of measure of association
or associative strength. Granted
this, it appears to be superior to the FA
method in its greater independence from
word order, word frequency, and grammatical
class. It also is more comprehensive
than the FA method, yielding both
high and low values for pairs with zero
FA frequency; it does not produce a
large number of zerolike values itself.

The Asb method is eminently practical.
It is consistent across a variety of rating
instructions, reliable with small groups of
Ss, and independent of sex. If data are
needed on a large number of pairs it is
simple to divide the pairs into sets of 50-
100 and present these to separate sub- 


groups in large classroom testing. The 
method can be applied to any pairs of 
interest, including those which are pro- 
duced by Ss, e.g., the stimulus-intrusion 
pairs in PA learning. 

As with FA frequencies, there is no 
direct evidence on the origins of Asb 
scores; there are no norms for either the 
number of co-occurrences of two words in 
the language, or for their separate co-
occurrences with common third events.
However, the facts that Asb is positively 
correlated with NLMs, negatively cor- 
related with semantic distance (D), and 
negatively correlated with generated sen- 
tence length are all suggestive of the 
possibility that Asb in part reflects com- 
mon context or mediational sources of 
relatedness. 


ASSOCIABILITY IN PAIRED-ASSOCIATE
LEARNING


If Asb is a measure of association it 
should be predictive of both PA learn- 
ability and the occurrence of intrusions. 
However, this hypothesis assumes that 
specific associative strength is a factor 
in pair learning but, in fact, the avail- 
able evidence is in favor of the im- 
portance of response availability as 
indexed by M, F, or response familiarization
(Goss & Nodine, 1965; Underwood
& Schulz, 1960a).

Attempts to demonstrate differences 
in word-word learning as a function of 
high- and low-FA frequencies have
had mixed results with Haun (1960) 
obtaining no effect, Postman (1962) 
a weak effect with low-F stimulus 
words only, and Underwood and 
Schulz (1960b) a stronger effect using 
only nouns and their descriptive ad- 
jectives. Part of the difficulty may be 
the restricted range of high associative 
strengths tapped by the FA method. 
On the other hand, Homzie and 
Weimer (1967) have been able to 
demonstrate more rapid learning of 
pairs with low semantic Ds than pairs 
with high Ds. 
The intrusions in PA learning would
seem to be based even more obviously 
on response availability. Underwood 
and Schulz (1960a) add recency to 
frequency as a source of unit response 
strength to account for the predomi- 
nance of intralist over extralist intru- 
sions. However, in the case of extra- 
list intrusions, there is some evidence 
from word-word learning studies that 
association may be a factor, and in 
either of two different senses. Postman
(1962) found that extralist intrusions
were FA primaries of the (low-
F) stimulus words; Underwood and 
Hughes (1950) found them to be rated 
as highly synonymous or homonymous 
with the response words they displaced.

What appears to be needed is a di- 
rect comparison of the response avail- 
ability factor with the association fac- 
tor in both forward and backward 
word-word learning, and in respect to 
both intralist and extralist intrusions.
The following two PA experiments 
(and an extra analysis of intrusions 
from a third experiment) attempt to 
provide this comparison in terms of 
words which are relatively unselected 
except for part of speech (mostly 
nouns with a few adjectives). Re- 
sponse availability is represented by 
Mb and Fb in forward learning, and
Ma and Fa in backward recall. Association
is represented by Asb, NLMs,


and D. 


Method 


Experiment I: Subjects—The Ss were 
three groups of 20-26 Oakland University 
volunteer female students naive to verbal
learning experiments. Each group learned a 
separate equivalent list. 

Materials —The word GARMENT was added 
to the 11 heterogeneous words in Word 
Group 1 as described previously. The 12 
words were randomly divided, randomly 
paired, and randomly assigned to stimulus 
or response positions. This was done three 
successive times to form three alternative


lists of 6 pairs each, for a pooled total of 
18 unique pairs in the analysis of item learnability.
New Asb ratings were collected on
all 18 pairs with poetry instructions; NLMs 
were collected along with Word Group 5 
(random, grammatical) ; D, M, and F values 
were taken from the published norms. 

Procedure.—The Ss were given one study 
trial and two anticipation trials on a Stowe 
memory drum at a 2:2 sec. rate of presentation
for S and S-R, respectively. Four different
starting orders for the items were used
equally often, and the different orders also
were used on successive learning trials.
Immediately after the last trial, S was shown
the six response items in the fourth list
order and given up to 8 sec. to recall each
stimulus word in order.

Experiment II.—Experiment II was essen- 
tially an expanded version of Exp. I. The 
number of items was doubled from 6 per 
list to 12, the number of anticipation trials 
was increased from two to three, and the 
number of lists was increased from three 
to four, with each list learned by 12 female 
Ss. The 12 pairs in each list were con- 
structed by random pairing of the 24 words 
in heterogeneous Word Group 2, with 7 
pairs appearing in two different lists (6 by 
intent). The backward recall test was paced 
at a 4-sec. rate on the memory drum. With
these 41 pairs the Asb ratings were taken 
in terms of association instructions, and 
Noble’s m values also were obtained locally, 
as previously described for Word Group 2.


Results 


Learnability—The forward learn- 
ability of each pair was defined as the 
mean number of correct responses on 
all learning trials combined. The back- 
ward learnability of each pair was the 
percentage of Ss who were correct in 
the backward recall test. The correla- 
tions between each type of learnability 
and the various verbal scalings of the 
items are shown in Table 5.
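
The two learnability indexes can be sketched as follows. This is an illustration under stated assumptions: the function names and the data layout are hypothetical, and the forward index is read here as the total number of correct anticipations per S over all learning trials, averaged across Ss.

```python
def forward_learnability(correct_by_subject):
    """Mean number of correct responses per S on all learning trials combined.

    correct_by_subject holds one list per S; each list has a 1 (correct) or
    0 (incorrect) for this pair on each anticipation trial. (Assumed layout.)
    """
    totals = [sum(trials) for trials in correct_by_subject]
    return sum(totals) / len(totals)


def backward_learnability(recalled_by_subject):
    """Percentage of Ss who were correct on this pair in backward recall.

    recalled_by_subject holds one 1 or 0 per S. (Assumed layout.)
    """
    return 100.0 * sum(recalled_by_subject) / len(recalled_by_subject)


# Hypothetical scores for one pair: four Ss, two anticipation trials each.
example_forward = forward_learnability([[1, 0], [1, 1], [0, 0], [0, 1]])
example_backward = backward_learnability([1, 0, 0, 1])
print(example_forward, example_backward)
```

Each pair then contributes one forward and one backward value, and these values are what would be correlated with the verbal scalings.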

In forward learning, association is 
the major factor, with Asb the only 
predictor in the first experiment, and 
with Asb and NLMs the best two pre- 
dictors in the second one. There is 
no significant evidence of response 
availability as either Mb or Fb, but, for
TABLE 5
CORRELATIONS BETWEEN VERBAL INDEXES AND LEARNABILITY OF PAIRS
Learnability Asb NLMs D Ma Mb Fa Fb
ae Ts 
d S5% =.01 18 -20 —.20 10 —.30 
Daci yar -20 —.08 41 A7* = 120, 42 —.40 
. [Ib 
and Rohe 45** —.02 -36* 10 09 28 
Backward 30” —.06 —.06 34* —.29 aug 14 
aN = 18 pairs, 
bN = 41 pairs, 
HSH 
> Sor 


some reason, Ma is slightly predictive
in the second experiment. In backward
learning, Ma is the best overall
predictor, which is consistent with re- 
sponse availability. There also is a 
trace of an association factor in back- 
ward recall, in terms of Asb. 

Intralist intrusions —The response 
availability principle leads to the expec- 
tation that intralist intrusions (Xs) 
will be higher in M or F than the 
assigned responses (Rs) which they 
displace. Or, with a less specific view 
of response competition, the expecta- 
tion might be that Xs will simply be 
higher in M or F than would be ex- 
pected by chance, i.e., they will more 
often be above the median M or F of 
all possible intruders (Ws) available in 
the same list. In the following analysis,
W, or a possible intruder, refers
to all words in the list except three
items: the stimulus present when the
intruder occurred, the response it displaced,
and the intruder itself (S, R, and
X, respectively).
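
The two response availability comparisons for a single intrusion can be sketched as follows. The function names, the dictionary layout, and the example values are assumptions made for illustration and are not taken from the data.

```python
from statistics import median


def possible_intruders(list_words, stimulus, response, intruder):
    """All words in the list except S, R, and X for this intrusion (the Ws)."""
    excluded = {stimulus, response, intruder}
    return [w for w in list_words if w not in excluded]


def availability_comparisons(list_words, scale, stimulus, response, intruder):
    """Compare the intruder's M or F value with R and with the median W.

    scale maps each word to its M or F value. (Assumed layout.)
    """
    ws = possible_intruders(list_words, stimulus, response, intruder)
    x_value = scale[intruder]
    return {
        "x_gt_r": x_value > scale[response],
        "x_gt_mdn_w": x_value > median([scale[w] for w in ws]),
    }


# Hypothetical six-word list with made-up F values.
words = ["a", "b", "c", "d", "e", "f"]
f_values = {"a": 10, "b": 20, "c": 30, "d": 40, "e": 50, "f": 60}
result = availability_comparisons(words, f_values, "a", "b", "e")
print(result)
```

Tallying `x_gt_r` and `x_gt_mdn_w` over all intrusions gives the percentages that enter the sign test.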

The stimulus association principle 
leads to the expectation that the asso- 
ciation between the stimulus and the 
intruder (S-X) will be higher than the 
association between the stimulus and 
the assigned response (S-R). With a 
less specific view of associative compe- 
tition, the expectation would be that 


S-X associations will be higher than 
would be expected by chance consider- 
ing all possible S-W possibilities. 
This analysis requires knowledge of the 
associative relationship of each stim- 
ulus word to all other words in its list 
which is easily accomplished with the 
aid of the Asb method. In the partic- 
ular lists used in these experiments, the 
necessary D values also were available. 
With D, of course, the direction of the 
prediction is reversed since D is a 
measure of dissociation. 

The response association principle 
leads to the expectation that the asso- 
ciation between the response and the 
intruder (R-X) will be greater than 
the expected chance value, that is, the 
median value for all R-Ws given that 
R. Again, this requires the associative 
value between each R and all its Ws, 
and again, this was available for the 
intralist intrusions in terms of both 
Asb and D. 

All intralist intrusions from each ex- 
periment were recorded and subjected, 
item by item, to the analyses necessary 
to test the three hypotheses: response 
availability, stimulus association, and 
response association. Since the results 
were very similar in both experiments 
they have been pooled in this report. 
The fact that different Asb instructions 
(poetry, association) were used in 


Exp. I and II is not a problem because
all comparisons involved only those 
words occurring in the same list in the 
same experiment. 

The results for all 211 intralist in- 
trusions occurring during forward 
learning are presented in Table 6. The 
left-hand column shows the compari- 
sons appropriate to each hypothesis; 
the next column shows the verbal scal- 
ing method used to make the compari- 
son; and the next column shows the 
percentage of times the first term in 
the comparison was greater than the 
second term. Although there were 
some ties, the number of intrusion 
cases involved in the sign test was 
never below 190. 
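
A sign test of this kind can be sketched with the normal approximation to the binomial. The sketch assumes that ties are dropped and that no continuity correction is applied; the report does not state either detail, and the function names and counts are hypothetical.

```python
from math import erfc, sqrt


def sign_test_z(n_first_greater, n_untied):
    """z for the number of untied cases in which the first term was greater.

    Under the null hypothesis the count is binomial with p = .5, so its
    mean is n/2 and its standard deviation is sqrt(n)/2.
    """
    expected = n_untied / 2
    sd = sqrt(n_untied) / 2
    return (n_first_greater - expected) / sd


def two_tailed_p(z):
    """Two-tailed probability of a standard normal deviate at least |z|."""
    return erfc(abs(z) / sqrt(2))


# Hypothetical counts: the first term was greater in 60 of 100 untied cases.
z = sign_test_z(60, 100)
p = two_tailed_p(z)
print(round(z, 2), round(p, 4))
```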

There is evidence for response avail- 
ability, but only in terms of F, and 
only in terms of the less specific view 
of response competition, that is, X in 
competition with all Ws, and not just 
X in competition with R. The F value 
of the X word was greater than the 
median F value of the Ws in 63% of 
the cases. 

Stimulus association emerges as the 
strongest factor in the intrusions with 
S-X associability greater than S-R 
associability in 71% of the cases. That 
this is not just a result of the greater 
opportunity for intrusions to occur 
with the more poorly learned, low-Asb, 
S-R pairs is confirmed by the fact that 
S-X also was greater in associability 
than the median S-W in 72% of the 
cases. The role of stimulus associa- 
tion also is confirmed equally well in 
terms of D in the comparison of S-X 
with median S-W, but not in com- 
parison with S-R. There is no evi- 
dence for a response association process 
at work in these data, but this point 
will come up again. 

Since these results include more
than one intrusion each for many Ss, 
one may wonder if the results are not 


TABLE 6
ANALYSIS OF FORWARD INTRALIST INTRUSIONS

Comparison            Scale   Percentage(a)   z

Response Availability
X vs. R               M       49.3            −.21
X vs. Mdn W           M       46.9            −.90
X vs. R               F       54.1            1.15
X vs. Mdn W           F       63.1            3.69*

Stimulus Association
S-X vs. S-R           Asb     70.8            6.02*
S-X vs. Mdn S-W       Asb     72.2            6.36*
S-X vs. S-R           D       46.4            −1.03
S-X vs. Mdn S-W       D       28.9            −5.91*

Response Association
R-X vs. Mdn R-W       Asb     54.2            1.16
R-X vs. Mdn R-W       D       44.7            −1.49

Note.—N = 211.
(a) Percentage of times the first term in the comparison
was greater than the second term.
*p < .001.


exaggerated by repetition of the same 
intruders. One also may argue that 
independence of observations has been 
violated, rendering the sign test in- 
appropriate. First, when the data were 
analyzed removing repeated intrusions 
(both within and between Ss) so that 
every intrusion represented a unique 
S-(R)-X combination, the results were 
nearly identical except that response 
F was no longer a significant factor. 
Second, when intralist intrusions were 
sampled at a rate of only one per S
for a total of 79 cases, the association 
factor remained unchanged and the F 
value of the intruder fell just short of 
significance showing up in 61% of the 
comparisons of X with median W. 
An interesting question is whether 
the response availability and associative 
factors work in the same way for in- 
trusions occurring during backward re- 
call. In the two experiments combined 
there was a total of 89 intralist intrusions
in backward recall. The results


of the analyses on these are shown in 
Table 7. In this case “S” refers to a 
TABLE 7
ANALYSIS OF BACKWARD INTRALIST
INTRUSIONS

Comparison            Scale   Percentage   z

Response Availability
X vs. R               M       60.7         2.01*
X vs. Mdn W           M       44.7         −.98
X vs. R               F       64.1         2.49*
X vs. Mdn W           F       57.3         1.27

Stimulus Association
S-X vs. S-R           Asb     70.1         3.75***
S-X vs. Mdn S-W       Asb     65.4         2.71*
S-X vs. S-R           D       44.9         −.95
S-X vs. Mdn S-W       D       32.1         −3.27**

Response Association
R-X vs. Mdn R-W       Asb     55.6         1.00
R-X vs. Mdn R-W       D       48.7         −.23


word on the response side of the list 
and “R” to a word on the stimulus 
side. Observe that the X word was 
greater in M and F than the R word
in 61% and 64% of the cases, respectively,
p < .05. However, stimulus
association was still the most
important factor with S-X greater than
S-R in 70% of the cases, p < .001.
Again, there was no evidence of a response
association process at work.

The entire question of the response
availability and associative factors was
examined by an alternative method
which consisted of giving each individual
word in each experiment an intrusiveness
score. This score was the
number of times a word occurred as an
intruder in forward and backward
learning, corrected for the number of
times it was on the stimulus or response
sides of a list, which involved
different overall rates of occurrence.
Intrusiveness then was correlated with
the M and F values of the words. For
purposes of determining the associative
involvement of a word with its other
list words, the mean of the highest
three Asb scores for that word with
every other word was calculated.
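
The associative involvement index and its correlation with intrusiveness can be sketched as follows. The function names and the pair-keyed dictionary are assumptions for illustration, and the correction of intrusiveness for list side is not reproduced because its exact form is not given.

```python
from math import sqrt


def associative_involvement(word, list_words, asb):
    """Mean of the three highest Asb scores of a word with its list mates.

    asb maps frozenset({word_1, word_2}) to the pair's Asb score, so the
    left-right order of the two words does not matter. (Assumed layout.)
    """
    scores = sorted(
        (asb[frozenset((word, other))] for other in list_words if other != word),
        reverse=True,
    )
    top_three = scores[:3]
    return sum(top_three) / len(top_three)


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical Asb scores for word "a" with four list mates.
words = ["a", "b", "c", "d", "e"]
asb = {
    frozenset(("a", "b")): 2.0,
    frozenset(("a", "c")): 4.0,
    frozenset(("a", "d")): 6.0,
    frozenset(("a", "e")): 8.0,
}
involvement_a = associative_involvement("a", words, asb)
print(involvement_a)
```

With one involvement value and one intrusiveness score per word, `pearson_r` over the words of an experiment would give a coefficient of the kind reported in the next paragraph.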

There was no significant correlation 
between intrusiveness and either M or 
F in either experiment, but intrusive- 
ness correlated significantly with asso- 
ciative involvement, r = .54 in Exp. I 
and .45 in Exp. II. Since there was 
no evidence for response availability 
in this analysis, it appears that high 
M or F is not a sufficient condition by 
itself to make a word a frequent in- 
truder. Apparently, a word must be 
associated first, and then, if there are 
two such words, the more available one 
will become the intruder (Tables 6
and 7).

Intralist intrusions in conceptual 
lists—It was noted that there was no 
evidence for a response association 
process in these data. A study was
conducted by Hardmann³ concerning
two concept-structured learning lists of 
12 word-word items. In Hardmann’s 
first list, the 12 stimulus words came 
from four concept clusters (person, 
cloth, animal, geometric form) and the 
12 response words came from four 
other conceptual clusters (flower, 
building, fruit, furniture) and the concept
groups were paired (i.e., the three
cloth words were cues for the three 
building words, etc.). In the second 
list the same stimulus and response 
words were paired randomly.

Over 10 anticipation trials there was 
no difference in the number of correct 
responses between the conceptually 
organized and disorganized lists.
However, Hardmann noted that intra- 
category substitutions were much more 
numerous in the conceptually organized 


³ Unpublished paper by Carol Hardmann,
Department of Psychology, Eastern Michigan
University.
list than in the other one, suggesting
category learning, but there was con- 
fusion among specific responses. The 
complete record of intralist intrusions 
was borrowed and subjected to analysis 
in terms of Asb. The results are
shown in Table 8. In the conceptually 
organized list there is no evidence of 
stimulus association, but response 
association shows up in 96% of the
comparisons of R-X with median R-W. 
In the second list, which is conceptually 
disorganized but contains the same S 
and R word sets, response association 
is still present at 68% although it is 
reduced, and stimulus association has 
reappeared. In both lists the response 
association effect is more than just a 
confirmation of intraconcept substitu- 
tions since these accounted for only 
71% of intrusions in the first list and 
only 18% in the second. The stronger 
Asb result is explained by the many 
instances of building-furniture substitutions
and of fruit-flower substitutions.

Extralist intrusions.—Among the
235 intrusions in forward learning in 
Exp. I and II, there were 24 extralist 
intrusions. An obvious problem in 
analyzing these cases is that W, a 
possible alternative intruder, should 
refer to every word in S’s vocabulary, 
and there is no sample which ade- 
quately represents this. The decision 
was made to compare the rate of high- 
frequency (AA) words among intru- 
sions to their rate of occurrence in a 
sample of 200 Thorndike-Lorge listings. 
For the sake of comparing S-X or 
R-X with median S-W or median 
R-W, the latter two were assigned 
values of 5.0 in Exp. I (poetry Asb), 
and 4.0 in Exp. II (association Asb), 
both of these values being more than 
1.0 above the means of median S-Ws
and R-Ws among the intralist intru- 
sions. 


TABLE 8
ANALYSIS OF CONCEPTUAL LIST INTRUSIONS

Comparison            Scale   Percentage(a)   z

Conceptually Organized List
Stimulus Association
S-X vs. S-R           Asb     48.7            −.23
S-X vs. Mdn S-W       Asb     61.2            1.88
Response Association
R-X vs. Mdn R-W       Asb     96.1            7.90**

Conceptually Disorganized List(c)
Stimulus Association
S-X vs. S-R           Asb     72.8            3.82**
S-X vs. Mdn S-W       Asb     60.6            1.73
Response Association
R-X vs. Mdn R-W       Asb     68.3            2.96*

(a) Percentage of times the first term in the comparison
was greater than the second term.
(c) N = 70.
*p < .01.
**p < .001.


In the Thorndike-Lorge sample the 
rate of AA words was 7%, whereas, 
among the 24 extralist intrusions, AA 
words occurred at a rate of 62%, indi- 
cating a clear role for response avail- 
ability. Stimulus association also 
showed up clearly; S-X was greater 
than S-R in 75% of the cases, and 
greater than the arbitrary value for 
median S-W in 79% of the cases, 
p<.05. There was no evidence for 
response association, R-X being 
greater than median R-W in only 45% 
of the cases. 

There were only 11 extralist intru- 
sions in the Hardmann lists, but these 
were sufficient to show a pattern. Six 
of the 11 were AA words. Two of 
the intruders could be classified under 
stimulus association (PIG-ROSE, as S-R, 
producing PIG-MOLE and PIG-COW as 
S-Xs). Similarly, by the criterion of 
same concept category, 8 of the remain- 
ing 9 intrusions could be classified as 
response association. Thus, the extra- 
list intrusions in each experiment fol- 
lowed the pattern of intralist intrusions, 
except that response availability 
emerged as a more significant factor. 

Final notes on intrusions.—In all of
the foregoing analyses the statistical
test was based on the percentage of 
times that the intruder X was greater 
in some characteristic than the R, or 
the median W for that case. It was 
equally possible to make all of these 
comparisons in terms of mean differ- 
ences rather than percentages. Such 
comparisons were in fact conducted for 
Exp. I and II with the result that re- 
sponse availability and response asso- 
ciation (by Asb, not by D) were 
significant in many more of the comparisons.
These differences represent
the effects of a few strong instances
on the mean values for intruders and
therefore establish the existence of
the factors without regard to their 
prevalence. More importantly, the 
relative importance of the various fac- 
tors remained unchanged with stim- 
ulus association remaining the strongest 
factor. 

There were numerous instances ob- 
served in the protocols of intrusions in 
which there was an obvious sound or 
letter similarity between the intruder 
and either the stimulus or the re- 
sponse. However, the Asb values of 
these S-Xs and R-Xs were typically 
quite low; therefore, association by 
sound and letter similarities has not 
contributed to the associative analyses 
as it should have. This is a limitation
on the Asb method which must
be taken into account, although it is 
possible that relatedness Asb instruc- 
tions may compensate. 

The interpretation of the factors 
operating in intrusions will depend 
importantly on two other findings 
which have nothing to do with the 
various scaling measures. The first
finding is that in Exp. I and II the
intralist intruder came from the response
side of the list in 75% of the
cases in forward learning, and from the 
stimulus side of the list in 74% of 


the cases in backward recall. The 
second is that in the Hardmann lists 
this selective process was even stronger 
with intralist intruders coming from 
the response side of the list in 90% of 
the cases. 


Discussion 


It is apparent that Asb is an associative 
measure and, at the same time, that as- 
sociation plays a substantial role in de- 
termining both the correct responses and 
errors in word-word learning. Asb also 
was more consistently predictive of the 
correct and incorrect responses than was 
D. Response availability also was de- 
tectable in many analyses but it was con- 
sistently the lesser effect. By the analysis 
of intrusiveness it is apparent that a high 
F or M value is not by itself adequate to 
make a word into a frequent intruder.

It is now possible to give a more com- 
prehensive account of the sources of both 
intralist and extralist intrusions. It is 
apparent that response set is a very important
factor since S is able to draw his
intrusions from the appropriate side of 
the list in both forward and backward 
learning. This implies that list sides 
form two discriminable classes of items 
to which such sets can be applied. An 
immediate implication is that intralist in- 
truders are not selected simply by their 
recency since both sides of the list are 
about equally recent in both directions 
of recall. The interesting possibility 
arises that recency operates in the selector 
mechanism for intralist (as opposed to 
extralist) intrusions as a class defining 
attribute rather than as an incrementor 
of response availability.

Response set cannot explain, however, 
why in lists in which associative strengths 
among S words, among R words, and
between S and R words are, on the average,
about equal (Exp. I and II), the
stimulus-intruder association is a potent
factor and response-intruder association
is very weak. An obvious possibility is
that it is the presence of the S word
which is the salient starting point for
the associative process, item by item.
If this is true, then it needs to be explained
why response association was
dominant in the Hardmann lists. In
these lists there were some strong asso- 
ciations among S words, some among R 
words, but not between S and R words.
The relative absence of the stimulus-as- 
sociation effect becomes attributable to 
the dearth of S-X possibilities, given a 
set to respond with some R word. This 
is further supported by the increased dif- 
ferentiation of list side with intruders 
coming from the response side in 90% 
of the cases. 

To account for the response-association
effect, one must assume that S moves
from the stimulus to an appropriate as- 
sociative cluster, and then draws out an 
available word from that cluster. That 
it is an associative cluster and not simply 
a concept cluster is shown by the fact 
that the response-association effect cut 
across some clusters and also was present 
in the second Hardmann list in which the 
stimulus was not a reliable cue to con- 
cept cluster. It appears, therefore, that 
associatively related words form clusters 
for which S may become set by the stim- 
ulus word in much the same way that he 
is set by the task to respond from the 
appropriate side of the list. 

This analysis assumes that there is a 
mediating response stage in which the 
attributes of the associative cluster are 
represented before the specific response 
emerges. These attributes might include 
those measured by the semantic differen- 
tial but they are almost certainly more 
varied and extensive. For example, sound 
similarities show up in the extralist 
intrusions studied by Underwood and 
Hughes (1950). Similar perceptual at- 
tributes were found to operate in the 
“tip of the tongue” phenomenon studied 
by Brown and McNeill (1966). The 
variety of factor dimensions determined 
in different word sets by Deese (1965) 
illustrates “semantic” attributes which also
could be used to give a more precise 
definition of associative clusters than that 
provided by semantic differential profiles.

In summary, association appears to play 
two different roles in word-word learning.
All other things equal, the association of
a stimulus word with the various response
words is the dominating effect.
However, if associations are strong among 
response words and weak between stim- 
ulus and response words, then associative 
clusters are formed and used as mediating 
steps to response selection. In general,
it appears unprofitable to view even a 
heterogeneous word list as a collection of 
individual items varying only in availabil- 
ity. Rather, it appears that Ss capitalize 
flexibly and rapidly on the associative 
possibilities deriving from natural lan- 
guage experience or semantic relation- 
ships. 